Derived column transform
The Derived Column transform allows you to define a new column based on a math formula or SQL expression in which you can use other columns in the data, as well as constants and literals. For instance, to derive a “percentage” column from the columns "success" and "count", you can enter the SQL expression: "success * 100 / count || '%'".
Example result:
success | count | percentage |
---|---|---|
14 | 100 | 14% |
6 | 20 | 3% |
3 | 40 | 7.5% |
To add a Derived Column transform:
-
Navigate to your visual ETL job in Amazon SageMaker Unified Studio.
-
Choose the plus icon to open the Add nodes menu.
-
Under Transforms, choose Derived Column.
-
Select the diagram to add the node to your visual ETL job.
-
Select the node on the diagram to view details about the transform.
-
Under Name of derived column, enter the name of a new column that will be generated.
-
Under Column expression, enter a SQL expression to define the new column based on existing columns.