Select fields processor
Renames and reorganizes your fields.
The Select fields processor allows you to manipulate the schema of the input flow and keep only the relevant fields depending on your needs. You can move fields around and rename them.
Usage
The Select fields processor requires one input flow and can generate only one output flow.
Properties
Properties to configure to select and restructure your data.
Property | Configuration |
---|---|
Distinct | Enable this option to return only different (distinct) values while selecting your fields. If this option is disabled, all duplicate values will be kept in the output. |
Select all | Select this checkbox to select all fields at once. It is particularly useful if you have a large number of fields to edit. |
Move (= icon) | Hold this icon to move your field up and down in the existing schema. The fields will be reorganized once you save your changes. |
Edit icon | Select this icon, which is displayed when hovering over the field name, to rename your field. |
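To illustrate the effect of the Distinct option, here is a minimal Python sketch. It is not the processor's implementation: the `select_fields` helper and the sample field names are hypothetical, and it assumes Distinct removes duplicate records after field selection, in the manner of SQL `SELECT DISTINCT`.

```python
def select_fields(records, fields, distinct=False):
    """Keep only `fields` from each record; optionally drop duplicate rows.

    Hypothetical helper for illustration only, not the processor's API.
    """
    seen = set()
    output = []
    for record in records:
        # Project the record onto the selected fields, in the given order.
        projected = tuple(record[name] for name in fields)
        if distinct:
            if projected in seen:
                continue  # an identical row was already emitted
            seen.add(projected)
        output.append(dict(zip(fields, projected)))
    return output


# Hypothetical sample data.
rows = [
    {"Region": "Europe", "Item": "Snacks", "Units": 10},
    {"Region": "Europe", "Item": "Snacks", "Units": 25},
    {"Region": "Asia", "Item": "Snacks", "Units": 10},
]

with_duplicates = select_fields(rows, ["Region", "Item"])
without_duplicates = select_fields(rows, ["Region", "Item"], distinct=True)
```

With Distinct disabled, the two identical Europe/Snacks rows are both kept. With it enabled, only the first one remains.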
To rename the processor or edit its description, click the Edit icon next to the processor name in the Properties panel.
Example
The source data in this example contains sales information, such as number of units sold, costs, prices, revenue and profits.
Three field names, Total_cost, Total_revenue and Total_profit, use a different format and case from the rest, and the dataset would be easier to read with these fields grouped together at the end. You can use the Select fields processor to reorganize and rename your fields to improve the consistency of your data.
In the processor properties, select the Select all checkbox so that all the fields are kept in the output flow. Hover over each field to rename and click the Edit icon to edit the three field names as follows:
- Total_cost as Total Cost
- Total_revenue as Total Revenue
- Total_profit as Total Profit
Click the Submit button or press Enter.
Use the = icon to drag and drop the three fields to the end of the list in this order: Total Revenue, Total Cost, Total Profit.
The output of the data flow will be easier to read and the fields will have a consistent naming convention.
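The rename and reorder steps of this example can be sketched in plain Python. This only mirrors the transformation on a single record and is not how the processor is implemented. The `restructure` helper, the other field names (`Units Sold`, `Unit Price`) and the values are hypothetical.

```python
# New names for the three fields, as given in the example.
RENAMES = {
    "Total_cost": "Total Cost",
    "Total_revenue": "Total Revenue",
    "Total_profit": "Total Profit",
}

# Fields to place at the end of the schema, in this order.
MOVE_TO_END = ["Total Revenue", "Total Cost", "Total Profit"]


def restructure(record):
    """Rename the three fields, then move them to the end of the record."""
    renamed = {RENAMES.get(name, name): value for name, value in record.items()}
    # Fields that keep their original relative position.
    leading = {k: v for k, v in renamed.items() if k not in MOVE_TO_END}
    # Fields appended at the end in the requested order.
    trailing = {name: renamed[name] for name in MOVE_TO_END}
    return {**leading, **trailing}


# Hypothetical input record; only the three Total_* names come from the example.
sample = {
    "Units Sold": 100,
    "Total_cost": 250.0,
    "Unit Price": 5.0,
    "Total_revenue": 500.0,
    "Total_profit": 250.0,
}

result = restructure(sample)
```

Because Python dictionaries preserve insertion order, the field order of `result` reflects the reorganized schema, and every value is carried over unchanged.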