Skip to Main Content

Alteryx: Getting Started

This guide introduces users to some of the basic tools in Alteryx. Using data for Halloween candy, users will learn to import data into the workspace, cleanse data, and use basic Preparation, Join and Transform tools to create the basics of a workflow.

Preparation tools

Select tool       

The Select tool allows for the inclusion, exclusion, or reordering of data that pass through a workflow. This tool can also be used to modify the type and size of data, rename a column, or add a description.


To include a column in data, select the check box to the left of the column name. Deselect the check box to exclude the column.  There is also an "Unknown field" which can be used for adding new fields to work and is present for many of the tools. For the workflow,  select State,Top Candy and Pounds A.  Once this is done, click the Run button.

 

In checking the results, only the state, top candy for that state and the number of pounds are seen.

 

 

Filter tool          

The Filter tool queries records by using an expression and splits data into two outputs: True, where the data meets the specified criteria, and False, where the data does not meet the specified criteria. Use this tool to identify records in the data that meet a specified criteria. 

Select the type of filter to use

Basic filter: Use the basic filter to quickly build a simple query on a single column of data.

  1. Click Select column to select the data to filter.
  2. In the next drop down, select the operator to use depending on the data type of the selected column.
  3. Complete the expression by typing a value or selecting the available date values.

Custom filter: Use the custom filter to build a more complex expression or to query from multiple fields in the data stream.

Example 1.  Select the Basic filter, then select PoundsA as the data to filter.  Choose the >= from the menu of operators.  Then type 50000 in box for the value.  Click the Run button.

 

By clicking on the T anchor, the data which meets the criteria can be seen. Clicking on the F anchor reveals data that did not meet the criteria. 

 

 

Example 2. Select the Custom Filter.  In the Formula Expression box, type in the following formula: [PoundsC]<5000

Once the formula is entered, click RUN.  When the workflow has run, click on the output anchor to see the results.

Formula tool   

The Formula tool creates a new column, or updates an existing column by using one or more expressions to perform a variety of calculations and operations. 

Configure the tool

Using the Input Data tool, bring the datafile into the workspace.  Drag the Formula tool onto the workspace and connect it to the Input Data tool.

In the Configuration window, select an Output Column of data in Select Column; choose an existing column or add a new column.

  • Click on Select Column
  • Click + Add Column and type the new column name.
  • Click the expression editor and build an expression.

Once the workflow has been run, the Data Preview box displays the first row of data from the specified column with the expression applied.

To get started,  select +Add Column.  For the column name, type in Candy Total.

For the expression, type in [PoundsA]+[PoundsB]+[PoundsC]     When typing the expression, a dialog box will pop up and allow for selection of data columns that wiil be used to calculate the new column.

Once the expression is entered, click the Run button.

Once the workflow has been run, the Results pane should show the first rows of data.  Notice that a new column, Candy Total, is now the last column in the results.

Imputation tool   

The Imputation tool replaces a specified value within one or more numeric data fields with another specified value.  This is particularly heplful in replacing NULL values or empty spaces in datasets.

Configure the tool:

  1. Select the field to be imputed
  2. Incoming value to replace: Specify one of two choices.
    • Null()
    • User specified value: Use the text box to enter a value.
  3. Replace with value: Specify one of four choices.
    • Average: Average of all values, not including “Incoming value to replace”.
    • Median: Value separating the higher half from the lower half, or the “middle” value.
    • Mode: Value that appears most often.  
    • User specified value: Use the text box to enter a value.

Download the datafile CandyProduction to the desktop, then use the Input data tool to bring the file into the workspace.  This spreadsheet details the number of boxes of Mars candies packed.  Since there are blanks (NULL values) in the data, the Imputation tool can be used to change these values.  Drag the Imputation tool onto the workspace and connect it to the Input Data tool.

This connection will bring up the Configuration box which allows for selection of fields to correct.

In the Configuration box, select which fields to impute, which values to replace and what value to replace them with.   Replace all the NULL values with a zero (0).  Make the appropriate selections for this and click on RUN.

Once the workflow has run, view the results,  All the NULL values have been replaced with zero.

University Libraries

One Bear Place #97148
Waco, TX 76798-7148

(254) 710-6702