Grouping and summarizing your data
In this recipe, we’ll be creating a design pattern, or a combination of tools, that will allow us to solve a very common format problem found in data preparation.
We have a dataset of registered flight delays (in minutes). The data has two records per registered flight. The first row is the delay registered at departure, and the second is the delay registered at arrival.
Figure 8.17: Dataset preview
There are many ways of solving this situation (sampling alternating rows, filtering, and more), but we are going to do the easiest one, based on the Multi-Row Formula tool, a very powerful tool that Alteryx provides.
Getting ready
We prepared this test set for you to follow along with the recipe: https://github.com/PacktPublishing/Alteryx-Designer-Cookbook/tree/main/ch8/Recipe3
As you can see in the data preview, we again used a dataset that is the result of previous preparation tasks, and we face the...