Profiling data with summary and statistical aggregations
Once you have your dataset, getting an idea of what values are in it allows you to understand what the data looks like. Knowing what the data is like provides a reference when you are ready to compare across runs.
In Alteryx, there are a number of ways in which to investigate the range of values that appear in the dataset. In this section, we are going to look at the following three areas:
- What is the variation in the dataset and the size of the range?
- How is the dataset distributed?
- What proportion of your records is missing values?
In each of these areas, Alteryx provides tools for answering the questions quickly and also has methods for those answers to be persisted in your logging systems.
Investigating the variation and size range of your dataset
The first area to investigate is the spread of the data. Understanding the aggregated spread of the records in each field will give you an understanding...