Auto-validating data
Data validation can be a time-consuming task where we have to determine whether a value is accurate or not. One of the most typical data validation issues relates to misspelling and labeling the same thing differently. For example, the city of New York might be present in your data more than once, with different labels:
- New York
- NY
- NYC
- New York, NY
- New York, New York
- New York, US
- And so on…
To make the process of validating data easier, Tableau Prep uses data roles. A data role compares your data against a list of known values or specific patterns. This allows us to quickly identify problematic values in our data and take action to resolve them.
Getting ready
To follow along with this recipe, download the Sample Files 3.4 folder from this book's GitHub repository.
How to do it…
Open up Tableau Prep and connect to the User List.csv file from the Sample Files 3.4 folder and follow the steps:
...