Let's get going.
In Watson Analytics Refine, it is easy to test our data quality column by column (click Refine, select our file, then click on the Data Metrics icon, which is shown as follows):
It would seem that there are plenty of columns that have very high (98 or above) data quality scores, for example, Advertisement ID, Age Range, Ethnic, and so on. This not only gives us confidence in whatever predictions Watson may come up with, but also affords us the ability to further refine our data to focus more on the specific needs or interests we may have.
For example, suppose we want to establish personal recommendations based upon user behavior of a certain age range? Or perhaps user ethnicities?
Initially, we are going to target those users who fall into the following categories:
- Retirees (65 -74 years old and older than 75 years too)
- Caucasian ...