Identifying gold mines in data for decision making
As a first step, before we dig deeper into the data exploration and analysis phase, we need to identify the gold mines in data. In the previous chapter, we designed the heuristic-driven hypotheses (HDH) while defining the problem. We now need to revisit that list and explore it to understand whether we are in a position to solve the problem using the data. We can do this by examining and validating the data sources for the identified hypotheses. If we do not have the data to prove or disprove the majority of our important hypotheses, proceeding any further with the current approach would add no value. If the data is available, we can get our hands dirty with code for the solution.
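The availability check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Hypothesis` class, the `coverage` and `worth_proceeding` functions, the placeholder hypothesis names (`H1` to `H3`), and the field names are all hypothetical and do not come from the previous chapter, and "majority" is read here as more than half of the important hypotheses.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    """A hypothetical record for one heuristic-driven hypothesis."""
    name: str
    important: bool
    required_fields: set  # data fields needed to prove or disprove it


def coverage(hypotheses, available_fields):
    """Split the important hypotheses into testable and untestable lists."""
    important = [h for h in hypotheses if h.important]
    testable = [h for h in important if h.required_fields <= available_fields]
    untestable = [h for h in important if not h.required_fields <= available_fields]
    return testable, untestable


def worth_proceeding(hypotheses, available_fields):
    """Return True when more than half of the important hypotheses have data."""
    testable, untestable = coverage(hypotheses, available_fields)
    total = len(testable) + len(untestable)
    return total > 0 and len(testable) > total / 2


# Placeholder inputs, purely for illustration.
hypotheses = [
    Hypothesis("H1", important=True, required_fields={"field_a", "field_b"}),
    Hypothesis("H2", important=True, required_fields={"field_c"}),
    Hypothesis("H3", important=True, required_fields={"field_a"}),
    Hypothesis("H4", important=False, required_fields={"field_d"}),
]
available_fields = {"field_a", "field_b"}

testable, untestable = coverage(hypotheses, available_fields)
print("Testable:", [h.name for h in testable])
print("Untestable:", [h.name for h in untestable])
print("Proceed:", worth_proceeding(hypotheses, available_fields))
```

In practice, the set of available fields would come from inspecting the actual data sources, and the threshold for proceeding is a judgment call rather than a fixed rule.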
Examining data sources for the hypotheses
If we take a look at the Prioritize and structure hypotheses based on the availability of data section in the previous chapter, we can see that we have listed a couple of hypotheses that could be potential...