The best way to get started is by understanding the bigger picture—gauging the magnitude of the work ahead of us. In this sense, we have identified two broad tasks:
- Setting up the prerequisite software.
- Developing two pipelines, starting with data collection and building a workflow sequence that could end with predictions. Those pipelines are as follows:
- A Random Forests pipeline
- A logistical regression pipeline
We will talk about setting up the prerequisite software in the next section.