Before we start coding, we should probably set up our development environment. For data scientists and analysts using Python, the Jupyter Notebook is, by far, the most popular tool for development. Therefore, we strongly advise that you use this environment.
We will also need to download the pandas library. The easiest way to obtain both is to download Anaconda. Anaconda is a distribution that comes with the Jupyter software and the SciPy packages (which includes pandas).
The next step is to create a new folder (I'm going to name it RecoSys) in your desired location. This will be the master folder that contains all the code we write as part of this book. Within this folder, create another folder named Chapter2, which will contain all the code we write as part of this chapter...