Data analysis and insights
Remember what we said about the importance of having a question in mind when starting to work on a data science project? This is especially true during this phase, where we explore our dataset and extract insights, which should revolve around our initial question – the connection between typing speed and whether a patient has PD or not.
Throughout this section, we will be working with the EDA.ipynb file, located in the notebooks folder of our current project. In the following subsections, we will be looking at the code included in this notebook. Go ahead and open this Jupyter notebook in your PyCharm editor, or, if you are following our discussions and entering your own code, create a new Jupyter notebook.
Starting the notebook and reading in our processed data
Remember that when you open a Jupyter notebook in PyCharm, you can see the code, but Jupyter won’t run unless you click the Run button. You can see PyCharm ready for this...
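As a rough sketch of what the first cell of a notebook like this might do, the snippet below reads a processed CSV file into a pandas DataFrame and takes a first look at it. The file name, column names, and values are placeholders invented for illustration, not the project's actual data; a tiny stand-in file is written first so that the cell can run on its own.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical stand-in for the processed dataset: the column names and
# values below are made up purely so that this cell is runnable.
toy = pd.DataFrame(
    {
        "typing_speed": [210.5, 180.2, 250.1, 175.8],
        "has_parkinsons": [False, True, False, True],
    }
)
data_path = Path(tempfile.mkdtemp()) / "processed_example.csv"
toy.to_csv(data_path, index=False)

# Read the processed file back in, as a first notebook cell might.
df = pd.read_csv(data_path)

# A first look at the data: its size, column types, and group sizes.
print(df.shape)
print(df.dtypes)
print(df["has_parkinsons"].value_counts())
```

In our own notebook, we would skip the stand-in file and point pd.read_csv at wherever the processed data lives in the project.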