Chapter materials
We will be building a simulation package to generate the data for this chapter; it is on GitHub at https://github.com/stefmolin/login-attempt-simulator/tree/2nd_edition. This package was installed from GitHub when we set up our environment back in Chapter 1, Introduction to Data Analysis; however, you can follow the instructions in Chapter 7, Financial Analysis – Bitcoin and the Stock Market, to install a version of the package that you can edit.
The repository for this chapter, which can be found at https://github.com/stefmolin/Hands-On-Data-Analysis-with-Pandas-2nd-edition/tree/master/ch_08, has the notebook we will use for our actual analysis (anomaly_detection.ipynb), the data files we will be working with in the logs/ folder, the data used for the simulation in the user_data/ folder, and the simulate.py file, which contains a Python script that we can run on the command line to simulate the data for the chapter.