Chapter materials
The materials for this chapter can be found at https://github.com/stefmolin/Hands-On-Data-Analysis-with-Pandas-2nd-edition/tree/master/ch_11. In this chapter, we will be revisiting attempted login data; however, the simulate.py script has been updated to allow additional command-line arguments. We won't be running the simulation this time, but be sure to take a look at the script and check out the process that was followed to generate the data files and create the database for this chapter in the 0-simulating_the_data.ipynb notebook. The user_data/ directory contains the files used for this simulation, but we won't be using them directly in this chapter.
The simulated log data we will be using for this chapter can be found in the logs/ directory. The logs_2018.csv and hackers_2018.csv files are logs of login attempts and a record of hacker activity from all 2018 simulations, respectively. Files with the hackers prefix are treated as the labeled data we...