Three Python scripts will be used for the demonstrations in this chapter. The first of these scripts, regex_intro.py, will be a program to introduce and demonstrate the use of regular expressions in Python. The second, explore_addresses.py, will be a simple program to explore the dataset and look for patterns. The third, extract_street_names.py, will be a program to extract the street names from the original dataset and output a revised dataset with a column for street names. The finished product for each of these files is available in the code folder of the reference material. All of the reference material can be found at the following link: https://goo.gl/8S58ra.
Logistical overview
Data
For the...