Technical requirements
A web browser (Google Chrome or Mozilla Firefox) will be required and we will be using JupyterLab for the Python code.
Please refer to the Setting things up and Creating a virtual environment sections in Chapter 2 to continue setting up and using the environment created. Refer to https://pypdf2.readthedocs.io/en/3.0.0/user/installation.html to install PyPDF2.
The Python libraries that are required for this chapter are as follows:
requests
re
pypdf2
The code files for this chapter are available online in this book’s GitHub repository: https://github.com/PacktPublishing/Hands-On-Web-Scraping-with-Python-Second-Edition/tree/main/Chapter09.