You're reading from Python for Security and Networking Leverage Python modules and tools in securing your network and applications

Product type Paperback

Published in Jun 2023

Publisher Packt

ISBN-13 9781837637553

Length 586 pages

Edition 3rd Edition

Languages

Python

Concepts

Information Security

Author (1):

José Manuel Ortega

Extracting metadata with PyPDF2

We will start with PyPDF2, which can be installed with the following command:

$ pip install PyPDF2

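This module lets us extract document information using the PdfFileReader class and its getDocumentInfo() method, which returns a dictionary containing the document's metadata. Note that these are the names used by PyPDF2 releases before 3.0; from version 3.0 onward the class is called PdfReader and the same information is exposed through its metadata attribute.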
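As a small illustration of working with such a dictionary, the sketch below turns a document-information mapping into printable lines. It assumes the keys are PDF names such as /Title or /Author; the function name format_document_info and the sample values are hypothetical and stand in for what getDocumentInfo() would return for a real file.

```python
def format_document_info(info):
    """Return 'label: value' lines for a PDF document-information mapping."""
    lines = []
    for key, value in info.items():
        # Info keys are PDF names such as '/Author'; drop the leading slash.
        label = key[1:] if key.startswith("/") else key
        lines.append(f"{label}: {value}")
    return lines


if __name__ == "__main__":
    # Hypothetical sample values, not taken from any real document.
    sample_info = {
        "/Title": "Example document",
        "/Author": "Jane Doe",
        "/Producer": "Example PDF writer",
    }
    for line in format_document_info(sample_info):
        print(line)
```

Keeping the formatting separate from the reading step means the same helper can be reused whichever PyPDF2 version produced the dictionary.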

We can start by extracting the number of pages with the getNumPages() method of the PdfFileReader class. Alternatively, we can parse the output of the pdfinfo command-line tool to obtain the same information. You can find the following code in the get_num_pages_pdf.py file in the pypdf2 folder:

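from subprocess import check_output
from PyPDF2 import PdfFileReader

# Option 1: count the pages with PyPDF2 (pre-3.0 API names)
with open('pdf/XMPSpecificationPart3.pdf', 'rb') as pdf_file:
    print(PdfFileReader(pdf_file).getNumPages())

# Option 2: parse the output of the pdfinfo command-line tool
def get_num_pages(pdf_path):
    output = check_output(["pdfinfo", pdf_path]).decode()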
    pages_line = [line for line in output.splitlines() if "Pages:" in line]...
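The parsing step of the pdfinfo approach can be sketched on its own, without running the external command. This is a minimal sketch and not the book's script: it assumes pdfinfo prints a line of the form "Pages:" followed by an integer, and the function name parse_page_count and the sample text are illustrative.

```python
import re


def parse_page_count(pdfinfo_output):
    """Return the integer on the 'Pages:' line of pdfinfo-style text, or None."""
    for line in pdfinfo_output.splitlines():
        # Match only lines that start with 'Pages:' followed by digits.
        match = re.match(r"^Pages:\s+(\d+)\s*$", line)
        if match:
            return int(match.group(1))
    return None


if __name__ == "__main__":
    # Hypothetical sample text in the assumed pdfinfo layout.
    sample_output = "Title:          Example\nPages:          12\nEncrypted:      no\n"
    print(parse_page_count(sample_output))
```

Returning None when no "Pages:" line is present lets the caller decide how to handle unexpected output instead of failing with an IndexError.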