You're reading from Mastering Python for Networking and Security Leverage the scripts and libraries of Python version 3.7 and beyond to overcome networking and security issues

Product type Paperback

Published in Jan 2021

Publisher Packt

ISBN-13 9781839217166

Length 538 pages

Edition 2nd Edition

Languages

Python

Tools

Tor

Concepts

Network Security

Author (1):

José Manuel Ortega

Table of Contents (22) Chapters

Preface

1. Section 1: The Python Environment and System Programming Tools

2. Chapter 1: Working with Python Scripting

3. Chapter 2: System Programming Packages

4. Section 2: Network Scripting and Extracting Information from the Tor Network with Python

5. Chapter 3: Socket Programming

6. Chapter 4: HTTP Programming

7. Chapter 5: Connecting to the Tor Network and Discovering Hidden Services

8. Section 3: Server Scripting and Port Scanning with Python

9. Chapter 6: Gathering Information from Servers

10. Chapter 7: Interacting with FTP, SFTP, and SSH Servers

11. Chapter 8: Working with Nmap Scanner

12. Section 4: Server Vulnerabilities and Security in Python Modules

13. Chapter 9: Interacting with Vulnerability Scanners

14. Chapter 10: Identifying Server Vulnerabilities in Web Applications

15. Chapter 11: Security and Vulnerabilities in Python Modules

16. Section 5: Python Forensics

17. Chapter 12: Python Tools for Forensics Analysis

18. Chapter 13: Extracting Geolocation and Metadata from Documents, Images, and Browsers

19. Chapter 14: Cryptography and Steganography

20. Assessments

21. Other Books You May Enjoy

Extracting metadata from PDF documents

Document metadata is information stored within a file that describes the file itself, such as the software used to create the document, the name of the author or organization, and the date and time the file was created or modified.

Each application stores metadata differently, so the amount of metadata in a document depends largely on the software used to create it.

In this section, we will review how to extract metadata from PDF documents with the PyPDF2 module. The module can be installed directly with pip, since it is published on the Python Package Index (PyPI):

$ pip3 install PyPDF2
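To confirm that the installation worked, the installed version can be read from Python itself. The following is a minimal sketch rather than part of the book's code: the helper name `installed_version` is hypothetical, and it assumes Python 3.8 or later, where `importlib.metadata` is part of the standard library.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints the version string if PyPDF2 is installed, otherwise None.
print(installed_version("PyPDF2"))
```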

The latest version of this module is listed at https://pypi.org/project/PyPDF2. Once the module is installed, we can inspect the names it exposes from the Python interpreter:

>>> import PyPDF2
>>> dir(PyPDF2)
['PageRange', 'PdfFileMerger', 'PdfFileReader...
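As an illustration of where this leads, the sketch below reads the document information dictionary of a PDF file and returns it as a plain dictionary. It is a sketch under stated assumptions, not necessarily the approach the chapter goes on to use: the function names `clean_metadata` and `read_pdf_metadata` are hypothetical, and the code assumes that newer PyPDF2 releases expose `PdfReader` with a `metadata` property, while older releases expose `PdfFileReader` with `getDocumentInfo()`.

```python
def clean_metadata(info):
    """Convert a PDF info mapping into a plain dict with readable keys.

    PDF info keys usually start with a slash (for example "/Author"),
    so the leading slash is stripped for display.
    """
    if not info:
        return {}
    # Index by key so PyPDF2 can resolve indirect objects on access.
    return {str(key).lstrip("/"): str(info[key]) for key in info}


def read_pdf_metadata(path):
    """Return the metadata of the PDF at `path` as a plain dict."""
    # Imported here so clean_metadata stays usable without PyPDF2.
    import PyPDF2

    with open(path, "rb") as handle:
        if hasattr(PyPDF2, "PdfReader"):
            # Assumption: newer PyPDF2 releases provide this class.
            info = PyPDF2.PdfReader(handle).metadata
        else:
            # Assumption: older releases, as in the dir() listing above.
            info = PyPDF2.PdfFileReader(handle).getDocumentInfo()
        # Convert while the file is still open.
        return clean_metadata(info)
```

Calling `read_pdf_metadata("example.pdf")`, where `example.pdf` is a placeholder file name, should return entries such as the author or the producing software when the document stores them. A PDF without an information dictionary yields an empty dictionary.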
