Recipe Difficulty: Easy
Python Version: 2.7 or 3.5
Operating System: Any
While PDF documents can represent a wide variety of media, including images, text, and forms, they contain structured embedded metadata in the Extensible Metadata Platform (XMP) format that can provide us with some additional information. Through this recipe, we access a PDF using Python and extract metadata describing the creation and lineage of the document.