Conventions
In this book, you will find a number of styles of text that distinguish between different kinds of information. Here are some examples of these styles, and an explanation of their meaning.
Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: "The prettify()
method can be called either on a Beautiful Soup object or any of the Tag
objects."
A block of code is set as follows:
html_markup = """<html> <body>& & ampersand ¢ ¢ cent © © copyright ÷ ÷ divide > > greater than </body> </html> """ soup = BeautifulSoup(html_markup,"lxml") print(soup.prettify())
When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:
UserWarning: "http://www.packtpub.com/books" looks like a URL. Beautiful Soup is not an HTTP client. You should probably use an HTTP client to get the document behind the URL, and feed that document to Beautiful Soup
Any command-line input or output is written as follows:
sudo easy_install beautifulsoup4
New terms and important words are shown in bold. Words that you see on the screen, in menus or dialog boxes for example, appear in the text like this: "The output methods in Beautiful Soup escape only the HTML entities of >,<, and & as >, <, and &."
Note
Warnings or important notes appear in a box like this.
Tip
Tips and tricks appear like this.