Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Learning Python Networking A complete guide to build and deploy strong networking capabilities using Python 3.7 and Ansible

Product type Paperback

Published in Mar 2019

Publisher

ISBN-13 9781789958096

Length 490 pages

Edition 2nd Edition

Languages

Python

Tools

Ansible

Concepts

Networking

Authors (3):

Dr. M. O. Faruque Sarker

José Manuel Ortega

Sam Washington

View More author details

Table of Contents (19) Chapters

Preface

1. Section 1: Introduction to Network and HTTP Programming FREE CHAPTER

2. Network Programming with Python

3. Programming for the Web with HTTP

4. Section 2: Interacting with APIs, Web Scraping, and Server Scripting

5. Application Programming Interface in Action

6. Web Scraping with BeautifulSoup and Scrapy

7. Engaging with Email

8. Interacting with Remote Systems

9. Section 3: IP Address Manipulation and Network Automation

10. Working with IP and DNS

11. Implementing IPv6 and Address Manipulation

12. Performing Network Automation with Python and Ansible

13. Section 4: Sockets and Server Programming

14. Programming with Sockets

15. Designing Servers and Asynchronous Programming

16. Designing Applications on the Web

17. Assessment

18. Another Book You May Enjoy

Leave a review - let other readers know what you think

Web Scraping with BeautifulSoup and Scrapy

When we want to extract the content of a web page by automating the extraction of information, we often find that the website does not offer any API to obtain the data you need and it is necessary to resort to scraping techniques to recover data automatically. Some of the most powerful tools can be found in Python 3.7, among which we shall highlight BeautifulSoup and Scrapy.

Scrapy is a framework written in Python for the extraction of data in an automated way that can be used for a wide range of applications, such as the processing of data mining.

The following topics will be covered in this chapter:

Introduction to web scraping
Extracting information from web pages and parsing HTML with BeautifulSoup
Introduction to Scrapy components and architecture
Scrapy as a framework for performing web crawling processes and data analysis
Working...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (3)

José Manuel Ortega

José Manuel Ortega is a software engineer, focusing on new technologies, open source, security, and testing. His career goal has been to specialize in Python and security testing projects. In recent years, he has developed an interest in security development, especially in pentesting with Python. Currently, he is working as a security tester engineer and his functions in the role involves the analysis and testing of the security of applications in both web and mobile environments. He has taught at university level and collaborated with the official school of computer engineers. He has also been a speaker at various conferences. He is eager to learn about new technologies and loves to share his knowledge with the community.

See other products by José Manuel Ortega

Sam Washington

Sam Washington currently works at University College London as a systems administrator in the platform integration team of the central IT department, supporting a variety of web hosting and network services. He enjoys the daily challenges of managing the demands of full-stack enterprise web applications and looking for ways to employ new technologies to improve services and workflows. He has been using Python for professional and personal projects for over 10 years.

See other products by Sam Washington

Dr. M. O. Faruque Sarker

Dr. M. O. Faruque Sarker is a software architect based in London; he has shaped various Linux and open source software solutions mainly on cloud computing platforms for various institutions. Over the past 10 years, he has led numerous Python software development and cloud infrastructure automation projects. In 2009, he started using Python and shepherded a fleet of miniature E-puck robots at the University of South Wales, Newport, UK. Later, he was invited to work on the Google Summer of Code (2009/2010) programs to contribute to the BlueZ and Tahoe-LAFS open source projects. He is the author of Python Network Programming Cookbook, Packt Publishing and received his PhD in multirobot systems at the University of South Wales.

See other products by Dr. M. O. Faruque Sarker