4. Collecting Text Data from the Web
Learning Objectives
By the end of this chapter, you will be able to:
- Extract and process data from web pages
- Describe different kinds of semi-structured data, such as JSON and XML
- Extract real-time data using Application Programming Interfaces
- Extract data from various file formats
In this chapter, you will learn how to collect text data from web pages, from APIs, and from files in different formats.
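As a small preview of the semi-structured formats named in the objectives, the sketch below reads the same record from a JSON string and from an XML string using only the Python standard library. The sample record, its field names (title, tags, tag), and the variable names are made up for illustration and do not come from any real data source.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical sample data: the same record expressed in JSON and in XML.
json_text = '{"title": "Sample", "tags": ["nlp", "web"]}'
xml_text = "<article><title>Sample</title><tag>nlp</tag><tag>web</tag></article>"

# JSON maps directly onto Python dictionaries and lists.
record = json.loads(json_text)
json_title = record["title"]
json_tags = record["tags"]

# XML is parsed into a tree of elements that is navigated by tag name.
root = ET.fromstring(xml_text)
xml_title = root.findtext("title")
xml_tags = [tag.text for tag in root.findall("tag")]

print(json_title, json_tags)
print(xml_title, xml_tags)
```

Both formats carry the same information here. JSON values arrive as native Python types, while XML values have to be pulled out of the element tree explicitly.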