Introduction
In the last chapter, we learned about developing a simple classifier using feature extraction methods. We also covered different algorithms that fall under supervised and unsupervised learning. In this chapter, you will learn about collecting data by scraping web pages and then processing it. You will also learn how to handle various kinds of semi-structured data, such as JSON and XML. We will cover different methods for extracting data using Application Programming Interfaces (APIs). Finally, we will explore different ways to extract data from different types of files.