The project and analysis in the chapter will cover the data gathered from the Twitter feeds through the Twitter API. Working with the API, the user has a selection of different endpoints (functionalities). We will focus on two of the most popular: the streaming and the search endpoints (REST API). The first one gives access to real-time data, showing tweets as they are published (in fact the access is to the sample, not all tweets). The latter allows to query historical tweets (up to about a week), based on several criteria, which is more suitable for a static analysis. The following are the steps to gather the data from the Twitter feeds:
- Getting the data
- Data pull
- Data cleaning
Let us take a look at each one in detail.