Predicting news popularity in social media
In this section, we will create a model that will find out the popularity score for an article on social media platforms, based on its text. For this, we will train the model with a News Popularity dataset collected between 2015 and 2016 (https://archive.ics.uci.edu/ml/datasets/News+Popularity+in+Multiple+Social+Media+Platforms).
As we want to approximate a score (number of likes), we will use a text regressor for this task.
In the next screenshot, you can see some samples taken from this dataset:
This notebook with the complete source code can be found at https://colab.research.google.com/github/PacktPublishing/Automated-Machine-Learning-with-AutoKeras/blob/main/Chapter05/Chapter5_SpamDetector.ipynb.
We will now explain the relevant code cells of the notebook in detail, as follows:
- Getting the articles dataset: Before training, we...