Dataset overview
Let’s explore the dataset we’re about to use. Like we did in other chapters on machine learning, we start with a dataset, this one being a dataset of Amazon book reviews.
The dataset contains information about different products and their reviews. It includes the following columns:
marketplace
(string): Location of the productcustomer_id
(string): Unique ID of the customerreview_id
(string): Review IDproduct_id
(string): Unique ID of the productproduct_parent
(string): Parent productproduct_title
(string): Title of the product reviewedproduct_category
(string): Different product categoriesstar_rating
(int): Rating of the product out of 5helpful_votes
(int): Number of helpful votes for the producttotal_votes
(int): Total number of votes for the productreview_headline
(string): Heading of the reviewreview_body
(string): Content of the reviewreview_date
(string...