Dataset
For our case study on customer segmentation using clustering, we will use a dataset from the UCI Machine Learning Repository that contains transactions from a UK-based online retail store. The store has shared its data with UCI, and the dataset is freely available on the repository's website. The data consists of the transactions that different customers made on the online store. The transactions originate from several countries, and the dataset is reasonably large (thousands of rows). Let's go through the attributes of the dataset:
Attribute name | Description |
---|---|
InvoiceNo | Invoice number; a number uniquely assigned to each transaction |
StockCode | Product (item) code; a 5-digit integral number uniquely assigned to each distinct product |
Description | Product (item) name |
Quantity | Quantity of the item purchased in a single transaction |
InvoiceDate | Date of the transaction |
UnitPrice | Price of the item (in pounds sterling) |
CustomerID | Unique ID of the person making the transaction |
Country | Country from where... |
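
To make the attribute list concrete, here is a minimal sketch of how rows with these attributes could be parsed into typed records. It assumes the column headers follow the UCI file's naming (`InvoiceNo`, `StockCode`, `Description`, `Quantity`, `InvoiceDate`, `UnitPrice`, `CustomerID`, `Country`). The sample rows, the date format, and the helper names `Transaction`, `parse_transactions`, and `spend_per_customer` are hypothetical and used only for illustration; they are not taken from the dataset.

```python
import csv
import io
from dataclasses import dataclass
from datetime import datetime

# Hypothetical sample rows, made up for illustration only (not real records).
SAMPLE_CSV = """InvoiceNo,StockCode,Description,Quantity,InvoiceDate,UnitPrice,CustomerID,Country
A0001,10001,SAMPLE ITEM A,6,2011-01-05 10:15,2.50,C001,United Kingdom
A0001,10002,SAMPLE ITEM B,2,2011-01-05 10:15,4.00,C001,United Kingdom
A0002,10001,SAMPLE ITEM A,1,2011-01-06 09:30,2.50,C002,France
"""


@dataclass
class Transaction:
    """One row of the transactions table, with typed fields."""
    invoice_no: str
    stock_code: str
    description: str
    quantity: int
    invoice_date: datetime
    unit_price: float  # in pounds
    customer_id: str
    country: str


def parse_transactions(text):
    """Parse CSV text into a list of Transaction records."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        Transaction(
            invoice_no=row["InvoiceNo"],
            stock_code=row["StockCode"],
            description=row["Description"],
            quantity=int(row["Quantity"]),
            # Assumed timestamp format; adjust to match the actual file.
            invoice_date=datetime.strptime(row["InvoiceDate"], "%Y-%m-%d %H:%M"),
            unit_price=float(row["UnitPrice"]),
            customer_id=row["CustomerID"],
            country=row["Country"],
        )
        for row in reader
    ]


def spend_per_customer(transactions):
    """Sum quantity * unit price for each customer ID."""
    totals = {}
    for t in transactions:
        amount = t.quantity * t.unit_price
        totals[t.customer_id] = totals.get(t.customer_id, 0.0) + amount
    return totals


transactions = parse_transactions(SAMPLE_CSV)
totals = spend_per_customer(transactions)
print(totals)
```

Multiplying the quantity by the item price gives the amount for each row, and grouping those amounts by customer ID yields a simple per-customer figure of the kind a segmentation step could build on.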