In this section, we will discuss how to conduct product analytics using the pandas and matplotlib packages in Python. Readers who would like to use R instead of Python for this exercise can skip to the next section. We will start by analyzing the overall time series trends in revenue and the number of purchases, along with the purchase patterns of repeat customers, and then we will move on to analyzing trends in the products being sold.
For this exercise, we will use a publicly available dataset from the UCI Machine Learning Repository, which can be found at this link: http://archive.ics.uci.edu/ml/datasets/online+retail#. From this link, you can download the data in Microsoft Excel format, in a file named Online Retail.xlsx. Once you have downloaded this data, you can load it into your Jupyter Notebook...
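As a minimal sketch of this loading step, the workbook can be read with the pandas read_excel function. The sketch assumes that the downloaded file sits in the notebook's working directory and that an Excel engine such as openpyxl is installed. The helper name load_online_retail and its default arguments are hypothetical choices for illustration; they are not part of pandas or of the dataset.

```python
import pandas as pd


def load_online_retail(path="Online Retail.xlsx", sheet_name=0):
    """Read the downloaded Excel workbook into a pandas DataFrame.

    Assumptions (not stated in the text): the file is in the current
    working directory, and the data is on the first sheet (sheet_name=0).
    """
    return pd.read_excel(path, sheet_name=sheet_name)


# Typical use in a notebook cell, once the file has been downloaded:
# df = load_online_retail()
# df.head()   # inspect the first few rows
# df.shape    # (number of rows, number of columns)
```

Wrapping the call in a small function keeps the file path in one place, so the same cell can be rerun against a differently named or relocated copy of the file by passing another path.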