Regression
Now that we have seen the machine learning workflow, we will take two widely used types of machine learning algorithms: regression and classification; both employ supervised learning to train the models. The entire theme of this book revolves around these two types of algorithms. The Beijing PM2.5 dataset will be used extensively in demonstrating both these types. The dataset will help in understanding how one can convert a regression problem into a classification problem and vice versa.
Simple and Multiple Linear Regression
Regression is one of the most useful and essential tools in analytics and econometrics (the branch of economics concerned with the use of mathematical methods, especially statistics, in describing economic systems). In many ways, modern machine learning has its roots in statistics, and one can attribute that mostly to Sir Francis Galton's work. Galton was an English Victorian-era statistician and polymath with deep interest and expertise in fields such as genetics...