In this chapter, we introduced the important concepts of linear models and described how linear regression works. In particular, we focused on the basic model and its main variants: Lasso, Ridge, and ElasticNet. They don't modify the internal dynamics, but work as normalizers for the weights to avoid common problems when the dataset contains unscaled samples. These penalties have specific peculiarities. While Lasso promotes sparsity, Ridge tries to find a minimum with the constraint that the weights must lie within a circle centered at the origin (whose radius is parametrized to increase/decrease the normalization strength). ElasticNet is a mix of both these techniques, and it tries to find a minimum where the weights are small enough and a certain degree of sparsity is achieved.
We also discussed advanced techniques such as RANSAC, which allows us to cope with outliers...