Polynomial regression
The previous analysis centered on finding a linear equation to represent a given dataset. However, many datasets derive from nonlinear relationships. Fortunately, there are alternative mathematical models from which to choose.
The simplest nonlinear functions are polynomials: $y = f(x) = b_0 + b_1 x + b_2 x^2 + \cdots + b_d x^d$, where $d$ is the degree of the polynomial and $b_0, b_1, b_2, \ldots, b_d$ are the coefficients to be determined.
Of course, a linear function is simply a first-degree polynomial: $y = b_0 + b_1 x$. We have already solved that problem (in the previous derivation, we called the coefficients $m$ and $b$ instead of $b_1$ and $b_0$). We used the method of least squares to derive the formulas for the coefficients:
$$m = \frac{n\sum x_i y_i - \sum x_i \sum y_i}{n\sum x_i^2 - \left(\sum x_i\right)^2}$$
$$b = \frac{\sum y_i \sum x_i^2 - \sum x_i \sum x_i y_i}{n\sum x_i^2 - \left(\sum x_i\right)^2}$$
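As a minimal sketch, the coefficients can be computed directly from the sums that appear in the standard closed-form least-squares expressions, m = (nΣxy − ΣxΣy) / (nΣx² − (Σx)²) and b = (ΣyΣx² − ΣxΣxy) / (nΣx² − (Σx)²). The function name `linear_fit` and the sample data are hypothetical, chosen only for illustration.

```python
def linear_fit(xs, ys):
    """Return (m, b) for the least-squares line y = m*x + b."""
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_xx = sum(x * x for x in xs)

    # Shared denominator of both formulas; zero when all x values coincide.
    denom = n * sum_xx - sum_x ** 2
    if denom == 0:
        raise ValueError("all x values are equal; the slope is undefined")

    m = (n * sum_xy - sum_x * sum_y) / denom
    b = (sum_y * sum_xx - sum_x * sum_xy) / denom
    return m, b


# Hypothetical toy data, not taken from the text.
xs = [1, 2, 3, 4]
ys = [2, 4, 5, 8]
m, b = linear_fit(xs, ys)
print(f"m = {m}, b = {b}")  # m = 1.9, b = 0.0
```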
Those formulas were derived from the normal equations:
$$n\,b + \left(\sum x_i\right) m = \sum y_i$$
$$\left(\sum x_i\right) b + \left(\sum x_i^2\right) m = \sum x_i y_i$$
The equations were obtained by minimizing the sum of squares:
$$\sum_{i=1}^{n} \bigl(y_i - (m x_i + b)\bigr)^2$$
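The minimization can be checked numerically. The sketch below assumes the sum of squares has the usual form Σ(y − (mx + b))², evaluates it at coefficients that satisfy the normal equations for a small hypothetical dataset, and compares the result with the values obtained after nudging m and b slightly. The function name `sum_of_squares` and the data are illustrative assumptions.

```python
def sum_of_squares(m, b, xs, ys):
    """Sum of squared vertical distances from the points to y = m*x + b."""
    return sum((y - (m * x + b)) ** 2 for x, y in zip(xs, ys))


# Hypothetical toy data, not taken from the text.
xs = [1, 2, 3, 4]
ys = [2, 4, 5, 8]

# m = 1.9, b = 0.0 satisfy both normal equations for this data:
#   4*b + 10*m = 19   and   10*b + 30*m = 57
best = sum_of_squares(1.9, 0.0, xs, ys)

# Evaluate the same quantity at eight nearby (m, b) pairs.
nearby = [
    sum_of_squares(1.9 + dm, 0.0 + db, xs, ys)
    for dm in (-0.1, 0.0, 0.1)
    for db in (-0.1, 0.0, 0.1)
    if (dm, db) != (0.0, 0.0)
]

print(best)                             # approximately 0.7
print(all(s > best for s in nearby))    # True
```

Every perturbed pair gives a larger sum, which is what one expects at the minimum of a quadratic function of m and b.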
We can apply the same least squares method to find the best-fitting polynomial of any degree d for a given dataset, provided that d is less than...
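For a general degree d, minimizing the same kind of sum of squares leads to d + 1 normal equations in the d + 1 coefficients: for each j from 0 to d, the sum over k of b_k Σx^(j+k) equals Σx^j y. The sketch below is one way to build and solve those equations in plain Python, using Gaussian elimination with partial pivoting. It is an illustration under those assumptions, not the text's own implementation; the name `poly_fit` and the sample data are hypothetical.

```python
def poly_fit(xs, ys, d):
    """Return [b0, b1, ..., bd] for the least-squares polynomial of degree d."""
    size = d + 1

    # Augmented matrix of the normal equations:
    #   row j: sum_k b_k * sum(x^(j+k)) = sum(x^j * y)
    aug = [
        [sum(x ** (j + k) for x in xs) for k in range(size)]
        + [sum((x ** j) * y for x, y in zip(xs, ys))]
        for j in range(size)
    ]

    # Forward elimination with partial pivoting.
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("normal equations are singular for this data and degree")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, size):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, size + 1):
                aug[r][c] -= factor * aug[col][c]

    # Back substitution.
    coeffs = [0.0] * size
    for r in range(size - 1, -1, -1):
        tail = sum(aug[r][c] * coeffs[c] for c in range(r + 1, size))
        coeffs[r] = (aug[r][size] - tail) / aug[r][r]
    return coeffs


# Hypothetical data generated from y = 1 + 2x + 3x^2, so a degree-2 fit
# should recover coefficients close to [1, 2, 3].
xs = [0, 1, 2, 3, 4]
ys = [1, 6, 17, 34, 57]
coeffs = poly_fit(xs, ys, 2)
print([round(c, 6) for c in coeffs])
```

With d = 1 the same function reduces to the straight-line case, returning the intercept first and the slope second. Solving the normal equations directly can become numerically unstable for high degrees, so this approach is best suited to small d.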