Generalized linear regression
Recall that in linear regression, we assume the following functional form between the dependent variable Y and independent variable X:
$$Y = \sum_{j=0}^{M} \beta_j \phi_j(X) + \epsilon$$
Here, $\{\phi_j(X)\}_{j=0}^{M}$ is a set of basis functions and $\beta = (\beta_0, \ldots, \beta_M)$ is the parameter vector. Usually, it is assumed that $\phi_0(X) = 1$, so $\beta_0$ represents an intercept or a bias term. Also, it is assumed that $\epsilon$ is a noise term distributed according to the normal distribution with mean zero and variance $\sigma^2$. We also showed that this results in the following equation:
$$P(Y \mid X, \beta, \sigma^2) = \mathcal{N}\left(Y \mid \beta^{T}\phi(X), \sigma^2\right)$$
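As a rough illustration of the model recalled above, the following Python sketch evaluates the weighted sum of basis functions and the corresponding Gaussian log density for a single observation. The polynomial basis, the function names (`basis`, `linear_predictor`, `gaussian_log_density`), and the parameter values are assumptions chosen for illustration only; they do not come from the text.

```python
import math


def basis(x):
    """Hypothetical polynomial basis (an assumption, not from the text).

    The first entry is the constant 1, so the first parameter acts as
    the intercept (bias) term.
    """
    return [1.0, x, x * x]


def linear_predictor(beta, x):
    """Weighted sum of the basis functions evaluated at x."""
    return sum(b * p for b, p in zip(beta, basis(x)))


def gaussian_log_density(y, x, beta, sigma2):
    """Log density of y under a normal distribution whose mean is the
    linear predictor at x and whose variance is sigma2."""
    mean = linear_predictor(beta, x)
    return -0.5 * math.log(2.0 * math.pi * sigma2) - (y - mean) ** 2 / (2.0 * sigma2)


# Illustrative parameter values (assumed): intercept 1, slope 2, curvature 0.5.
beta = [1.0, 2.0, 0.5]
mean_at_2 = linear_predictor(beta, 2.0)  # 1*1 + 2*2 + 0.5*4 = 7.0
log_p = gaussian_log_density(7.0, 2.0, beta, 1.0)
```

Because the noise has mean zero, the log density is largest when the observed value equals the linear predictor and falls off quadratically as the two move apart.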
One can generalize the preceding equation to incorporate not only the normal distribution for noise but any distribution in the exponential family (reference 1 in the References section of this chapter). This is done by defining the following equation:
$$g\left(E[Y \mid X]\right) = \beta^{T}\phi(X)$$
Here, $g$ is called a link function; it relates the expected value of $Y$ to the linear combination of basis functions. Well-known models such as logistic regression, log-linear models, and Poisson regression are special cases of the generalized linear model (GLM). For example, in the case of ordinary linear regression, the link function is the identity, $g(\mu) = \mu$. For logistic regression...
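The role of a link function can be sketched in a few lines of Python. The example below pairs three commonly used link functions with their inverses and maps a value of the linear combination back to the mean of the response. The identity link is the one named in the text for ordinary linear regression. The logit and log links are the standard canonical choices for logistic and Poisson regression, included here as an assumption because the text does not spell them out. The names `LINKS` and `mean_response` are hypothetical.

```python
import math


def identity(mu):
    """Identity link: g(mu) = mu (ordinary linear regression)."""
    return mu


def logit(mu):
    """Logit link: g(mu) = log(mu / (1 - mu)), defined for 0 < mu < 1."""
    return math.log(mu / (1.0 - mu))


def inv_logit(eta):
    """Inverse of the logit link (the logistic function)."""
    return 1.0 / (1.0 + math.exp(-eta))


# Each entry maps a link name to the pair (link, inverse link).
# The pairing of links to models is the standard convention, stated
# here as an assumption rather than taken from the text.
LINKS = {
    "identity": (identity, identity),  # ordinary linear regression
    "logit": (logit, inv_logit),       # logistic regression
    "log": (math.log, math.exp),       # Poisson regression
}


def mean_response(eta, link_name):
    """Map the linear combination eta back to the mean E[Y | X] by
    applying the inverse of the chosen link function."""
    _, inverse = LINKS[link_name]
    return inverse(eta)


# The same value of the linear combination gives different means
# depending on the link.
eta = 0.0
means = {name: mean_response(eta, name) for name in LINKS}
```

With `eta = 0.0`, the identity link gives a mean of 0.0, the logit link gives a probability of 0.5, and the log link gives a rate of 1.0, which shows how the link keeps the mean inside the range that each response distribution allows.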