Consider a dataset of real-values vectors:
Each input vector is associated with a real value yi:
A linear model is based on the assumption that it's possible to approximate the output values through a regression process based on the rule:
In other words, the strong assumption is that our dataset and all other unknown points lie on a hyperplane and the maximum error is proportional to both the training quality and the adaptability of the original dataset. One of the most common problems arises when the dataset is clearly non-linear and other models have to be considered (such as neural networks or kernel support vector machines).