So far, we have only used a single variable for prediction: the number of rooms per dwelling. This is, obviously, not the best we can do. We will now use all the data we have to fit a model, using multidimensional regression. We now try to predict a single output (the average house price) based on multiple inputs.
The code looks very much like before. In fact, it's even simpler as we can now pass the value of boston.data directly to the fit method:
x = boston.data
y = boston.target
lr.fit(x, y)
Using all the input variables, the root mean squared error is only 4.7, which corresponds to a coefficient of determination of 0.74 (the code to compute these is the same as the previous example). This is better than what we had before, which indicates that the extra variables did help. But we can no longer easily display the regression line as we did before...