Chapter 4 – Optimizing Your Solutions with K-Means Clustering
- Can a prototype be built with random data in corporate environments? (Yes | No)
The answer is yes, and no. To start developing a prototype, using random data can help make sure that the basic algorithm works as planned.
However, once the prototype is advanced, it will be more reliable to use a well-designed dataset. Then, once the training has been accomplished, random data can again help to see how your system behaves in all situations.
- Do design matrices contain one example per matrix? (Yes | No)
The answer is no. A good design matrix contains one example in each row or each column depending on the shape you want it to have. But be careful; a design matrix that contains data that is too efficient might overfit. That means the learning algorithm will be efficient with that data but not adapt to new data. On the other hand, if the dataset contains too many...