Why do we need the Pipeline API?
Before digging into the details of the Pipeline API, it is important to understand what a machine learning pipeline means, and why we need a Pipeline
API.
It is important to understand that you cannot have an efficient machine learning platform if the only thing you provide is a bunch of algorithms for people to use. Machine learning is quite an involved process, which involves multiple steps, and a machine learning algorithm itself is just one (though very important) part of the step. As an example, let's consider a text classification example, where you have a corpus of text, and you want to classify if that is a sports article or not a sports article. We would like to simplify it to a 1 and a 0, where a 1 indicates it is about sports and 0 indicates it is not about sports. This is a supervised machine learning flow, where we will use data with existing labels, to predict the labels for data with no labels.
You would need to collect this data. Preprocess...