To illustrate the concepts that we will cover in this chapter, we will use the ABC Millions Headlines dataset. This dataset contains approximately a million news headlines. In the github repository associated with this chapter, you should find the following files:
- abcnews-date-text.zip: The input dataset
- libraries-mleap: MLeap libraries (includes a .jar file and a Python wrapper for the .jar)
Let's begin by looking at the architecture of an inference pipeline.