Chapter 7: Open Source Machine Learning Platforms
In the previous chapter, we covered how Kubernetes can serve as the foundational infrastructure for machine learning (ML) tasks, such as running model training jobs or hosting data science environments like Jupyter notebook servers. However, to perform these tasks efficiently and at scale in a large organization, you will need to build ML platforms that support the full data science life cycle. The required capabilities include scalable data science environments, model training services, model registries, and model deployment.
In this chapter, we will discuss the core components of an ML platform and additional open source technologies that can be used for building ML platforms. We will start with technologies for building a data science environment that can support a large number of users for experiments, and then discuss other technologies for model training, model registries, model...