Model deployment and monitoring
If you are new to deployment, you might think of it as developing a frontend, mobile application, or API for the end users of your models. That is not what we want to talk about in this book. There are two important aspects of deployment that we want to cover here and in future chapters: the actions needed to make a model available in production, and the integration of that model into a process that is supposed to benefit its users.
When you deploy your model, your code should run properly in the designated environment and have access to the required hardware, such as GPUs. Users’ data also needs to be accessible in the right format for your model to work. Some of the tests that we talked about in the testing stage of the life cycle make sure that your model runs as expected in the production environment.
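The requirements in the paragraph above can be turned into a small pre-deployment check. The following is a minimal sketch, not a definitive implementation: the schema, the field names, and the functions gpu_tooling_available, find_format_errors, and preflight are all hypothetical names introduced for illustration. It also assumes that finding the nvidia-smi command on the PATH is an acceptable rough proxy for GPU access, which may not hold in your environment.

```python
import shutil

# Hypothetical schema for illustration: field name -> expected Python type.
EXPECTED_SCHEMA = {"age": int, "income": float, "country": str}


def gpu_tooling_available() -> bool:
    """Rough proxy for GPU access (an assumption): is `nvidia-smi` on PATH?"""
    return shutil.which("nvidia-smi") is not None


def find_format_errors(record: dict, schema: dict) -> list:
    """Return human-readable problems; an empty list means the record fits."""
    errors = []
    for field, expected_type in schema.items():
        if field not in record:
            errors.append(f"missing field: {field}")
        elif not isinstance(record[field], expected_type):
            actual = type(record[field]).__name__
            errors.append(f"{field}: expected {expected_type.__name__}, got {actual}")
    return errors


def preflight(sample_record: dict, require_gpu: bool = False) -> list:
    """Collect environment and data-format problems before serving a model."""
    problems = []
    if require_gpu and not gpu_tooling_available():
        problems.append("no GPU tooling found (nvidia-smi not on PATH)")
    problems.extend(find_format_errors(sample_record, EXPECTED_SCHEMA))
    return problems


if __name__ == "__main__":
    sample = {"age": "30", "income": 52000.0}
    for problem in preflight(sample):
        print(problem)
```

Returning a list of problems, rather than raising on the first one, lets a deployment script report every issue in a single run. A real check would likely query the deep learning framework itself for device availability instead of looking for a command-line tool.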
When we talk about providing a model in a production environment, it either gets used behind the scenes for the benefit of the user, such as when...