Reviewing the standard Pachyderm workflow
As you probably noticed when you were creating a pipeline, working with Pachyderm follows a certain workflow. The details might differ depending on your automation tools, your team processes, and the software that you use, but in general, the workflow boils down to the following common steps:
1. Add your data to Pachyderm.
2. Create a pipeline or pipelines.
3. Add more data.
4. Modify the parameters.
5. Modify your pipeline or pipelines.
6. Output the result to an output repository.
7. Repeat Steps 3–6 as many times as needed or continuously.
8. Serve your model or package your library and make it available for use.
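To make the "create a pipeline" and "modify the parameters" steps more concrete, here is a minimal Python sketch that builds a pipeline specification as a plain dictionary and then produces a modified revision of it. The layout (`pipeline.name`, `transform.image`, `transform.cmd`, `input.pfs.repo`, `input.pfs.glob`) follows the general shape of a Pachyderm pipeline spec, but the repository name, image tags, script path, and `--threshold` flag are all hypothetical placeholders, and `make_pipeline_spec` is a helper invented for this illustration rather than part of any Pachyderm library.

```python
import json


def make_pipeline_spec(name, input_repo, image, cmd, glob="/*"):
    """Build a minimal pipeline specification as a plain dict.

    This is an illustrative helper, not a Pachyderm API.
    """
    return {
        "pipeline": {"name": name},
        "transform": {"image": image, "cmd": cmd},
        "input": {"pfs": {"repo": input_repo, "glob": glob}},
    }


# "Create a pipeline": the first revision of the spec.
# All names and values below are hypothetical placeholders.
spec_v1 = make_pipeline_spec(
    name="example-pipeline",
    input_repo="example-data",
    image="example/image:1.0",
    cmd=["python3", "/process.py", "--threshold", "0.5"],
)

# "Modify the parameters" and "modify your pipeline": a second revision
# that changes one command-line parameter and points to a rebuilt image.
spec_v2 = make_pipeline_spec(
    name="example-pipeline",
    input_repo="example-data",
    image="example/image:1.1",
    cmd=["python3", "/process.py", "--threshold", "0.7"],
)

# Serialize the revised spec so that it can be saved to a file.
spec_json = json.dumps(spec_v2, indent=2)
print(spec_json)
```

A spec serialized this way could be saved to a file and passed to `pachctl create pipeline -f` for the first revision, or to `pachctl update pipeline -f` for later ones. Keeping each revision as data makes it easy to see exactly which parameters changed between iterations of the loop.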
The following diagram demonstrates this process:
Depending on whether you keep your code in a Docker image, in the pipeline itself, or you use a build pipeline with your Python code, you need to rebuild your Docker image every...