Exploring transformation
The transformation
section is where you define your pipeline transformation code. It is the core of your pipeline's functionality. Most pipelines, unless they are a connector between two pipelines or a pipeline that exports results outside of Pachyderm, must have a transformation
section.
The most important parameters of a transformation section – and the ones that are most commonly used – are image
and cmd
or stdin
, env
, and secrets
.
Let's look at these parameters in more detail.
image
The image
parameter defines a Docker image that your pipeline will run. A Docker image contains information about the environment in your pipeline container. For example, if you are running Python code, you will need to have some version of Python in your pipeline image. There are many publicly available containers that you can use for your pipeline.
You can also include your scripts in that container. Unless your code is just a Bash script...