Mesos fetcher
The Mesos fetcher is a way by which resources can be downloaded in the task sandbox
directory while preparing the task execution. The framework requesting the execution of the task sends a list of CommandInfo::URI
values as part of the TaskInfo
message, which in turn serves as the Mesos fetcher input.
The Mesos fetcher natively supports the FTP and HTTP protocols, and is also able to copy over files from a filesystem. It also supports all Hadoop client protocols such as Amazon Simple Storage Service (S3), Hadoop distributed Filesystem (HDFS), and so on.
Every Uniform Resource Identifier (URI) that is requested gets downloaded, by default, directly into the sandbox
directory. Multiple requests for the same URI results in copies of that resource being downloaded again and again. The downloaded URIs can also alternatively be cached in a specified directory for reuse.
Mechanism
The mechanism comprises the following:
Each slave consists of one internal fetcher instance which is leveraged...