This chapter explored different ways of ingesting data from files, relational and NoSQL databases, and S3-based object storage systems using the DeepLearning4j DataVec library and the Apache Spark (core and Spark SQL modules) framework, and showed some examples of how to transform the raw data. All of the examples presented represent data ingestion and transformation in a batch fashion.
The next chapter will focus on ingesting and transforming data to train or test your DL model in streaming mode.