In this chapter, we covered three popular frameworks: Apache Spark, Apache Flink, and Apache Storm. We briefly looked at their architecture, their internal workings, and the APIs they provide. We also covered best practices for each of these processing engines and discussed why they matter.
In the next chapter, we will talk about a few widely used components, such as Apache Pig, Apache Kafka, Apache Flume, and Apache HBase. We will go through the internals of each component and walk you through some examples. The primary focus will be on understanding when to use which component and which best practices to follow with these applications.