Summary
In this chapter, we covered several real-world considerations to keep in mind when planning your Flume implementation, including the following:
Transport time not always matching event time
The mayhem that daylight saving time introduces into your time-based logic
Capacity planning considerations
Items to consider when you have more than one data center
Data compliance
Data expiration
I hope you enjoyed this book and that you will be able to apply much of this information directly in your application/Hadoop integration efforts.
Thanks. This was fun.