Summary
The goal of this chapter was to explain what Apache Arrow is, get you acquainted with the format, and have you use it in some simple use cases. This knowledge forms the baseline of everything else for us to talk about in the rest of this book!
Just as a reminder, you can check out this book’s GitHub repository (https://github.com/PacktPublishing/In-Memory-Analytics-with-Apache-Arrow-Second-Edition) for the solutions to the exercises presented here and for the full code samples so that you can make sure you understand the concepts!
The examples and exercises that were provided in this chapter are all fairly trivial and are meant to help reinforce the concepts that were introduced about Arrow’s format and specification while helping you get familiar with using Arrow in code.
In Chapter 2, Working with Key Arrow Specifications, you’ll learn how to read your data into the Arrow format, whether it’s on your local disk, Hadoop Distributed File...