Technical requirements
Throughout this chapter, I’ll be providing various code samples while using the Python, C++, and Golang Arrow libraries and the public NYC Taxi Trip Duration dataset.
To run the practical examples in this chapter, you will need the following:
- A computer that’s connected to the internet
- Your preferred IDE (VS Code, Sublime, Emacs, Vim, and so on)
- Python 3.8+ with the
pyarrow
,pandas
, andpolars
modules installed - Go 1.21+
- A C++ compiler (capable of compiling C++17 or newer) with the Arrow libraries installed
- The
sample_data
folder from this book’s GitHub repository: https://github.com/PacktPublishing/In-Memory-Analytics-with-Apache-Arrow-Second-Editiongit-lfs
is required to be installed to download the sample data; instructions to install it are provided in this chapter
Let’s get started!