Powerful ETL Libraries and Tools in Python
Up to this point in the book, we have covered the fundamentals of building data pipelines. We’ve introduced some of Python’s most common modules that can be utilized to establish rudimentary iterations of data pipelines. While this is a great place to start, these methods are far from the most realistic approach; there is no lack of space for improvement. There are several powerful, ETL-specific Python libraries and pipeline management platforms that we can use to our advantage to make more durable, scalable, and resilient data pipelines suitable for real-world deployment scenarios.
We will divide this chapter into two parts. We start by introducing six of Python’s most popular ETL pipeline libraries. We will use the same “seed” ETL activities with each library, walking through how each of the following resources can be used to create an organized, reusable data ETL pipeline:
- Part 1 – ETL...