Summary
In this chapter, we learned how to use DuckDB effectively for performing a range of data manipulation activities, focusing in particular on how we can wrangle and clean data in order to prepare it for analysis. We took raw logs from a web server and converted them into a structured format that allowed us to perform some simple analysis on the data, such as seeing which types of users visit our website. We also performed the data processing required to prepare thousands of rows of taxi-trip data, allowing us to see how passengers travel in and around New York.
We have now seen a variety of core data manipulation steps and you should have a feel for how to use aggregate and window functions to discover interesting insights within large datasets using DuckDB.
Now that we have seen some techniques and strategies for using DuckDB to manipulate and analyze data with DuckDB, in the next chapter, we will further explore DuckDB techniques and features, this time looking at how...