Getting Started with DuckDB: A practical guide for accelerating your data science, data analytics, and data engineering workflows

Simon Aubury

Ned Letcher

$19.99 per month

5 (1 Ratings)

Paperback Jun 2024 382 pages 1st Edition

Simon Aubury

Ned Letcher

$19.99 per month

5 (1 Ratings)

Paperback Jun 2024 382 pages 1st Edition

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!

Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!

50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

Thousands of reference materials covering every tech concept you need to stay up to date.

Subscribe now

View plans & pricing

View table of contents

Preview Book

Download Code

Key benefits

Use DuckDB to rapidly load, transform, and query data across a range of sources and formats
Gain practical experience using SQL, Python, and R to effectively analyze data
Learn how open source tools and cloud services in the broader data ecosystem complement DuckDB’s versatile capabilities
Purchase of the print or Kindle book includes a free PDF eBook

Description

DuckDB is a fast in-process analytical database. Getting Started with DuckDB offers a practical overview of its usage. You'll learn to load, transform, and query various data formats, including CSV, JSON, and Parquet. The book covers DuckDB's optimizations, SQL enhancements, and extensions for specialized applications. Working with examples in SQL, Python, and R, you'll explore analyzing public datasets and discover tools enhancing DuckDB workflows. This guide suits both experienced and new data practitioners, quickly equipping you to apply DuckDB's capabilities in analytical projects. You'll gain proficiency in using DuckDB for diverse tasks, enabling effective integration into your data workflows.

Who is this book for?

If you’re interested in expanding your analytical toolkit, this book is for you. It will be particularly valuable for data analysts wanting to rapidly explore and query complex data, data and software engineers looking for a lean and versatile data processing tool, along with data scientists needing a scalable data manipulation library that integrates seamlessly with Python and R. You will get the most from this book if you have some familiarity with SQL and foundational database concepts, as well as exposure to a programming language such as Python or R.

What you will learn

Understand the properties and applications of a columnar in-process database
Use SQL to load, transform, and query a range of data formats
Discover DuckDB's rich extensions and learn how to apply them
Use nested data types to model semi-structured data and extract and model JSON data
Integrate DuckDB into your Python and R analytical workflows
Effectively leverage DuckDB's convenient SQL enhancements
Explore the wider ecosystem and pathways for building DuckDB-powered data applications

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!

Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!

50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

Thousands of reference materials covering every tech concept you need to stay up to date.

Subscribe now

View plans & pricing

Frequently bought together

Machine Learning with PyTorch and Scikit-Learn

$54.99

$49.99

$54.99

Total $ 159.97

Vishnuvardhan Oct 17, 2024

"Getting Started with DuckDB" provides an excellent, hands-on introduction to DuckDB, showcasing its speed and versatility in data analytics and engineering. The practical examples and easy-to-follow explanations make it a valuable resource for anyone looking to enhance their workflows. Ideal for beginners and experienced professionals alike, it bridges the gap between theory and application effectively

Amazon Verified review

Getting Started with DuckDB: A practical guide for accelerating your data science, data analytics, and data engineering workflows

What do you get with a Packt Subscription?