0

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

Pandas 1.x Cookbook

You're reading from Pandas 1.x Cookbook Practical recipes for scientific computing, time series analysis, and exploratory data analysis using Python

Product type Paperback

Published in Feb 2020

Publisher Packt

ISBN-13 9781839213106

Length 626 pages

Edition 2nd Edition

Languages

Python

Tools

Pandas

Concepts

Data Analysis

Authors (2):

Theodore Petrou

Matthew Harrison

View More author details

Table of Contents (17) Chapters

Preface

1. Pandas Foundations

2. Essential DataFrame Operations FREE CHAPTER

3. Creating and Persisting DataFrames

4. Beginning Data Analysis

5. Exploratory Data Analysis

6. Selecting Subsets of Data

7. Filtering Rows

8. Index Alignment

9. Grouping for Aggregation, Filtration, and Transformation

10. Restructuring Data into a Tidy Form

11. Combining Pandas Objects

12. Time Series Analysis

13. Visualization with Matplotlib, Pandas, and Seaborn

14. Debugging and Testing Pandas

15. Other Books You May Enjoy

16. Index

Apply performance

The .apply method on a Series and DataFrame is one of the slowest operations in pandas. In this recipe, we will explore the speed of it and see if we can debug what is going on.

How to do it…

Let's time how long one use of the .apply method takes using the %%timeit cell magic in Jupiter. This is the code from the tweak_kag function that limits the cardinality of the country column (Q3):

>>> %%timeit
>>> def limit_countries(val):
...      if val in  {'United States of America', 'India', 'China'}:
...          return val
...      return 'Another'
>>> q3 = df.Q3.apply(limit_countries).rename('Country')
6.42 ms ± 1.22 ms per loop (mean ± std. dev. of 7 runs, 100 loops each)

Let's look at using the .replace method instead of .apply and see if that improves performance:
```
>>> %%timeit
>>> other_values = df...
```

The rest of the chapter is locked

Register for a free Packt account to unlock a world of extra content!

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Matthew Harrison

Matthew Harrison

Matt Harrison has been using Python since 2000. He runs MetaSnake, which provides corporate training for Python and Data Science. He is the author of Machine Learning Pocket Reference, the bestselling Illustrated Guide to Python 3, and Learning the Pandas Library, among other books

See other products by Matthew Harrison

Theodore Petrou

Theodore Petrou

Theodore Petrou is the founder of Dunder Data, a training company dedicated to helping teach the Python data science ecosystem effectively to individuals and corporations. Read his tutorials and attempt his data science challenges at the Dunder Data website.

See other products by Theodore Petrou

Other recommended products

Related to this chapter

Mastering Exploratory Analysis with pandas

Mastering Exploratory Analysis with pandas

Exploratory data analysis exploits the visual properties of the datasets that are commonly used by data scientists. It helps you build custom data pipelines to address data analysis tasks. This book uses pandas, the most popular Python library for data analysis, and helps you build end-to-end exploratory data-analysis solutions

Sep 2018 4h 40m

Python Data Cleaning Cookbook

Python Data Cleaning Cookbook

The book shows you how to view data from multiple perspectives, including data frame and column attributes. You will cover common and not-so-common challenges that are faced while cleaning messy data for complex situations. You will learn to manipulate data and get them down to a form that can be useful for making the right decisions.

Dec 2020 14h 32m

Learning pandas

Learning pandas

Pandas is a popular Python package used for practical, real world data analysis. It provides efficient fast, high-performance data structures that makes data exploration and analysis very easy. This learner's guide will help you through a comprehensive set of features provided by the pandas library to perform efficient data manipulation and analysis.

Jun 2017 14h 52m

Hands-On Data Analysis with NumPy and Pandas

Hands-On Data Analysis with NumPy and Pandas

In this book, you will explore two important Python packages used by Data Analysts, NumPy & pandas. You will dive into different concepts such as reading, sorting, grouping of data, and also learn how to work with different data formats for your data analysis projects.

Jun 2018 5h 36m

Mastering pandas

Mastering pandas

pandas is a popular Python library used by data scientists and analysts worldwide to manipulate and analyze their data. This book presents useful techniques and real-world examples on getting the most out of pandas for expert-level data manipulation, analysis and visualization.

Oct 2019 22h 28m

Hands-On Data Analysis with Pandas

Hands-On Data Analysis with Pandas

This book will be a handy guide to quickly learn pandas and understand how it can empower you in the exciting world of data manipulation, analysis, and data science. You will learn how to use pandas to perform numeric and statistical analysis using real-world examples. You will also visualize statistical data and apply pandas to different domains.

Jul 2019 24h 40m

Hands-On Data Analysis with Pandas

Hands-On Data Analysis with Pandas

Knowing how to work with data to extract insights generates significant value. This book will help you to develop data analysis skills using a hands-on approach and real-world data. You'll get up to speed with pandas 1.x in no time and build some software engineering skills in the process, vastly expanding your data science toolbox.

Apr 2021 26h 16m

Personalised recommendations for you

Based on your interests and search pattern

Pragmatic Microservices with C# and Azure

Pragmatic Microservices with C# and Azure

This book empowers you with in-depth knowledge of microservices architecture using .NET and Azure. Through hands-on tutorials, you'll be able to build, deploy, and manage scalable applications, gaining crucial skills for modern software development.

May 2024 16h 56m

Mastering Python Design Patterns

Mastering Python Design Patterns

Unlock the power of design patterns to build maintainable and scalable software and applications using Python. Authored by Python veterans, this book is your guide to mastering design patterns in Python.

May 2024 9h 52m

System Programming Essentials with Go

System Programming Essentials with Go

From file operations to process management and network programming, this hands-on guide equips software engineers with the skills to build efficient, reliable applications and optimize their performance.

Jun 2024 13h 36m

Modern Python Cookbook

Modern Python Cookbook

The new edition of Modern Python Cookbook provides over 130 recipes for solving real-world problems with Python. Updated for Python 3.12 with new recipes and chapters. This practical guide will enhance your skills and teach you advanced techniques.

Jul 2024 27h 16m

The Ultimate Zoom Cookbook

The Ultimate Zoom Cookbook

This cookbook is an in-depth guide to using Zoom effectively. You'll be able to follow each recipe easily to harness the power of the communication and collaboration tools in Zoom.

May 2024 11h 20m

Enterprise Architecture with .NET

Enterprise Architecture with .NET

This book will help you create applications that integrate correctly into complex and ever-changing information systems. You'll execute this by using industry standards that reduce the app's technical debt and elevate software development practices.

May 2024 25h 44m

Salesforce B2C Solution Architect's Handbook

Salesforce B2C Solution Architect's Handbook

Discover how Salesforce Customer 360 unifies Marketing Cloud, B2C Commerce, Data Cloud, and Service Cloud into one solution, and learn the capabilities, integration options, limitations, and workflows to create value for your organization.

May 2024 15h 28m

Systems Programming with C# and .NET

Systems Programming with C# and .NET

Unlock the full potential of C# and .NET Core in systems programming to secure, deploy, and maintain robust applications. With this book, you'll focus on low-level APIs, memory management, and performance optimization.

Jul 2024 15h 48m

Technical Program Manager's Handbook

Technical Program Manager's Handbook

Unlock the full potential of your career as a TPM with this comprehensive guide, featuring new chapters on AI and more. Learn advanced techniques and gain insights from industry leaders. Elevate your skills and thrive in the Big Five tech companies.

Sep 2024 12h 16m

Microsoft Power Pages in Action

Microsoft Power Pages in Action

Packed with real-world examples, low-code coding techniques, and insights into crafting responsive pages, automating apps, and enhancing virtual agents, Microsoft Power Pages in Action is a valuable resource for building feature-rich web apps.

Jun 2024 11h 40m