Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
The Statistics and Calculus with Python Workshop

You're reading from   The Statistics and Calculus with Python Workshop A comprehensive introduction to mathematics in Python for artificial intelligence applications

Arrow left icon
Product type Paperback
Published in Aug 2020
Publisher Packt
ISBN-13 9781800209763
Length 740 pages
Edition 1st Edition
Languages
Arrow right icon
Authors (6):
Arrow left icon
Ajinkya Sudhir Kolhe Ajinkya Sudhir Kolhe
Author Profile Icon Ajinkya Sudhir Kolhe
Ajinkya Sudhir Kolhe
Quan Nguyen Quan Nguyen
Author Profile Icon Quan Nguyen
Quan Nguyen
Marios Tsatsos Marios Tsatsos
Author Profile Icon Marios Tsatsos
Marios Tsatsos
Alexander Joseph Sarver Alexander Joseph Sarver
Author Profile Icon Alexander Joseph Sarver
Alexander Joseph Sarver
Peter Farrell Peter Farrell
Author Profile Icon Peter Farrell
Peter Farrell
Alvaro Fuentes Alvaro Fuentes
Author Profile Icon Alvaro Fuentes
Alvaro Fuentes
+2 more Show less
Arrow right icon
View More author details
Toc

Table of Contents (14) Chapters Close

Preface
1. Fundamentals of Python 2. Python's Main Tools for Statistics FREE CHAPTER 3. Python's Statistical Toolbox 4. Functions and Algebra with Python 5. More Mathematics with Python 6. Matrices and Markov Chains with Python 7. Doing Basic Statistics with Python 8. Foundational Probability Concepts and Their Applications 9. Intermediate Statistics with Python 10. Foundational Calculus with Python 11. More Calculus with Python 12. Intermediate Calculus with Python Appendix

3. Python's Statistical Toolbox

Activity 3.01: Revisiting the Communities and Crimes Dataset

Solution

  1. The libraries can be imported, and pandas can be used to read in the dataset as follows:
    import pandas as pd
    import numpy as np
    import matplotlib.pyplot as plt
    import seaborn as sns
    df = pd.read_csv('CommViolPredUnnormalizedData.txt')
    df.head()

    Your output should be the following:

    Figure 3.29: The first five rows of the dataset

  2. To replace the special character with the np.nan object, we can use the following code:
    df = df.replace('?', np.nan)
  3. To compute the actual count for the different age groups, we can simply use the expression df['population'] * df['agePct...'], which computes the count in a vectorized way:
    age_groups = ['12t21', '12t29', '16t24', '65up']
     
    for group in age_groups:
        df['ageCnt' + group] = (df['population'] * \
      ...
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime