Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
The Statistics and Calculus with Python Workshop

You're reading from   The Statistics and Calculus with Python Workshop A comprehensive introduction to mathematics in Python for artificial intelligence applications

Arrow left icon
Product type Paperback
Published in Aug 2020
Publisher Packt
ISBN-13 9781800209763
Length 740 pages
Edition 1st Edition
Languages
Arrow right icon
Authors (6):
Arrow left icon
Ajinkya Sudhir Kolhe Ajinkya Sudhir Kolhe
Author Profile Icon Ajinkya Sudhir Kolhe
Ajinkya Sudhir Kolhe
Quan Nguyen Quan Nguyen
Author Profile Icon Quan Nguyen
Quan Nguyen
Marios Tsatsos Marios Tsatsos
Author Profile Icon Marios Tsatsos
Marios Tsatsos
Alexander Joseph Sarver Alexander Joseph Sarver
Author Profile Icon Alexander Joseph Sarver
Alexander Joseph Sarver
Peter Farrell Peter Farrell
Author Profile Icon Peter Farrell
Peter Farrell
Alvaro Fuentes Alvaro Fuentes
Author Profile Icon Alvaro Fuentes
Alvaro Fuentes
+2 more Show less
Arrow right icon
View More author details
Toc

Table of Contents (14) Chapters Close

Preface
1. Fundamentals of Python 2. Python's Main Tools for Statistics FREE CHAPTER 3. Python's Statistical Toolbox 4. Functions and Algebra with Python 5. More Mathematics with Python 6. Matrices and Markov Chains with Python 7. Doing Basic Statistics with Python 8. Foundational Probability Concepts and Their Applications 9. Intermediate Statistics with Python 10. Foundational Calculus with Python 11. More Calculus with Python 12. Intermediate Calculus with Python Appendix

2. Python's Main Tools for Statistics

Activity 2.01: Analyzing the Communities and Crime Dataset

Solution:

  1. Once the dataset has been downloaded, the libraries can be imported, and pandas can be used to read in the dataset in a new Jupyter notebook, as follows:
    import pandas as pd
    import numpy as np
    import matplotlib.pyplot as plt
    df = pd.read_csv('CommViolPredUnnormalizedData.txt')
    df.head()

    We are also printing out the first five rows of the dataset, which should be as follows:

    Figure 2.21: The first five rows of the dataset

  2. To print out the column names, we can simply iterate through df.columns in a for loop, like so:
    for column in df.columns:
        print(column)
  3. The total number of columns in the dataset can be computed using the len() function in Python:
    print(len(df.columns))
  4. To replace the special character '?' with np.nan objects, we can use the replace() method:
    df = df.replace('?', np.nan)
  5. To print out...
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime