You're reading from Beginning Data Science with Python and Jupyter Use powerful industry-standard tools within Jupyter and the Python ecosystem to unlock new, actionable insights from your data

Product type Paperback

Published in Jun 2018

Publisher

ISBN-13 9781789532029

Length 194 pages

Edition 1st Edition

Languages

Python

Tools

Jupyter

Concepts

Data Analysis

Author (1):

Alex Galea

View More author details

Table of Contents (5) Chapters

Preface

1. Jupyter Fundamentals

2. Data Cleaning and Advanced Machine Learning FREE CHAPTER

3. Web Scraping and Interactive Visualizations

Index

What This Book Covers

Lesson 1, Jupyter Fundamentals, covers the fundamentals of data analysis in Jupyter. We will start with usage instructions and features of Jupyter such as magic functions and tab completion. We will then transition to data science specific material. We will run an exploratory analysis in a live Jupyter Notebook. We will use visual assists such as scatter plots, histograms, and violin plots to deepen our understanding of the data. We will also perform simple predictive modeling.

Lesson 2, Data Cleaning and Advanced Machine Learning, shows how predictive models can be trained in Jupyter Notebooks. We will talk about how to plan a machine learning strategy. This lesson also explains the machine learning terminology such as supervised learning, unsupervised learning, classification, and regression. We will discuss methods for preprocessing data using scikit-learn and pandas.

Lesson 3, Web Scraping and Interactive Visualizations, explains how to scrap web page tables and then use interactive visualizations to study the data. We will start by looking at how HTTP requests work, focusing on GET requests and their response status codes. Then, we will go into the Jupyter Notebook and make HTTP requests with Python using the Requests library. We will see how Jupyter can be used to render HTML in the notebook, along with actual web pages that can be interacted with. After making requests, we will see how Beautiful Soup can be used to parse text from the HTML, and used this library to scrape tabular data.