0

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

Become a Python Data Analyst

You're reading from Become a Python Data Analyst Perform exploratory data analysis and gain insight into scientific computing using Python

Product type Paperback

Published in Aug 2018

Publisher Packt

ISBN-13 9781789531701

Length 178 pages

Edition 1st Edition

Languages

Python

Tools

Pandas

Concepts

Data Analysis

Author (1):

Alvaro Fuentes

View More author details

Table of Contents (8) Chapters

Preface

1. The Anaconda Distribution and Jupyter Notebook

2. Vectorizing Operations with NumPy FREE CHAPTER

3. Pandas - Everyone's Favorite Data Analysis Library

4. Visualization and Exploratory Data Analysis

5. Statistical Computing with Python

6. Introduction to Predictive Analytics Models

7. Other Books You May Enjoy

Leave a review - let other readers know what you think

What this book covers

Chapter 1, The Anaconda Distribution and Jupyter Notebook, covers the most important libraries for data science with Python. This is a well-charted overview of the main objects, attributes, methods, and functions that we will use for doing predictive analytics with Python.

Chapter 2, Vectorizing Operations with NumPy, explores Numpy—this is the library upon which almost all other scientific computing in Python projects are based. Learning how to handle NumPy arrays is crucial for doing anything related to data science in Python.

Chapter 3, Pandas - Everyone's Favorite Data Analysis Library, gives an overview of pandas which is a library that provides high performance, easy-to-use data structures, and data analysis tools for the Python programming language. We data scientists love it, and it is one of the key reasons behind Python’s popularity in the data science community. In this section, we show by example how to perform descriptive analysis with pandas.

Chapter 4, Visualization and Explanatory Data Analysis, explains that visualization is a key topic for data science. Python provides a lot of options for doing visualizations for different purposes. In this volume, we learn about two of the most popular libraries, matplotlib and seaborn, and perform exploratory data analysis on real-world datasets.

Chapter 5, Statistical Computing with Python, explains how to perform common statistical computations with Python and use them to make sense of a dataset that contains information about the alcohol consumption of teenagers.

Chapter 6, Introduction to Predictive Analytics Models, gives a brief introduction to predictive analytics and builds a model to predict the drinking habits of teenagers.

The rest of the chapter is locked

Register for a free Packt account to unlock a world of extra content!

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Alvaro Fuentes

Alvaro Fuentes

Alvaro Fuentes is a senior data scientist with a background in applied mathematics and economics. He has more than 14 years of experience in various analytical roles and is an analytics consultant at one of the ‘Big Three' global management consulting firms, leading advanced analytics projects in different industries like banking, technology, and consumer goods. Alvaro is also an author and trainer in analytics and data science and has published courses and books, such as 'Become a Python Data Analyst' and 'Hands-On Predictive Analytics with Python'. He has also taught data science and related topics to thousands of students both on-site and online through different platforms such as Springboard, Simplilearn, Udemy, and BSG Institute, among others.

See other products by Alvaro Fuentes

Other recommended products

Related to this chapter

Data Visualization with Python for Beginners

Data Visualization with Python for Beginners

Utilizing tools and operations from several major libraries, this book will teach you to visualize data with Python comfortably and confidently in no time at all.

Mar 2021 9h 20m

Mastering Exploratory Analysis with pandas

Mastering Exploratory Analysis with pandas

Exploratory data analysis exploits the visual properties of the datasets that are commonly used by data scientists. It helps you build custom data pipelines to address data analysis tasks. This book uses pandas, the most popular Python library for data analysis, and helps you build end-to-end exploratory data-analysis solutions

Sep 2018 4h 40m

Big Data Analysis with Python

Big Data Analysis with Python

Processing big data in real time is challenging due to scalability, information inconsistency, and fault tolerance. Big Data Analysis with Python teaches you how to use tools that can control the data avalanche for you. With this book, you'll learn effective techniques to aggregate data into useful dimensions for posterior analysis, extract statistical measurements, and transform datasets into features for other systems.

Apr 2019 9h 12m

Hands-On Predictive Analytics with Python

Hands-On Predictive Analytics with Python

This book will teach you all the processes you need to build a predictive analytics solution: understanding the problem, preparing datasets, exploring relationships, model building, tuning, evaluation, and deployment. You'll earn to use Python and its data analytics ecosystem to implement the main techniques used in real-world projects.

Dec 2018 11h 0m

Applied Data Science with Python and Jupyter

Applied Data Science with Python and Jupyter

Applied Data Science with Python and Jupyter teaches you the skills you need for entry-level data science. You'll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world. You'll finish up by learning how easy it can be to scrape and gather your own data from the open web so that you can apply your new skills in an actionable context.

Oct 2018 6h 24m

Beginning Data Science with Python and Jupyter

Beginning Data Science with Python and Jupyter

Get to grips with the skills you need for entry-level data science in this hands-on Python and Jupyter course. You'll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world. We'll finish up by showing you how easy it can be to scrape and gather your own data from the open web, so that you can apply your new skills in an actionable context.

Jun 2018 6h 28m

Scientific Computing with Python

Scientific Computing with Python

Python is an efficient tool for coupling scientific computing and mathematics. This book teaches you how to use it for linear algebra, arrays, plotting, iterating, functions, and polynomials. You'll explore task automation and understand essential math concepts and algorithms along with integrations for faster computation in scientific computing.

Jul 2021 13h 4m

Mastering Predictive Analytics with scikit-learn and TensorFlow

Mastering Predictive Analytics with scikit-learn and TensorFlow

In this book, you will find a range of methods to improve the performance of almost any predictive model, from ensemble methods to dimensionality reduction and cross-validation. You will learn the tools to produce advanced predictive models. In addition, you will dive into the exiting field of Deep Learning using TensorFlow.

Matplotlib 2.x By Example

Matplotlib 2.x By Example

Big data analytics are driving innovations in scientific research, digital marketing, policymaking and much more. Matplotlib offers simple but powerful plotting interface, versatile plot types and robust customizations, which help resolve the complexity in Big data visualization. “Matplotlib 2.x By Example” illustrates the methods and applications of various plot types through real world examples. It begins by giving readers the basic knowhow on how to create and customize plots by Matplotlib. It further covers how to plot different types of economic data in the form of 2D and 3D graphs, which give insights from a deluge of data from public repositories, such as Quandl Finance. You will learn to visualize geographical data on maps and implement interactive charts. By the end of this book, you will become well versed with Matplotlib in your day-to-day work to perform advanced data visualization.

Aug 2017 11h 8m

Learning pandas

Learning pandas

Pandas is a popular Python package used for practical, real world data analysis. It provides efficient fast, high-performance data structures that makes data exploration and analysis very easy. This learner's guide will help you through a comprehensive set of features provided by the pandas library to perform efficient data manipulation and analysis.

Jun 2017 14h 52m

Hands-On Exploratory Data Analysis with Python

Hands-On Exploratory Data Analysis with Python

This book provides practical knowledge about the main pillars of EDA including data cleaning, data preparation, data exploration, and data visualization. You can leverage the power of Python to understand, summarize and investigate your data in the best way possible. The book presents a unique approach to exploring hidden features in your data.

Mar 2020 11h 44m

The Statistics and Calculus with Python Workshop

The Statistics and Calculus with Python Workshop

The Statistics and Calculus with Python Workshop is ideal for those who need a refresher in the mathematics that make the practical applications of modern artificial intelligence possible. Starting with foundational mathematical concepts like functions and matrices, the book covers the entire spectrum of calculus and statistics, teaching you the techniques you need to solve mathematical challenges with Python and AI.

Aug 2020 24h 40m

Personalised recommendations for you

Based on your interests and search pattern

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Data Governance Handbook

Data Governance Handbook

This book provides a highly focused view of real business outcomes powered by data governance, that resonate with non-data executives such as CFOs and CEOs. You'll also find useful insights into how to implement data governance initiatives.

May 2024 13h 8m

Data Engineering with Databricks Cookbook

Data Engineering with Databricks Cookbook

This book shows you how to use Apache Spark, Delta Lake, and Databricks to build data pipelines, manage and transform data, optimize performance, and more. Additionally, you'll implement DataOps and DevOps practices, and orchestrate data workflows.

May 2024 14h 36m

Azure Data Engineer Associate Certification Guide

Azure Data Engineer Associate Certification Guide

Unlock the power of Azure data engineering with this certification guide, elevating your skills in data processing, storage, and security with the help of practical insights, hands-on exercises, and the latest advancements.

May 2024 18h 16m

Microsoft Power BI Cookbook

Microsoft Power BI Cookbook

Microsoft Power BI is the most sought-after platform for BI professionals' visualization needs. Explore the latest Power BI features, future AI enhancements, and integration with other Power Platform tools via new recipes in this updated edition.

Jul 2024 19h 56m

Python Data Cleaning Cookbook

Python Data Cleaning Cookbook

The book shows you how to clean, wrangle, and view data from multiple perspectives, including dataset and column attributes. You will cover common and not-so-common challenges that are faced while cleaning messy data for complex situations and learn to manipulate data to get it down to a form that can be useful for making the right decisions.

May 2024 16h 12m

Microsoft Azure AI Fundamentals AI-900 Exam Guide

Microsoft Azure AI Fundamentals AI-900 Exam Guide

This AI-900 study guide will help you prepare and practice for the certification exam. You'll delve into AI workloads, ML principles, computer vision, NLP, knowledge mining, and generative AI using Azure cloud services.

May 2024 9h 36m

Using Stable Diffusion with Python

Using Stable Diffusion with Python

This book shows you how to use Python to control Stable Diffusion and generate high-quality images. In addition to covering the basic usage of the diffusers package, the book provides solutions for extending the package for more advanced purposes.

Jun 2024 11h 44m

Getting Started with DuckDB

Getting Started with DuckDB

This hands-on book teaches you to analyze large datasets with blazing speed and ease. You will learn how to use DuckDB to quickly load, query, transform, analyze, and visualize data effectively through a series of practical examples.

Jun 2024 12h 44m

Databricks Certified Associate Developer for Apache Spark Using Python

Databricks Certified Associate Developer for Apache Spark Using Python

This guide gets you ready for certification with expert-backed content, key exam concepts, and topic reviews. Additionally, you'll be able to make the most of Apache Spark 3.0 to modernize workloads and more using specific tools and techniques.