You're reading from Data Analysis with Python A Modern Approach

Product type Paperback

Published in Dec 2018

Publisher Packt

ISBN-13 9781789950069

Length 490 pages

Edition 1st Edition

Languages Python

Tools Jupyter

Concepts Data Analysis

Author David Taieb



Deep diving into a concrete example

Early on, we wanted to build a data pipeline that extracted insights from Twitter by doing sentiment analysis of tweets containing specific hashtags and to deploy the results to a real-time dashboard. This application was a perfect starting point for us, because the data science analytics were not too complex, and the application covered many aspects of a real-life scenario:

High-volume, high-throughput streaming data
Data enrichment with NLP sentiment analysis
Basic data aggregation
Data visualization
Deployment into a real-time dashboard

To try things out, the first implementation was a simple Python application. It used the tweepy library (a widely used open source Twitter client library for Python: https://pypi.python.org/pypi/tweepy) to connect to Twitter and receive a stream of tweets, and textblob (a simple Python library for basic NLP: https://pypi.python.org/pypi/textblob) to enrich each tweet with a sentiment score.

The results were then saved into a JSON file for analysis. This prototype was a great way to get started and experiment quickly, but after a few iterations we realized that we needed to get serious and build an architecture that satisfied our enterprise requirements.
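The enrichment and save steps of such a prototype might look roughly like the sketch below. It is not the original application's code: the function names (`textblob_polarity`, `enrich`, `save_json`), the dictionary shape of a tweet (a `"text"` key), and the positive/negative/neutral labels are all assumptions made for illustration. The tweet source is left as a plain iterable; in the prototype described above it would be fed by a tweepy stream, which requires Twitter credentials and is omitted here.

```python
import json
from typing import Callable, Dict, Iterable, Iterator, List


def textblob_polarity(text: str) -> float:
    """Score text with textblob; polarity ranges from -1.0 to 1.0."""
    # Imported lazily so the rest of the sketch works without textblob installed.
    from textblob import TextBlob

    return TextBlob(text).sentiment.polarity


def enrich(
    tweets: Iterable[Dict],
    scorer: Callable[[str], float] = textblob_polarity,
) -> Iterator[Dict]:
    """Yield a copy of each tweet with sentiment fields added.

    Assumption: each tweet is a dict with a "text" key. The label
    thresholds below are illustrative, not taken from the source.
    """
    for tweet in tweets:
        polarity = scorer(tweet["text"])
        if polarity > 0:
            label = "positive"
        elif polarity < 0:
            label = "negative"
        else:
            label = "neutral"
        yield {**tweet, "polarity": polarity, "sentiment": label}


def save_json(records: Iterable[Dict], path: str) -> List[Dict]:
    """Write all enriched records to a single JSON file and return them."""
    data = list(records)
    with open(path, "w", encoding="utf-8") as f:
        json.dump(data, f, ensure_ascii=False, indent=2)
    return data
```

With textblob installed, calling `save_json(enrich(tweets), "tweets.json")` on any iterable of tweet dictionaries would produce the JSON file for later analysis. Passing the scorer as a parameter keeps the enrichment step easy to test and to swap out, which matters once a prototype like this has to grow into a larger architecture.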
