Packt+ | Advance your knowledge in tech

You're reading from Practical Data Analysis For small businesses, analyzing the information contained in their data using open source technology could be game-changing. All you need is some basic programming and mathematical skills to do just that.

Product type Paperback

Published in Oct 2013

Publisher Packt

ISBN-13 9781783280995

Length 360 pages

Edition 1st Edition

Languages

Python

Tools

NLTK

Concepts

Data Analysis

Author (1):

Hector Cuesta

View More author details

Table of Contents (24) Chapters

Practical Data Analysis

Credits

Foreword

About the Author

Acknowledgments

About the Reviewers

www.PacktPub.com

Preface

1. Getting Started FREE CHAPTER

2. Working with Data

3. Data Visualization

4. Text Classification

5. Similarity-based Image Retrieval

6. Simulation of Stock Prices

7. Predicting Gold Prices

8. Working with Support Vector Machines

9. Modeling Infectious Disease with Cellular Automata

10. Working with Social Graphs

11. Sentiment Analysis of Twitter Data

12. Data Processing and Aggregation with MongoDB

13. Working with MapReduce

14. Online Data Analysis with IPython and Wakari

Setting Up the Infrastructure

Index

Sentiment classification

In sentiment classification, one message can be classified as either positive or negative. This is excellent to get insight about how the public think about a person, products, or services.

In this chapter, we will classify the tweets to get a personal positive or negative feeling. It is important to clarify that tweets are limited to 140-characters length with a very casual language and in many cases the message may be very noisy with usernames, links, repeated letters, and emoticons. However, Twitter provides a way to get feedback about large amount of topics in real-time. We can see sample tweets as follows:

"Photoshop, I hate it when you crash " - Negative
"@Ms_HipHop im glad ur doing weeeell " - Positive

The general process of the sentiment classification is presented in the following screenshot. We start extracting the features (words) from the training data (Text Corpus). Then, we need to train the classifier with a bag of words, which is a list of words and...

The rest of the chapter is locked

You're reading from Practical Data Analysis For small businesses, analyzing the information contained in their data using open source technology could be game-changing. All you need is some basic programming and mathematical skills to do just that.

Table of Contents (24) Chapters

Sentiment classification

Authors (1)

Personalised recommendations for you

You're reading from Practical Data Analysis For small businesses, analyzing the information contained in their data using open source technology could be game-changing. All you need is some basic programming and mathematical skills to do just that.

Table of Contents (24) Chapters

Sentiment classification

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you