Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Python Social Media Analytics
Python Social Media Analytics

Python Social Media Analytics: Analyze and visualize data from Twitter, YouTube, GitHub, and more

Arrow left icon
Profile Icon Siddhartha Chatterjee Profile Icon Baihaqi Siregar Profile Icon Michal Krystyanczuk
Arrow right icon
Free Trial
Full star icon Full star icon Full star icon Full star icon Empty star icon 4 (2 Ratings)
Paperback Jul 2017 312 pages 1st Edition
eBook
S$41.98 S$59.99
Paperback
S$74.99
Subscription
Free Trial
Arrow left icon
Profile Icon Siddhartha Chatterjee Profile Icon Baihaqi Siregar Profile Icon Michal Krystyanczuk
Arrow right icon
Free Trial
Full star icon Full star icon Full star icon Full star icon Empty star icon 4 (2 Ratings)
Paperback Jul 2017 312 pages 1st Edition
eBook
S$41.98 S$59.99
Paperback
S$74.99
Subscription
Free Trial
eBook
S$41.98 S$59.99
Paperback
S$74.99
Subscription
Free Trial

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing
Table of content icon View table of contents Preview book icon Preview Book

Python Social Media Analytics

Harnessing Social Data - Connecting, Capturing, and Cleaning

The first step to realize the promise of social data, which we went through in Chapter 1, Introduction to the Latest Social Media Landscape and Importance, is by harnessing it. A proper harnessing strategy can help to remove obstacles to and expedite processing. As we saw in the last chapter, many sources of social data can be used through the Application Protocol Interfaces (APIs) of these platforms. However, the data coming from APIs is not readily usable for multiple cases, hence it requires several steps before the data is ready to be analyzed and then applied. Therefore, we have dedicated a chapter that explains in detail how to do this. We have briefly touched upon the technical notion of an API in the first chapter. Here we intend to go deeper into it and help you to understand its types and usage. We also want...

APIs in a nutshell

An API is the medium that allows the exchange of data points between a service and the programmer or user. API concepts have been widely used in the software industry when we needed different software to exchange data with with another. Mobile and internet applications have been using web services and APIs to enrich information from external sources. Social media also started creating APIs to share their data with third-party application developers. The popularity of data science has made APIs emerge also as a source for mining and knowledge creation. The nature of all social media is different, so are their APIs. The steps involved in making a connection may not differ greatly, but the data points we capture do.

Different types of API

...

Introduction to authentication techniques

Getting data from different APIs requires a good understanding of the data structure and authentication techniques that allow external users to access the resources served by an API. Historically, there were multiple ways of accessing API resources, but nowadays there is one common protocol used by all the main social media networks. When you get into the developer documentation you will most probably encounter the problem of authentication referred to by an enigmatic term, OAuth.

What is OAuth?

OAuth is simply an authorization protocol that allows users to share data with an application without sharing the password. It is a way to obtain a secure authorization scheme based on a token...

Parsing API outputs

The following subchapter will show you how to connect to different social networks and how to retrieve sample data.

Twitter

Twitter proposes three main APIs: the REST API, Streaming API, and the Ads API. We will be focused on the first two APIs, which provide respectively on-demand or stream data.

Creating application

As explained in the section about OAuth, you have to obtain credentials to be able to collect data from Twitter. There are some simple steps to perform this action:

  1. Create a Twitter account or use your existing one.
  2. Go to https://apps...

Basic cleaning techniques

Social media contains different types of data: information about user profiles, statistics (number of likes or number of followers), verbatims, and media. Quantitative data is very convenient for an analysis using statistical and numerical methods, but unstructured data such as user comments is much more challenging. To get meaningful information, one has to perform the whole process of information retrieval. It starts with the definition of the data type and data structure. On social media, unstructured data is related to text, images, videos, and sound and we will mostly deal with textual data. Then, the data has to be cleaned and normalized. Only after all these steps can we delve into the analysis.

Data type and encoding

...

MongoDB to store and access social data

According to Wikipedia:

MongoDB (from humongous) is a cross-platform document-oriented database. Classified as a NoSQL database, MongoDB eschews the traditional table-based relational database structure in favor of JSON-like documents with dynamic schemas (MongoDB calls the format BSON), making the integration of data in certain types of applications easier and faster. Released under a combination of the GNU Affero General Public License and the Apache License, MongoDB is free and open-source software.

Even though the objective of the chapter is not to explore the architecture of MongoDB in depth we are however going to explain the points that justify the use of MongoDB.

Along with ease of use, MongoDB is recognized for the following advantages:

  • Schema-less design: Unlike traditional relational databases, which require the data to fit its...

APIs in a nutshell


An API is the medium that allows the exchange of data points between a service and the programmer or user. API concepts have been widely used in the software industry when we needed different software to exchange data with with another. Mobile and internet applications have been using web services and APIs to enrich information from external sources. Social media also started creating APIs to share their data with third-party application developers. The popularity of data science has made APIs emerge also as a source for mining and knowledge creation. The nature of all social media is different, so are their APIs. The steps involved in making a connection may not differ greatly, but the data points we capture do.

Different types of API

Currently, two types of API are available. They are as follows:

  • RESTful API
  • Stream API

RESTful API

This is the most common type of API that most social media provides. The information from a REST API is static and is from historical data. The...

Introduction to authentication techniques


Getting data from different APIs requires a good understanding of the data structure and authentication techniques that allow external users to access the resources served by an API. Historically, there were multiple ways of accessing API resources, but nowadays there is one common protocol used by all the main social media networks. When you get into the developer documentation you will most probably encounter the problem of authentication referred to by an enigmatic term, OAuth.

What is OAuth?

OAuth is simply an authorization protocol that allows users to share data with an application without sharing the password. It is a way to obtain a secure authorization scheme based on a token-based authorization mechanism. There are two API authentication models using OAuth:

  • User authentication
  • Application authentication

User authentication

This is the most common form of resource authentication implementation. The signed request both identifies an application...

Left arrow icon Right arrow icon
Download code icon Download Code

Key benefits

  • Acquire data from various social media platforms such as Facebook, Twitter, YouTube, GitHub, and more
  • Analyze and extract actionable insights from your social data using various Python tools
  • A highly practical guide to conducting efficient social media analytics at scale

Description

Social Media platforms such as Facebook, Twitter, Forums, Pinterest, and YouTube have become part of everyday life in a big way. However, these complex and noisy data streams pose a potent challenge to everyone when it comes to harnessing them properly and benefiting from them. This book will introduce you to the concept of social media analytics, and how you can leverage its capabilities to empower your business. Right from acquiring data from various social networking sources such as Twitter, Facebook, YouTube, Pinterest, and social forums, you will see how to clean data and make it ready for analytical operations using various Python APIs. This book explains how to structure the clean data obtained and store in MongoDB using PyMongo. You will also perform web scraping and visualize data using Scrappy and Beautifulsoup. Finally, you will be introduced to different techniques to perform analytics at scale for your social data on the cloud, using Python and Spark. By the end of this book, you will be able to utilize the power of Python to gain valuable insights from social media data and use them to enhance your business processes.

Who is this book for?

If you are a programmer or a data analyst familiar with the Python programming language and want to perform analyses of your social data to acquire valuable business insights, this book is for you. The book does not assume any prior knowledge of any data analysis tool or process.

What you will learn

  • Understand the basics of social media mining
  • Use PyMongo to clean, store, and access data in MongoDB
  • Understand user reactions and emotion detection on Facebook
  • Perform Twitter sentiment analysis and entity recognition using Python
  • Analyze video and campaign performance on YouTube
  • Mine popular trends on GitHub and predict the next big technology
  • Extract conversational topics on public internet forums
  • Analyze user interests on Pinterest
  • Perform large-scale social media analytics on the cloud

Product Details

Country selected
Publication date, Length, Edition, Language, ISBN-13
Publication date : Jul 28, 2017
Length: 312 pages
Edition : 1st
Language : English
ISBN-13 : 9781787121485
Category :
Languages :
Concepts :
Tools :

What do you get with a Packt Subscription?

Free for first 7 days. $19.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing

Product Details

Publication date : Jul 28, 2017
Length: 312 pages
Edition : 1st
Language : English
ISBN-13 : 9781787121485
Category :
Languages :
Concepts :
Tools :

Packt Subscriptions

See our plans and pricing
Modal Close icon
$19.99 billed monthly
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Simple pricing, no contract
$199.99 billed annually
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just S$6 each
Feature tick icon Exclusive print discounts
$279.99 billed in 18 months
Feature tick icon Unlimited access to Packt's library of 7,000+ practical books and videos
Feature tick icon Constantly refreshed with 50+ new titles a month
Feature tick icon Exclusive Early access to books as they're written
Feature tick icon Solve problems while you work with advanced search and reference features
Feature tick icon Offline reading on the mobile app
Feature tick icon Choose a DRM-free eBook or Video every month to keep
Feature tick icon PLUS own as many other DRM-free eBooks or Videos as you like for just S$6 each
Feature tick icon Exclusive print discounts

Frequently bought together


Stars icon
Total S$ 194.97
Learning Data Mining with Python
S$66.99
Python Web Scraping
S$52.99
Python Social Media Analytics
S$74.99
Total S$ 194.97 Stars icon

Table of Contents

9 Chapters
Introduction to the Latest Social Media Landscape and Importance Chevron down icon Chevron up icon
Harnessing Social Data - Connecting, Capturing, and Cleaning Chevron down icon Chevron up icon
Uncovering Brand Activity, Popularity, and Emotions on Facebook Chevron down icon Chevron up icon
Analyzing Twitter Using Sentiment Analysis and Entity Recognition Chevron down icon Chevron up icon
Campaigns and Consumer Reaction Analytics on YouTube – Structured and Unstructured Chevron down icon Chevron up icon
The Next Great Technology – Trends Mining on GitHub Chevron down icon Chevron up icon
Scraping and Extracting Conversational Topics on Internet Forums Chevron down icon Chevron up icon
Demystifying Pinterest through Network Analysis of Users Interests Chevron down icon Chevron up icon
Social Data Analytics at Scale – Spark and Amazon Web Services Chevron down icon Chevron up icon

Customer reviews

Rating distribution
Full star icon Full star icon Full star icon Full star icon Empty star icon 4
(2 Ratings)
5 star 50%
4 star 0%
3 star 50%
2 star 0%
1 star 0%
Neha Aug 14, 2018
Full star icon Full star icon Full star icon Full star icon Full star icon 5
A really good book for a fresher who wants to study social meadia analytics.
Amazon Verified review Amazon
casatishum Dec 12, 2018
Full star icon Full star icon Full star icon Empty star icon Empty star icon 3
some codes no longer works since social media tend to change fast. on top of that, there're also errors in code, which makes it confusing and frustrating to follow. not recommending.
Amazon Verified review Amazon
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

What is included in a Packt subscription? Chevron down icon Chevron up icon

A subscription provides you with full access to view all Packt and licnesed content online, this includes exclusive access to Early Access titles. Depending on the tier chosen you can also earn credits and discounts to use for owning content

How can I cancel my subscription? Chevron down icon Chevron up icon

To cancel your subscription with us simply go to the account page - found in the top right of the page or at https://subscription.packtpub.com/my-account/subscription - From here you will see the ‘cancel subscription’ button in the grey box with your subscription information in.

What are credits? Chevron down icon Chevron up icon

Credits can be earned from reading 40 section of any title within the payment cycle - a month starting from the day of subscription payment. You also earn a Credit every month if you subscribe to our annual or 18 month plans. Credits can be used to buy books DRM free, the same way that you would pay for a book. Your credits can be found in the subscription homepage - subscription.packtpub.com - clicking on ‘the my’ library dropdown and selecting ‘credits’.

What happens if an Early Access Course is cancelled? Chevron down icon Chevron up icon

Projects are rarely cancelled, but sometimes it's unavoidable. If an Early Access course is cancelled or excessively delayed, you can exchange your purchase for another course. For further details, please contact us here.

Where can I send feedback about an Early Access title? Chevron down icon Chevron up icon

If you have any feedback about the product you're reading, or Early Access in general, then please fill out a contact form here and we'll make sure the feedback gets to the right team. 

Can I download the code files for Early Access titles? Chevron down icon Chevron up icon

We try to ensure that all books in Early Access have code available to use, download, and fork on GitHub. This helps us be more agile in the development of the book, and helps keep the often changing code base of new versions and new technologies as up to date as possible. Unfortunately, however, there will be rare cases when it is not possible for us to have downloadable code samples available until publication.

When we publish the book, the code files will also be available to download from the Packt website.

How accurate is the publication date? Chevron down icon Chevron up icon

The publication date is as accurate as we can be at any point in the project. Unfortunately, delays can happen. Often those delays are out of our control, such as changes to the technology code base or delays in the tech release. We do our best to give you an accurate estimate of the publication date at any given time, and as more chapters are delivered, the more accurate the delivery date will become.

How will I know when new chapters are ready? Chevron down icon Chevron up icon

We'll let you know every time there has been an update to a course that you've bought in Early Access. You'll get an email to let you know there has been a new chapter, or a change to a previous chapter. The new chapters are automatically added to your account, so you can also check back there any time you're ready and download or read them online.

I am a Packt subscriber, do I get Early Access? Chevron down icon Chevron up icon

Yes, all Early Access content is fully available through your subscription. You will need to have a paid for or active trial subscription in order to access all titles.

How is Early Access delivered? Chevron down icon Chevron up icon

Early Access is currently only available as a PDF or through our online reader. As we make changes or add new chapters, the files in your Packt account will be updated so you can download them again or view them online immediately.

How do I buy Early Access content? Chevron down icon Chevron up icon

Early Access is a way of us getting our content to you quicker, but the method of buying the Early Access course is still the same. Just find the course you want to buy, go through the check-out steps, and you’ll get a confirmation email from us with information and a link to the relevant Early Access courses.

What is Early Access? Chevron down icon Chevron up icon

Keeping up to date with the latest technology is difficult; new versions, new frameworks, new techniques. This feature gives you a head-start to our content, as it's being created. With Early Access you'll receive each chapter as it's written, and get regular updates throughout the product's development, as well as the final course as soon as it's ready.We created Early Access as a means of giving you the information you need, as soon as it's available. As we go through the process of developing a course, 99% of it can be ready but we can't publish until that last 1% falls in to place. Early Access helps to unlock the potential of our content early, to help you start your learning when you need it most. You not only get access to every chapter as it's delivered, edited, and updated, but you'll also get the finalized, DRM-free product to download in any format you want when it's published. As a member of Packt, you'll also be eligible for our exclusive offers, including a free course every day, and discounts on new and popular titles.