You're reading from Network Science with Python Explore the networks around us using network science, social network analysis, and machine learning

Product type Paperback

Published in Feb 2023

Publisher Packt

ISBN-13 9781801073691

Length 414 pages

Edition 1st Edition

Languages

Python

Tools

PyTorch

Concepts

Machine Learning

Author (1):

David Knickerbocker

View More author details

Table of Contents (17) Chapters

Preface

1. Part 1: Getting Started with Natural Language Processing and Networks

2. Chapter 1: Introducing Natural Language Processing FREE CHAPTER

3. Chapter 2: Network Analysis

4. Chapter 3: Useful Python Libraries

5. Part 2: Graph Construction and Cleanup

6. Chapter 4: NLP and Network Synergy

7. Chapter 5: Even Easier Scraping!

8. Chapter 6: Graph Construction and Cleaning

9. Part 3: Network Science and Social Network Analysis

10. Chapter 7: Whole Network Analysis

11. Chapter 8: Egocentric Network Analysis

12. Chapter 9: Community Detection

13. Chapter 10: Supervised Machine Learning on Network Data

14. Chapter 11: Unsupervised Machine Learning on Network Data

15. Index

Why subscribe?

16. Other Books You May Enjoy

Preparing the data

We should do a few more data checks. Most importantly, let’s check the balance between classes in the training data:

Start with the following code:

clf_df['label'].value_counts()

…

0    66

1    11

Name: label, dtype: int64

The data is imbalanced, but not too badly.

Let’s get this in percentage form, just to make this a little easier to understand:

clf_df['label'].value_counts(normalize=True)

…

0    0.857143

1    0.142857

Name: label, dtype: float64

It looks like we have about an 86/14 balance between the classes. Not awful. Keep this in mind, because the model should be able to predict with about 86% accuracy just based on the imbalance alone. It won’t be an impressive model at all if it only hits 86%.

Next, we need to cut up our data for our model. We will use the features as our X data, and the answers...

The rest of the chapter is locked

You're reading from Network Science with Python Explore the networks around us using network science, social network analysis, and machine learning

Table of Contents (17) Chapters

Preparing the data

Unlock this book and the full library FREE for 7 days

Authors (1)