You're reading from Network Science with Python Explore the networks around us using network science, social network analysis, and machine learning

Product type Paperback

Published in Feb 2023

Publisher Packt

ISBN-13 9781801073691

Length 414 pages

Edition 1st Edition

Languages

Python

Tools

PyTorch

Concepts

Machine Learning

Author (1):

David Knickerbocker

View More author details

Table of Contents (17) Chapters

Preface

1. Part 1: Getting Started with Natural Language Processing and Networks

2. Chapter 1: Introducing Natural Language Processing FREE CHAPTER

3. Chapter 2: Network Analysis

4. Chapter 3: Useful Python Libraries

5. Part 2: Graph Construction and Cleanup

6. Chapter 4: NLP and Network Synergy

7. Chapter 5: Even Easier Scraping!

8. Chapter 6: Graph Construction and Cleaning

9. Part 3: Network Science and Social Network Analysis

10. Chapter 7: Whole Network Analysis

11. Chapter 8: Egocentric Network Analysis

12. Chapter 9: Community Detection

13. Chapter 10: Supervised Machine Learning on Network Data

14. Chapter 11: Unsupervised Machine Learning on Network Data

15. Index

Why subscribe?

16. Other Books You May Enjoy

Data preparation and feature engineering

Before we can use ML, we first need to collect our data and convert it into a format that the model can use. We can’t just feed the graph G to Random Forest and call it a day. We could feed a graph’s adjacency matrix and a set of labels to Random Forest and it’d work, but I want to showcase some feature engineering that we can do.

Feature engineering is using domain knowledge to create additional features (most call them columns) that will be useful for our models. For instance, looking at the networks from the previous section, if we want to be able to spot the revolutionaries, then we may want to give our model additional data such as each node’s number of degrees (connections), betweenness centrality, closeness centrality, page rank, clustering, and triangles:

Let’s start by first building our network. This should be easy by now, as we have done this several times:
```
import networkx as nx
```
```
import pandas...
```

The rest of the chapter is locked

You're reading from Network Science with Python Explore the networks around us using network science, social network analysis, and machine learning

Table of Contents (17) Chapters

Data preparation and feature engineering

Unlock this book and the full library FREE for 7 days

Authors (1)