Additional NLP and network considerations
This has been a marathon of a chapter. Please bear with me a little longer. I have a few final thoughts that I’d like to express, and then we can conclude this chapter.
Data cleanup
First, if you work with language data, there will always be cleanup. Language is messy and difficult. If you are only comfortable working with pre-cleaned tabular data, this is going to feel chaotic. I love that, as every project gives me a chance to improve my techniques and tactics.
I showed two different approaches for extracting entities: PoS tagging and NER. Both approaches work very well, but consider which one gets us to a clean and useful entity list most quickly and easily. With PoS tagging, we get one token at a time. With NER, we very quickly get to entities, but the models occasionally misbehave or don’t catch everything, so there is always some cleanup with this approach as well.
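To make the contrast concrete, here is a minimal sketch of both approaches using spaCy. The library choice, the model name (en_core_web_sm), and the example sentence are illustrative assumptions on my part, not tied to the earlier examples; any tagger and NER pipeline would show the same trade-off.

```python
# Minimal sketch: PoS tagging vs. NER for entity extraction (assumes spaCy
# and its small English model "en_core_web_sm" are installed).
import spacy

nlp = spacy.load("en_core_web_sm")
doc = nlp("Mark Twain wrote The Adventures of Tom Sawyer in 1876.")

# PoS tagging: one token at a time, so multi-word names arrive in pieces
# and have to be reassembled (and cleaned) by hand.
proper_nouns = [token.text for token in doc if token.pos_ == "PROPN"]
print(proper_nouns)   # e.g. ['Mark', 'Twain', 'Tom', 'Sawyer']

# NER: multi-token entities come out whole, but the model can mislabel
# or miss spans, so a cleanup pass is still needed afterward.
entities = [(ent.text, ent.label_) for ent in doc.ents]
print(entities)       # e.g. [('Mark Twain', 'PERSON'), ('1876', 'DATE'), ...]
```

Either way, the output is a starting point for an entity list, not the finished list itself.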
There is no silver bullet. I want to use whatever...