Technical requirements
In this chapter, we’ll use tools from the libraries that were introduced in Chapter 7 – that is, scikit-learn and Keras. Additionally, we will use NLTK, a Python library for working with human language data. NLTK includes a range of modules and functions that let us perform tasks such as tokenization, stemming, and part-of-speech tagging on our chosen datasets. The library streamlines the preparation of large text datasets so that they’re ready to be fed into machine learning or deep learning models.
If you have not worked with NLTK before, it can be installed with the following code:
pip install nltk
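Once NLTK is installed, a quick way to confirm it works is to try one of the tasks mentioned above. The following sketch uses NLTK's `PorterStemmer`, which ships with the library and requires no extra data downloads (tasks such as `word_tokenize` additionally require downloading resources, e.g. via `nltk.download('punkt')`):

```python
from nltk.stem import PorterStemmer

# Stemming reduces words to their root form; the Porter stemmer
# applies a fixed set of suffix-stripping rules.
stemmer = PorterStemmer()
words = ["running", "flies", "easily"]
stems = [stemmer.stem(w) for w in words]
print(stems)  # ['run', 'fli', 'easili']
```

Note that stems are not always dictionary words – "flies" becomes "fli" – which is expected behavior for rule-based stemmers.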
The documentation for nltk can be found at https://www.nltk.org. Another essential library for text manipulation and cleaning is re, short for regular expression. A regular expression is a sequence of characters that defines a search pattern. Here’s an example...
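As a minimal illustration of a search pattern (the specific example the chapter goes on to use is not shown here), the pattern `\w+` matches runs of word characters, which gives a crude form of tokenization:

```python
import re

# \w+ matches one or more word characters (letters, digits,
# underscores), so punctuation is skipped over.
pattern = r"\w+"
text = "Hello, world! NLP is fun."
tokens = re.findall(pattern, text)
print(tokens)  # ['Hello', 'world', 'NLP', 'is', 'fun']
```

Patterns like this are often combined with `re.sub` to strip unwanted characters from text before it is passed to a tokenizer.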