You're reading from Data Science for Web3 A comprehensive guide to decoding blockchain data with data analysis basics and machine learning cases

Product type Paperback

Published in Dec 2023

Publisher Packt

ISBN-13 9781837637546

Length 344 pages

Edition 1st Edition

Languages

Python

Tools

Blockchain

Concepts

Blockchain

Author (1):

Gabriela Castillo Areco

View More author details

Table of Contents (23) Chapters

Preface

1. Part 1 Web3 Data Analysis Basics

2. Chapter 1: Where Data and Web3 Meet FREE CHAPTER

3. Chapter 2: Working with On-Chain Data

4. Chapter 3: Working with Off-Chain Data

5. Chapter 4: Exploring the Digital Uniqueness of NFTs – Games, Art, and Identity

6. Chapter 5: Exploring Analytics on DeFi

7. Part 2 Web3 Machine Learning Cases

8. Chapter 6: Preparing and Exploring Our Data

9. Chapter 7: A Primer on Machine Learning and Deep Learning

10. Chapter 8: Sentiment Analysis – NLP and Crypto News

11. Chapter 9: Generative Art for NFTs

12. Chapter 10: A Primer on Security and Fraud Detection

13. Chapter 11: Price Prediction with Time Series

14. Chapter 12: Marketing Discovery with Graphs

15. Part 3 Appendix

16. Chapter 13: Building Experience with Crypto Data – BUIDL

17. Chapter 14: Interviews with Web3 Data Leaders

18. Index

Why subscribe?

19. Other Books You May Enjoy

Appendix 1

1. Appendix 2

2. Appendix 3

Building our pipeline

In an NLP pipeline, preparation generally encompasses a pre-processing step where we clean and normalize the data. Following that, a feature representation step translates the language into input that can be consumed by our chosen models. Once this is completed, we are ready to build, train, and evaluate the model. This strategic plan will be implemented throughout the subsequent sections.

Preparation

Language manifests in numerous variations. There are formatting nuances, such as capitalization or punctuation; words that serve as linguistic aids without true semantic meaning, such as prepositions; and special characters, including emojis, further enrich the landscape. To work with this data, we must transform raw text into a dataset while following a similar criterion as numeric datasets. This cleaning process enables us to eliminate outliers, reduce noise, manage vocabulary size, and optimize data for ingestion by NLP models.

A basic flow diagram of...

The rest of the chapter is locked

You're reading from Data Science for Web3 A comprehensive guide to decoding blockchain data with data analysis basics and machine learning cases

Table of Contents (23) Chapters

Building our pipeline

Preparation

Authors (1)

Personalised recommendations for you

You're reading from Data Science for Web3 A comprehensive guide to decoding blockchain data with data analysis basics and machine learning cases

Table of Contents (23) Chapters

Building our pipeline

Preparation

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you