You're reading from Practical Discrete Mathematics Discover math principles that fuel algorithms for computer science and machine learning with Python

Product type Paperback

Published in Feb 2021

Publisher Packt

ISBN-13 9781838983147

Length 330 pages

Edition 1st Edition

Languages

Python

Tools

NumPy

Concepts

Data Science

Authors (2):

Ryan T. White

Archana Tikayat Ray

View More author details

Table of Contents (17) Chapters

Preface

1. Part I – Basic Concepts of Discrete Math

2. Chapter 1: Key Concepts, Notation, Set Theory, Relations, and Functions FREE CHAPTER

3. Chapter 2: Formal Logic and Constructing Mathematical Proofs

4. Chapter 3: Computing with Base-n Numbers

5. Chapter 4: Combinatorics Using SciPy

6. Chapter 5: Elements of Discrete Probability

7. Part II – Implementing Discrete Mathematics in Data and Computer Science

8. Chapter 6: Computational Algorithms in Linear Algebra

9. Chapter 7: Computational Requirements for Algorithms

10. Chapter 8: Storage and Feature Extraction of Graphs, Trees, and Networks

11. Chapter 9: Searching Data Structures and Finding Shortest Paths

12. Part III – Real-World Applications of Discrete Mathematics

13. Chapter 10: Regression Analysis with NumPy and Scikit-Learn

14. Chapter 11: Web Searches with PageRank

15. Chapter 12: Principal Component Analysis with Scikit-Learn

16. Other Books You May Enjoy

Leave a review - let other readers know what you think

Applying the Algorithm to Real Data

Let's use our Python implementation of the PageRank algorithm to some larger-scale data. We will use a dataset shared by J. Kleinberg at Cornell by crawling the web to find web pages containing the word California. It is a text file in the following form:

Type Source Destination
n 0 http://www.berkeley.edu/
n 1 http://www.caltech.edu/
…
n 9663 http://www.cs.ucl.ac.uk/external/P.Dourish/hotlist.html
e 0 449
e 0 450
…
e 9663 7907

The first part contains 9,663 web pages that have the word California, and the rest is an adjacency list for the graph representing the "internet" of these 9,663 web pages. For example, take the following line:

e 0 499

This means web page 0 has a link to web page 499. In order to implement PageRank on this dataset, we need to create an adjacency matrix.

Let's use some Python code to read this data file into a pandas DataFrame and display it:

# import the pandas library
import...

The rest of the chapter is locked

You're reading from Practical Discrete Mathematics Discover math principles that fuel algorithms for computer science and machine learning with Python

Table of Contents (17) Chapters

Applying the Algorithm to Real Data

Authors (2)

Other recommended products

Personalised recommendations for you

You're reading from Practical Discrete Mathematics Discover math principles that fuel algorithms for computer science and machine learning with Python

Table of Contents (17) Chapters

Applying the Algorithm to Real Data

Unlock this book and the full library FREE for 7 days

Authors (2)

Other recommended products

Personalised recommendations for you