Packt+ | Advance your knowledge in tech

You're reading from Mastering Clojure Data Analysis If you'd like to apply your Clojure skills to performing data analysis, this is the book for you. The example based approach aids fast learning and covers basic to advanced topics. Get deeper into your data.

Product type Paperback

Published in May 2014

Publisher

ISBN-13 9781783284139

Length 340 pages

Edition Edition

Languages

Clojure

Concepts

Data Analysis

Author (1):

Eric Richard Rochester

View More author details

Table of Contents (17) Chapters

Mastering Clojure Data Analysis

Credits

About the Author

About the Reviewers

www.PacktPub.com

Preface

1. Network Analysis – The Six Degrees of Kevin Bacon FREE CHAPTER

2. GIS Analysis – Mapping Climate Change

3. Topic Modeling – Changing Concerns in the State of the Union Addresses

4. Classifying UFO Sightings

5. Benford's Law – Detecting Natural Progressions of Numbers

6. Sentiment Analysis – Categorizing Hotel Reviews

7. Null Hypothesis Tests – Analyzing Crime Data

8. A/B Testing – Statistical Experiments for the Web

9. Analyzing Social Data Participation

10. Modeling Stock Data

Index

Failing Benford's Law

So far, we've seen several datasets, all of which conform to Benford's Law, most of them quite strongly. We haven't yet seen a dataset that does not conform to this distribution of initial digits. What would a failing dataset look like?

There are many ways in which we could get data that doesn't conform. Any linear data, for example, would have a more uniform distribution of the initial digits. However, we can also simulate fraudulent data easily, and in the process, we can learn just how much noise a dataset can handle before Benford's Law begins to have trouble with it.

We'll start this experiment with the population data that we looked at earlier. We'll progressively introduce more and more junk into the dataset. We'll randomly replace items in the dataset with a random value and re-run incanter.stats/benford-test on it. When it finally fails, we can note how many items we've replaced and how far off the new distribution is.

The primary function is shown as follows...

The rest of the chapter is locked

You're reading from Mastering Clojure Data Analysis If you'd like to apply your Clojure skills to performing data analysis, this is the book for you. The example based approach aids fast learning and covers basic to advanced topics. Get deeper into your data.

Table of Contents (17) Chapters

Failing Benford's Law

Authors (1)

Personalised recommendations for you

You're reading from Mastering Clojure Data Analysis If you'd like to apply your Clojure skills to performing data analysis, this is the book for you. The example based approach aids fast learning and covers basic to advanced topics. Get deeper into your data.

Table of Contents (17) Chapters

Failing Benford's Law

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you