Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Numpy Beginner's Guide (Update)

You're reading from   Numpy Beginner's Guide (Update) Build efficient, high-speed programs using the high-performance NumPy mathematical library

Arrow left icon
Product type Paperback
Published in Jun 2015
Publisher
ISBN-13 9781785281969
Length 348 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Ivan Idris Ivan Idris
Author Profile Icon Ivan Idris
Ivan Idris
Arrow right icon
View More author details
Toc

Table of Contents (16) Chapters Close

Preface 1. NumPy Quick Start FREE CHAPTER 2. Beginning with NumPy Fundamentals 3. Getting Familiar with Commonly Used Functions 4. Convenience Functions for Your Convenience 5. Working with Matrices and ufuncs 6. Moving Further with NumPy Modules 7. Peeking into Special Routines 8. Assuring Quality with Testing 9. Plotting with matplotlib 10. When NumPy Is Not Enough – SciPy and Beyond 11. Playing with Pygame A. Pop Quiz Answers B. Additional Online Resources C. NumPy Functions' References
Index

Time for action – performing simple statistics


We can use some kind of threshold to weed out outliers, but there is a better way. It is called the median, and it basically picks the middle value of a sorted set of values (see https://www.khanacademy.org/math/probability/descriptive-statistics/central_tendency/e/mean_median_and_mode). One half of the data is below the median and the other half is above it. For example, if we have the values of 1, 2, 3, 4, and 5, then the median will be 3, since it is in the middle.

These are the steps to calculate the median:

  1. Create a new Python script and call it simplestats.py. You already know how to load the data from a CSV file into an array. So, copy that line of code and make sure that it only gets the close price. The code should appear like this:

    c=np.loadtxt('data.csv', delimiter=',', usecols=(6,), unpack=True)
  2. The function that will do the magic for us is called median(). We will call it and print the result immediately. Add the following line of code...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime