Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Mastering Numerical Computing with NumPy Master scientific computing and perform complex operations with ease

Product type Paperback

Published in Jun 2018

Publisher Packt

ISBN-13 9781788993357

Length 248 pages

Edition 1st Edition

Languages

Python

Tools

NumPy

Concepts

Scientific Computing

Authors (3):

Tiago Antao

Mert Cuhadaroglu

Umit Mert Cakmak

View More author details

Table of Contents (11) Chapters

Preface

1. Working with NumPy Arrays FREE CHAPTER

2. Linear Algebra with NumPy

3. Exploratory Data Analysis of Boston Housing Data with NumPy Statistics

4. Predicting Housing Prices Using Linear Regression

5. Clustering Clients of a Wholesale Distributor Using NumPy

6. NumPy, SciPy, Pandas, and Scikit-Learn

7. Advanced Numpy

8. Overview of High-Performance Numerical Computing Libraries

9. Performance Benchmarks

10. Other Books You May Enjoy

Leave a review - let other readers know what you think

Modifying our algorithm

Now you have understood the internal of k-means on a single variable, you can extend this implementation to multiple variables and apply it to a more realistic dataset.

The dataset to be used in this section is from the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/wholesale+customers), and it includes the client information of wholesales distributor. There 440 customers with eight features. In the following list, first six features are related to annual spending for corresponding products, seventh feature shows the channel that this product is bought and the eighth feature shows the region:

FRESH
MILK
GROCERY
FROZEN
DETERGENTS_PAPER
DELICATESSEN
CHANNEL
REGION

First download the dataset and read the it as a numpy array:

from numpy import genfromtxt
wholesales_data = genfromtxt('Wholesale customers data.csv', delimiter...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (3)

Mert Cakmak

Umit Mert Cakmak is a data scientist at IBM, where he excels at helping clients solve complex data science problems, from inception to delivery of deployable assets. His research spans multiple disciplines beyond his industry and he likes sharing his insights at conferences, universities, and meet-ups.

See other products by Mert Cakmak

Tiago Antao

Tiago Antao is a bioinformatician currently working in the field of genomics. A former computer scientist, Tiago moved into computational biology with an MSc in Bioinformatics from the Faculty of Sciences at the University of Porto (Portugal) and a PhD on the spread of drug-resistant malaria from the Liverpool School of Tropical Medicine (UK). Postdoctoral, Tiago has worked with human datasets at the University of Cambridge (UK) and with mosquito whole genome sequencing data at the University of Oxford (UK), before helping to set up the bioinformatics infrastructure at the University of Montana. He currently works as a data engineer in the biotechnology field in Boston, MA. He is one of the co-authors of Biopython, a major bioinformatics package written in Python.

See other products by Tiago Antao

Cuhadaroglu

Mert Cuhadaroglu is a BI Developer in EPAM, developing E2E analytics solutions for complex business problems in various industries, mostly investment banking, FMCG, media, communication, and pharma. He consistently uses advanced statistical models and ML algorithms to provide actionable insights. Throughout his career, he has worked in several other industries, such as banking and asset management. He continues his academic research in AI for trading algorithms.

See other products by Cuhadaroglu