Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Statistics for Machine Learning Techniques for exploring supervised, unsupervised, and reinforcement learning models with Python and R

Product type Paperback

Published in Jul 2017

Publisher Packt

ISBN-13 9781788295758

Length 442 pages

Edition 1st Edition

Languages

Python

Concepts

Machine Learning

Author (1):

Pratap Dangeti

View More author details

Table of Contents (10) Chapters

Preface

1. Journey from Statistics to Machine Learning FREE CHAPTER

2. Parallelism of Statistics and Machine Learning

3. Logistic Regression Versus Random Forest

4. Tree-Based Machine Learning Models

5. K-Nearest Neighbors and Naive Bayes

6. Support Vector Machines and Neural Networks

7. Recommendation Engines

8. Unsupervised Learning

9. Reinforcement Learning

K-Nearest Neighbors and Naive Bayes

In the previous chapter, we have learned about computationally intensive methods. In contrast, this chapter discusses the simple methods to balance it out! We will be covering the two techniques, called k-nearest neighbors (KNN)and Naive Bayes here. Before touching on KNN, we explained the issue with the curse of dimensionality with a simulated example. Subsequently, breast cancer medical examples have been utilized to predict whether the cancer is malignant or benign using KNN. In the final section of the chapter, Naive Bayes has been explained with spam/ham classification, which also involves the application of the natural language processing (NLP) techniques consisting of the following basic preprocessing and modeling steps:

Punctuation removal
Word tokenization and lowercase conversion
Stopwords removal
Stemming
Lemmatization with POS tagging...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Pratap Dangeti

Pratap Dangeti develops machine learning and deep learning solutions for structured, image, and text data at TCS, analytics and insights, innovation lab in Bangalore. He has acquired a lot of experience in both analytics and data science. He received his master's degree from IIT Bombay in its industrial engineering and operations research program. He is an artificial intelligence enthusiast. When not working, he likes to read about next-gen technologies and innovative methodologies.

See other products by Pratap Dangeti