The random forest (RF) is a very powerful technique used frequently in the data science field to solve problems across industries, and it is often a silver bullet for winning competitions like Kaggle. We will cover its concepts from the basics to in-depth detail in the next chapter; here we restrict ourselves to the bare necessities. A random forest is an ensemble of decision trees. As we know, logistic regression is a high-bias, low-variance technique; decision trees, on the other hand, have high variance and low bias, which makes them unstable. By averaging many decision trees, we minimize the variance component of the model, bringing it closer to an ideal model.
RF focuses on sampling both observations and variables of the training data to develop independent decision trees, then takes a majority vote for classification (or the average of the trees' predictions for regression).
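The idea above can be sketched with scikit-learn's `RandomForestClassifier`; the synthetic dataset and parameter values here are illustrative choices, not ones from the text:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Illustrative synthetic classification dataset
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# Each tree is grown on a bootstrap sample of the rows, and only a
# random subset of features (here sqrt of the total) is considered
# at each split; class predictions are aggregated across the trees.
rf = RandomForestClassifier(n_estimators=100, max_features="sqrt",
                            random_state=42)
rf.fit(X_train, y_train)
accuracy = rf.score(X_test, y_test)
print(accuracy)
```

Because every tree sees a different bootstrap sample and different candidate features at each split, the trees are decorrelated, which is what makes averaging them reduce variance.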