Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Practical Machine Learning Cookbook

You're reading from   Practical Machine Learning Cookbook Supervised and unsupervised machine learning simplified

Arrow left icon
Product type Paperback
Published in Apr 2017
Publisher Packt
ISBN-13 9781785280511
Length 570 pages
Edition 1st Edition
Languages
Arrow right icon
Author (1):
Arrow left icon
Atul Tripathi Atul Tripathi
Author Profile Icon Atul Tripathi
Atul Tripathi
Arrow right icon
View More author details
Toc

Table of Contents (15) Chapters Close

Preface 1. Introduction to Machine Learning FREE CHAPTER 2. Classification 3. Clustering 4. Model Selection and Regularization 5. Nonlinearity 6. Supervised Learning 7. Unsupervised Learning 8. Reinforcement Learning 9. Structured Prediction 10. Neural Networks 11. Deep Learning 12. Case Study - Exploring World Bank Data 13. Case Study - Pricing Reinsurance Contracts 14. Case Study - Forecast of Electricity Consumption

Decision tree learning - income-based distribution of real estate values

Income has been an essential component of the attractive long-term total returns provided by real estate as an asset class. The annual income returns generated from investing in real estate have been more than 2.5 times higher than stocks and lagged bonds by only 50 basis points. Real estate often provides a steady source of income based on the rent paid by tenants.

Getting ready

In order to perform decision tree classification, we will be using a dataset collected from the real estate dataset.

Step 1 - collecting and describing the data

The dataset titled RealEstate.txt will be used. This dataset is available in TXT format, titled RealEstate.txt. The dataset is in standard format. There are 20,640 rows of data. The 9 numerical variables are as follows:

  • MedianHouseValue
  • MedianIncome
  • MedianHouseAge
  • TotalRooms
  • TotalBedrooms
  • Population
  • Households
  • Latitude
  • Longitude

How to do it...

Let's get into the details.

Step 2 - exploring...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image