Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Apache Mahout Essentials

You're reading from   Apache Mahout Essentials Implement top-notch machine learning algorithms for classification, clustering, and recommendations with Apache Mahout

Arrow left icon
Product type Paperback
Published in Jun 2015
Publisher
ISBN-13 9781783554997
Length 164 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Jayani Withanawasam Jayani Withanawasam
Author Profile Icon Jayani Withanawasam
Jayani Withanawasam
Arrow right icon
View More author details
Toc

Machine learning in a nutshell

 

"Machine learning is the most exciting field of all the computer sciences. Sometimes I actually think that machine learning is not only the most exciting thing in computer science, but also the most exciting thing in all of human endeavor."

 
 --Andrew Ng, Associate Professor at Stanford and Chief Scientist of Baidu

Giving a detailed explanation of machine learning is beyond the scope of this book. For this purpose, there are other excellent resources that I have listed here:

  • Machine Learning by Andrew Ng at Coursera (https://www.coursera.org/course/ml)
  • Foundations of Machine Learning (Adaptive Computation and Machine Learning series) by Mehryar Mohri, Afshin Rostamizadeh, and Ameet Talwalker

However, basic machine learning concepts are explained very briefly here, for those who are not familiar with it.

Machine learning is an area of artificial intelligence that focuses on learning from the available data to make predictions on unseen data without explicit programming.

To solve real-world problems using machine learning, we first need to represent the characteristics of the problem domain using features.

Features

A feature is a distinct, measurable, heuristic property of the item of interest being perceived. We need to consider the features that have the greatest potential in discriminating between different categories.

Supervised learning versus unsupervised learning

Let's explain the difference between supervised learning and unsupervised learning using a simple example of pebbles:

Supervised learning versus unsupervised learning
  • Supervised learning: Take a collection of mixed pebbles, as given in the preceding figure, and categorize (label) them as small, medium, and large pebbles. Examples of supervised learning are regression and classification.
  • Unsupervised learning: Here, just group them based on similar sizes but don't label them. An example of unsupervised learning is clustering.

For a machine to perform learning tasks, it requires features such as the diameter and weight of each pebble.

This book will cover how to implement the following machine learning techniques using Apache Mahout:

  • Clustering
  • Classification and regression
  • Recommendations
You have been reading a chapter from
Apache Mahout Essentials
Published in: Jun 2015
Publisher:
ISBN-13: 9781783554997
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime