Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Machine Learning in Java Helpful techniques to design, build, and deploy powerful machine learning applications in Java

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781788474399

Length 300 pages

Edition 2nd Edition

Languages

Java

Tools

JAVA-ML

Concepts

Machine Learning

Authors (2):

Ashish Bhatia

Bostjan Kaluza

View More author details

Table of Contents (13) Chapters

Preface

1. Applied Machine Learning Quick Start FREE CHAPTER

2. Java Libraries and Platforms for Machine Learning

3. Basic Algorithms - Classification, Regression, and Clustering

4. Customer Relationship Prediction with Ensembles

5. Affinity Analysis

6. Recommendation Engines with Apache Mahout

7. Fraud and Anomaly Detection

8. Image Recognition with Deeplearning4j

9. Activity Recognition with Mobile Phone Sensors

10. Text Mining with Mallet - Topic Modeling and Spam Detection

11. What Is Next?

12. Other Books You May Enjoy

Leave a review - let other readers know what you think

Basic Naive Bayes classifier baseline

As per the rules of the challenge, the participants had to outperform the basic Naive Bayes classifier in order to qualify for prizes, which makes an assumption that features are independent (refer to Chapter 1, Applied Machine Learning Quick Start).

The KDD Cup organizers ran the vanilla Naive Bayes classifier, without any feature selection or hyperparameter adjustments. For the large dataset, the overall scores of the Naive Bayes on the test set were as follows:

Churn problem: AUC = 0.6468
Appetency problem: AUC = 0.6453
Upselling problem: AUC=0.7211

Note that the baseline results are only reported for the large dataset. Moreover, while both the training and testing datasets are provided at the KDD Cup site, the actual true labels for the test set are not provided. Therefore, when we process the data with our models, there is no way to...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Ashish Bhatia

See other products by Ashish Bhatia

Bostjan Kaluza

Bostjan Kaluza is a researcher in artificial intelligence and machine learning with extensive experience in Java and Python. Bostjan is the chief data scientist at Evolven, a leading IT operations analytics company. He works with machine learning, predictive analytics, pattern mining, and anomaly detection to turn data into relevant information. Prior to Evolven, Bostjan served as a senior researcher in the department of intelligent systems at the Jozef Stefan Institute and led research projects involving pattern and anomaly detection, ubiquitous computing, and multi-agent systems. In 2013, Bostjan published his first book, Instant Weka How-To, published by Packt Publishing, exploring how to leverage machine learning using Weka.

See other products by Bostjan Kaluza