Packt+ | Advance your knowledge in tech

You're reading from Scala Machine Learning Projects Build real-world machine learning and deep learning projects with Scala

Product type Paperback

Published in Jan 2018

Publisher Packt

ISBN-13 9781788479042

Length 470 pages

Edition 1st Edition

Languages

Scala

Tools

Apache Spark

Concepts

Deep Learning

Author (1):

Md. Rezaul Karim

View More author details

Table of Contents (13) Chapters

Preface

1. Analyzing Insurance Severity Claims FREE CHAPTER

2. Analyzing and Predicting Telecommunication Churn

3. High Frequency Bitcoin Price Prediction from Historical and Live Data

4. Population-Scale Clustering and Ethnicity Prediction

5. Topic Modeling - A Better Insight into Large-Scale Texts

6. Developing Model-based Movie Recommendation Engines

7. Options Trading Using Q-learning and Scala Play Framework

8. Clients Subscription Assessment for Bank Telemarketing using Deep Neural Networks

9. Fraud Analytics Using Autoencoders and Anomaly Detection

10. Human Activity Recognition using Recurrent Neural Networks

11. Image Classification using Convolutional Neural Networks

12. Other Books You May Enjoy

Leave a review - let other readers know what you think

Summary

In this chapter, we have used a dataset having more than 284,807 instances of credit card use and for each transaction where only 0.172% transactions are fraudulent. We have seen how we can use autoencoders to pre-train a classification model and how to apply anomaly detection techniques to predict possible fraudulent transactions from highly imbalanced data—that is, we expected our fraudulent cases to be anomalies within the whole dataset.

Our final model now correctly identified 83% of fraudulent cases and almost 100% of non-fraudulent cases. Nevertheless, we have seen how to use anomaly detection using outliers, some ways of hyperparameter tuning, and, most importantly, feature selection.

A recurrent neural network (RNN) is a class of artificial neural network where connections between units form a directed cycle. RNNs make use of information from the past. That way, they can make predictions in data with high temporal dependencies. This creates an internal state of the network...