Tuning LSTM hyperparameters and GRU
Nevertheless, I still believe it is possible to attain close to 100% accuracy by stacking more LSTM layers. The following are the hyperparameters that I would still try to tune to see how the accuracy changes:
// Hyperparameters for the LSTM training
val learningRate = 0.001f
val trainingIters = trainingDataCount * 1000 // Loop 1000 times on the dataset
val batchSize = 1500 // I would set it to 5000 and see the performance
val displayIter = 15000 // To show test set accuracy during training
val numLstmLayer = 3 // Try 5, 7, 9, and so on
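Rather than trying these values one at a time by hand, a simple grid search can evaluate every combination systematically. The following is a minimal sketch, assuming a hypothetical trainAndEvaluate helper that stands in for the actual LSTM training routine and returns the test-set accuracy for a given configuration:

// Hypothetical stand-in for the real training routine; replace its body
// with the actual LSTM training call for this project
def trainAndEvaluate(learningRate: Float, batchSize: Int, numLstmLayer: Int): Double = {
  // ... train the LSTM with these settings and return its test-set accuracy ...
  scala.util.Random.nextDouble() // placeholder result so the sketch runs
}

// Candidate values mirror the comments in the hyperparameter block above
val candidateBatchSizes = Seq(1500, 5000)
val candidateLstmLayers = Seq(3, 5, 7, 9)

// Train one model per combination and record its accuracy
val results = for {
  bs <- candidateBatchSizes
  nl <- candidateLstmLayers
} yield ((bs, nl), trainAndEvaluate(learningRate = 0.001f, batchSize = bs, numLstmLayer = nl))

// Pick the configuration with the highest test-set accuracy
val ((bestBatchSize, bestNumLayers), bestAccuracy) = results.maxBy(_._2)
println(s"Best config: batchSize=$bestBatchSize, numLstmLayer=$bestNumLayers, accuracy=$bestAccuracy")

Each configuration is trained independently, so a loop like this is also easy to parallelize across machines if training time becomes the bottleneck.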
There are many other variants of the LSTM cell. One particularly popular variant is the Gated Recurrent Unit (GRU) cell, which is a slightly more dramatic variation on the LSTM: it combines the forget and input gates into a single update gate, merges the cell state and hidden state, and makes some other changes. The resulting model is simpler than the standard LSTM model and has been growing increasingly popular. This cell was proposed by Kyunghyun Cho et al. in a 2014 paper that also introduced the encoder-decoder network.
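To make this merging concrete, here are the standard GRU update equations (the notation is mine, not taken from the Cho et al. paper): the update gate z_t plays the combined role of the LSTM's forget and input gates, and a single state vector h_t replaces the separate cell and hidden states.

\begin{aligned}
z_t &= \sigma(W_z x_t + U_z h_{t-1} + b_z) && \text{update gate}\\
r_t &= \sigma(W_r x_t + U_r h_{t-1} + b_r) && \text{reset gate}\\
\tilde{h}_t &= \tanh\bigl(W_h x_t + U_h (r_t \odot h_{t-1}) + b_h\bigr) && \text{candidate state}\\
h_t &= (1 - z_t) \odot h_{t-1} + z_t \odot \tilde{h}_t && \text{new state}
\end{aligned}

Because there is no output gate and no separate cell state, a GRU layer has fewer parameters than an LSTM layer of the same width, which is one reason it is often faster to train.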