Packt+ | Advance your knowledge in tech

You're reading from Scala Machine Learning Projects Build real-world machine learning and deep learning projects with Scala

Product type Paperback

Published in Jan 2018

Publisher Packt

ISBN-13 9781788479042

Length 470 pages

Edition 1st Edition

Languages

Scala

Tools

Apache Spark

Concepts

Deep Learning

Author (1):

Md. Rezaul Karim

View More author details

Table of Contents (13) Chapters

Preface

1. Analyzing Insurance Severity Claims

2. Analyzing and Predicting Telecommunication Churn FREE CHAPTER

3. High Frequency Bitcoin Price Prediction from Historical and Live Data

4. Population-Scale Clustering and Ethnicity Prediction

5. Topic Modeling - A Better Insight into Large-Scale Texts

6. Developing Model-based Movie Recommendation Engines

7. Options Trading Using Q-learning and Scala Play Framework

8. Clients Subscription Assessment for Bank Telemarketing using Deep Neural Networks

9. Fraud Analytics Using Autoencoders and Anomaly Detection

10. Human Activity Recognition using Recurrent Neural Networks

11. Image Classification using Convolutional Neural Networks

12. Other Books You May Enjoy

Leave a review - let other readers know what you think

A simple Q-learning implementation

Q-learning is an algorithm that can be used in financial and market trading applications, such as options trading. One reason is that the best policy is generated through training. that is, RL defines the model in Q-learning over time and is constantly updated with any new episode. Q-learning is a method for optimizing (cumulated) discounted reward, making far-future rewards less prioritized than near-term rewards; Q-learning is a form of model-free RL. It can also be viewed as a method of asynchronous dynamic programming (DP).

It provides agents with the capability of learning to act optimally in Markovian domains by experiencing the consequences of actions, without requiring them to build maps of the domains. In short, Q-learning qualifies as an RL technique because it does not strictly require labeled data and training. Moreover, the Q-value does not have to be a continuous, differentiable function.