Packt+ | Advance your knowledge in tech

You're reading from Mastering Java Machine Learning A Java developer's guide to implementing machine learning and big data architectures

Product type Paperback

Published in Jul 2017

Publisher Packt

ISBN-13 9781785880513

Length 556 pages

Edition 1st Edition

Languages

Java

Concepts

Big Data

Authors (2):

Uday Kamath

Krishna Choppella

View More author details

Table of Contents (13) Chapters

Preface

1. Machine Learning Review

2. Practical Approach to Real-World Supervised Learning FREE CHAPTER

3. Unsupervised Machine Learning Techniques

4. Semi-Supervised and Active Learning

5. Real-Time Stream Machine Learning

6. Probabilistic Graph Modeling

7. Deep Learning

8. Text Mining and Natural Language Processing

9. Big Data Machine Learning – The Final Frontier

A. Linear Algebra

B. Probability

Index

Limitations of neural networks

In this section, we will discuss in detail the issues faced by neural networks, which will become the stepping stone for building deep learning networks.

Vanishing gradients, local optimum, and slow training

One of the major issues with neural networks is the problem of "vanishing gradient" (References [8]). We will try to give a simple explanation of the issue rather than exploring the mathematical derivations in depth. We will choose the sigmoid activation function and a two-layer neural network, as shown in the following figure, to demonstrate the issue:

Figure 5: Vanishing Gradient issue.

As we saw in the activation function description, the sigmoid function squashes the output between the range 0 and 1. The derivative of the sigmoid function g'(a) = g(a)(1 – g(a)) has a range between 0 and 0.25. The goal of learning is to minimize the output loss, that is, . In general, the output error does not go to 0, so maximum iterations; a user-specified parameter determines...

The rest of the chapter is locked

Tech Concepts

Programming languages

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Uday Kamath

Dr. Uday Kamath is the chief data scientist at BAE Systems Applied Intelligence. He specializes in scalable machine learning and has spent 20 years in the domain of AML, fraud detection in financial crime, cyber security, and bioinformatics, to name a few. Dr. Kamath is responsible for key products in areas focusing on the behavioral, social networking and big data machine learning aspects of analytics at BAE AI. He received his PhD at George Mason University, under the able guidance of Dr. Kenneth De Jong, where his dissertation research focused on machine learning for big data and automated sequence mining.

See other products by Uday Kamath

Krishna Choppella

Krishna Choppella builds tools and client solutions in his role as a solutions architect for analytics at BAE Systems Applied Intelligence. He has been programming in Java for 20 years. His interests are data science, functional programming, and distributed computing.

See other products by Krishna Choppella