Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Deep Learning with Apache Spark Build and deploy distributed deep learning applications on Apache Spark

Product type Paperback

Published in Jan 2019

Publisher Packt

ISBN-13 9781788994613

Length 322 pages

Edition 1st Edition

Languages

Java

Tools

Apache Spark

Concepts

Deep Learning

Author (1):

Guglielmo Iozzia

View More author details

Table of Contents (19) Chapters

Preface

1. The Apache Spark Ecosystem FREE CHAPTER

2. Deep Learning Basics

3. Extract, Transform, Load

4. Streaming

5. Convolutional Neural Networks

6. Recurrent Neural Networks

7. Training Neural Networks with Spark

8. Monitoring and Debugging Neural Network Training

9. Interpreting Neural Network Output

10. Deploying on a Distributed System

11. NLP Basics

12. Textual Analysis and Deep Learning

13. Convolution

14. Image Classification

15. What's Next for Deep Learning?

16. Other Books You May Enjoy

Leave a review - let other readers know what you think

Appendix A: Functional Programming in Scala

Functional programming (FP)

1. Appendix B: Image Data Preparation for Spark

Image preprocessing

NLP Basics

In the previous chapter, several topics were covered concerning the undertaking of DL distributed training in a Spark cluster. The concepts presented there are common to any network model. Starting from this chapter, specific use cases for RNNs or LSTMs will be looked at first, and then CNNs will be covered. This chapter starts by introducing the following core concepts of Natural Language Processing (NLP):

Tokenizers
Sentence segmentation
Part-of-speech tagging
Named entity extraction
Chunking
Parsing

The theory behind the concepts in the preceding list will be detailed before finally presenting two complete Scala examples of NLP, one using Apache Spark and the Stanford core NLP library, and the other using the Spark core and the Spark-nlp library (which is built on top of Apache Spark MLLib). The goal of the chapter is to make readers familiar with NLP, before moving...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Iozzia

Guglielmo Iozzia is currently a big data delivery manager at Optum in Dublin. He completed his master's degree in biomedical engineering at the University of Bologna. After graduation, he joined a start-up IT company in Bologna that had implemented a new system to manage online payments. There, he worked on complex Java projects for different customers in different areas. He has also worked at the IT department of FAO, an agency of the United Nations. In 2013, he had the chance to join IBM in Dublin. There, he improved his DevOps skills, working mostly on cloud-based applications. He is a golden member, writes articles at DZone, and maintains a personal blog to share his findings and thoughts about various tech topics.

See other products by Iozzia