Nowadays, there's a good chance that the training and test data are hosted in some cloud storage system. In this section, we are going to learn how to ingest data with Apache Spark from an object storage service such as Amazon S3 (https://aws.amazon.com/s3/) or an S3-compatible one such as Minio (https://www.minio.io/). Amazon Simple Storage Service (more popularly known as Amazon S3) is an object storage service that is part of the AWS cloud offering. While S3 is available in the public cloud, Minio is a high-performance distributed object storage server, compatible with the S3 protocol and standards, that has been designed for large-scale private cloud infrastructures.
We need to add the Spark Core and Spark SQL dependencies to the Scala project, along with the following (a configuration and read sketch follows after the dependency list):
groupId: com.amazonaws
artifactId: aws-java-sdk-core
version: 1.11.234
groupId: com.amazonaws...
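Once the dependencies are in place, the Spark application can be pointed at the object storage through the S3A connector. The following is a minimal sketch, assuming the hadoop-aws and AWS SDK JARs are available on the classpath; the endpoint (a local Minio server here), access key, secret key, bucket, and file names are placeholders that you would replace with your own values:

import org.apache.spark.sql.SparkSession

object S3IngestionExample {
  def main(args: Array[String]): Unit = {
    val sparkSession = SparkSession.builder()
      .master("local[*]")
      .appName("S3 data ingestion")
      .getOrCreate()

    // Configure the S3A connector. For Minio, fs.s3a.endpoint is the address of
    // the Minio server; for Amazon S3 the default endpoint can be used instead.
    val hadoopConf = sparkSession.sparkContext.hadoopConfiguration
    hadoopConf.set("fs.s3a.endpoint", "http://localhost:9000")   // hypothetical Minio endpoint
    hadoopConf.set("fs.s3a.access.key", "<YOUR_ACCESS_KEY>")     // placeholder credentials
    hadoopConf.set("fs.s3a.secret.key", "<YOUR_SECRET_KEY>")
    hadoopConf.set("fs.s3a.path.style.access", "true")           // typically required by Minio
    hadoopConf.set("fs.s3a.impl", "org.apache.hadoop.fs.s3a.S3AFileSystem")

    // Read a CSV file from a bucket into a DataFrame (bucket and file are placeholders).
    val df = sparkSession.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv("s3a://my-bucket/training-data.csv")

    df.printSchema()
    df.show(10)

    sparkSession.stop()
  }
}

The same s3a:// URI scheme works for both Amazon S3 and Minio, so switching between a public bucket and a private deployment is just a matter of changing the endpoint and credentials.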