You're reading from Applied Machine Learning and High-Performance Computing on AWS Accelerate the development of machine learning applications following architectural best practices

Product type Paperback

Published in Dec 2022

Publisher Packt

ISBN-13 9781803237015

Length 382 pages

Edition 1st Edition

Tools

AWS

Concepts

Machine Learning

Authors (4):

Trenton Potgieter

Shreyas Subramanian

Farooq Sabir

Mani Khanuja

View More author details

Table of Contents (20) Chapters

Preface

1. Part 1: Introducing High-Performance Computing

2. Chapter 1: High-Performance Computing Fundamentals FREE CHAPTER

3. Chapter 2: Data Management and Transfer

4. Chapter 3: Compute and Networking

5. Chapter 4: Data Storage

6. Part 2: Applied Modeling

7. Chapter 5: Data Analysis

8. Chapter 6: Distributed Training of Machine Learning Models

9. Chapter 7: Deploying Machine Learning Models at Scale

10. Chapter 8: Optimizing and Managing Machine Learning Models for Edge Deployment

11. Chapter 9: Performance Optimization for Real-Time Inference

12. Chapter 10: Data Visualization

13. Part 3: Driving Innovation Across Industries

14. Chapter 11: Computational Fluid Dynamics

15. Chapter 12: Genomics

16. Chapter 13: Autonomous Vehicles

17. Chapter 14: Numerical Optimization

18. Index

Why subscribe?

19. Other Books You May Enjoy

Processing data at scale on AWS

In the previous section, Analyzing large amounts of unstructured data, the data was stored in an S3 bucket, which was used for training. There will be scenarios where you will need to load data faster for training instead of waiting for the training job to copy the data from S3 locally into your training instance. In these scenarios, you can store the data on a file system, such as Amazon Elastic File System (EFS) or Amazon FSx, and mount it to the training instance, which will be faster than storing the data in S3 location. The code for this is in the 3_unstructured_data.ipynb notebook. Refer to the Optimize it with data on EFS and Optimize it with data on FSX sections in the notebook.

Note

Before you run the Optimize it with data on EFS and Optimize it with data on FSX sections, please launch the CloudFormation template_filesystems.yaml template, in a similar fashion as we did in the Setting up EMR and SageMaker Studio section.

The rest of the chapter is locked

You're reading from Applied Machine Learning and High-Performance Computing on AWS Accelerate the development of machine learning applications following architectural best practices

Table of Contents (20) Chapters

Processing data at scale on AWS

Unlock this book and the full library FREE for 7 days

Authors (4)

Personalised recommendations for you