Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Applied Machine Learning and High-Performance Computing on AWS

You're reading from   Applied Machine Learning and High-Performance Computing on AWS Accelerate the development of machine learning applications following architectural best practices

Arrow left icon
Product type Paperback
Published in Dec 2022
Publisher Packt
ISBN-13 9781803237015
Length 382 pages
Edition 1st Edition
Tools
Arrow right icon
Authors (4):
Arrow left icon
Trenton Potgieter Trenton Potgieter
Author Profile Icon Trenton Potgieter
Trenton Potgieter
Shreyas Subramanian Shreyas Subramanian
Author Profile Icon Shreyas Subramanian
Shreyas Subramanian
Farooq Sabir Farooq Sabir
Author Profile Icon Farooq Sabir
Farooq Sabir
Mani Khanuja Mani Khanuja
Author Profile Icon Mani Khanuja
Mani Khanuja
Arrow right icon
View More author details
Toc

Table of Contents (20) Chapters Close

Preface 1. Part 1: Introducing High-Performance Computing
2. Chapter 1: High-Performance Computing Fundamentals FREE CHAPTER 3. Chapter 2: Data Management and Transfer 4. Chapter 3: Compute and Networking 5. Chapter 4: Data Storage 6. Part 2: Applied Modeling
7. Chapter 5: Data Analysis 8. Chapter 6: Distributed Training of Machine Learning Models 9. Chapter 7: Deploying Machine Learning Models at Scale 10. Chapter 8: Optimizing and Managing Machine Learning Models for Edge Deployment 11. Chapter 9: Performance Optimization for Real-Time Inference 12. Chapter 10: Data Visualization 13. Part 3: Driving Innovation Across Industries
14. Chapter 11: Computational Fluid Dynamics 15. Chapter 12: Genomics 16. Chapter 13: Autonomous Vehicles 17. Chapter 14: Numerical Optimization 18. Index 19. Other Books You May Enjoy

What this book covers

Chapter 1, High-Performance Computing Fundamentals, introduces the concepts of HPC, highlighting the importance of HPC as it relates to real-world scenarios. We then talk about technological advancements in HPC and how you can use it to solve complex business problems with unlimited capacity, the most advanced computing capabilities, and the elasticity of the cloud, while still optimizing cost, to innovate faster and gain a competitive business advantage.

Chapter 2, Data Management and Transfer, dives into data management and transfer. The first step to running HPC applications on the cloud is to move the required data to the cloud. Therefore, this chapter focuses on different aspects of data migration, including challenges and key pain points businesses might face, and how AWS data migration services can help resolve them, following the best practices and still maintaining data integrity, consistency and security.

Chapter 3, Compute and Networking, explains how once you have the data in the cloud, you need to understand the compute options provided by AWS as well as their differences, in order to optimally select the right option based on your business requirements. Furthermore, to scale and secure your HPC applications, we then dive into the Networking section, to explain the concepts of private VPC, low latency networking, and optimizing the performance of inter-instance communications.

Chapter 4, Data Storage, explains that before you can begin performing ML using HPC, it’s important to understand the storage options and storage costs for both the transient and permanent storage requirements. This chapter dives deep into the various storage services in the AWS ecosystem, to help you select the right tool for the right job.

Chapter 5, Data Analysis, teaches you how to explore the data, collect metrics, perform data correlation, and process large amounts of data using AWS to ensure data quality before using it for training ML models.

Chapter 6, Distributed Training of Machine Learning Models, shows you how to implement a large ML model using vast amounts of data by leveraging distributed data parallel and model parallel concepts.

Chapter 7, Deploying Machine Learning Models at Scale, discusses model deployment and inference. We will start with what managed deployment means on AWS, then go on to discuss the right deployment options, followed by various inference options (batch, asynchronous, and real-time). We will then discuss the reliability and availability of model endpoints on AWS infrastructure and the blue/green deployment option for different versions of the model.

Chapter 8, Optimizing and Managing Machine Learning Models for Edge Deployment, explores ML models on edge devices. We will start with an introduction to edge computing, followed by factors we need to consider for the optimization of machine learning models. You will also learn about architecture design for edge deployment.

Chapter 9, Performance Optimization for Real-Time Inference, discusses some of the key performance metrics used for ML models, techniques to reduce the memory footprint of large models, choosing the right machine (instance type) to deploy the model, load testing, and performance tuning of models.

Chapter 10, Data Visualization, covers Amazon SageMaker Data Wrangler, a tool that enables users working in the domains of data science, ML, and analytics to build insightful data visualizations without writing much code. In addition, we will also briefly touch upon the topic of AWS’s graphics-optimized instances, since these instances can be used to create animated live data visualizations along with other high-performance computing applications such as game streaming, and ML.

Chapter 11, Computational Fluid Dynamics, introduces the field of computational fluid dynamics (CFD), which uses numerical analysis to solve fluid flow problems. CFD has far-reaching applications in many industries such as the automotive industry, oil and gas, and manufacturing. We will discuss how CFD solvers can be used on AWS for massive-scale problems, and how recent advances in ML help with accelerating CFD applications.

Chapter 12, Genomics, introduces the field of genomics, and how AWS can help customers with large-scale genomics applications that typically use large datasets. We also discuss typical architectures for storing and analyzing such data, along with how ML is currently being applied to genomics, followed by an example of protein structure analysis.

Chapter 13, Autonomous Vehicles, discusses autonomous vehicles (AVs) and the technology used to safely and efficiently operate a vehicle at various levels of automation. Today, companies use petabytes of data from sensors and cameras on large-scale clusters to perform deep neural network (DNN) training. Specifically, we discuss services that support AV development, architectures for large-scale data processing, and the use of DNNs for training AV models.

Chapter 14, Numerical Optimization, introduces you to what numerical optimization is, and why it is important to solve large-scale problems that we might encounter in this space. We will touch upon some common use cases in this domain and the HPC available to solve these use cases. We will also discuss the application of ML to numerical optimization.

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime