Hands-On GPU Programming with Python and CUDA

Explore high-performance parallel computing with CUDA

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781788993913

Length 310 pages

Edition 1st Edition

Languages

Python

Tools

CUDA

Concepts

Graphics Programming

Author (1):

Dr. Brian Tuomanen

Table of Contents (15) Chapters

Preface

1. Why GPU Programming?

2. Setting Up Your GPU Programming Environment

3. Getting Started with PyCUDA

4. Kernels, Threads, Blocks, and Grids

5. Streams, Events, Contexts, and Concurrency

6. Debugging and Profiling Your CUDA Code

7. Using the CUDA Libraries with Scikit-CUDA

8. The CUDA Device Function Libraries and Thrust

9. Implementation of a Deep Neural Network

10. Working with Compiled GPU Code

11. Performance Optimization in CUDA

12. Where to Go from Here

13. Assessment

14. Other Books You May Enjoy

Summary

The main advantage of a GPU over a CPU is its higher throughput: a GPU can execute more parallel code simultaneously than a CPU can. A GPU will not, however, make recursive or other nonparallelizable algorithms much faster. Some tasks, such as the example of building a house, are only partially parallelizable. In that example, we couldn't speed up the design of the house (which is intrinsically serial), but we could speed up its construction by hiring more laborers (which is parallelizable).

We used this analogy to derive Amdahl's Law, a formula that gives a rough estimate of the potential speedup of a program if we know what proportion of its execution time is spent in parallelizable code and how many processors are available to run that code. We then applied Amdahl's Law to analyze a small program that generates the Mandelbrot set and dumps it to an image file, and we determined that it would be a good candidate for parallelization on a GPU. Finally, we ended with a brief overview of profiling code with the cProfile module, which lets us see where the bottlenecks in a program are without explicitly timing function calls.
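As a rough illustration of the formula summarized above, Amdahl's Law is commonly written as Speedup = 1 / ((1 - p) + p / N), where p is the proportion of execution time spent in parallelizable code and N is the number of processors. The sketch below is not taken from the book: the function name `amdahl_speedup` and the sample values of p and N are assumptions chosen purely for illustration.

```python
def amdahl_speedup(p: float, n: int) -> float:
    """Estimate speedup with Amdahl's Law.

    p: proportion of execution time that is parallelizable (0.0 to 1.0).
    n: number of processors available for the parallel portion (>= 1).

    Hypothetical helper for illustration only.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie between 0 and 1")
    if n < 1:
        raise ValueError("n must be at least 1")
    # The serial part (1 - p) is unaffected; the parallel part p is divided by n.
    return 1.0 / ((1.0 - p) + p / n)


if __name__ == "__main__":
    # Illustrative values only: a program assumed to be 90% parallelizable.
    for n in (1, 2, 8, 64, 1024):
        print(f"N = {n:5d}: estimated speedup = {amdahl_speedup(0.9, n):.2f}x")
```

Note that as N grows, the estimate approaches 1 / (1 - p), so the serial portion of a program ultimately limits how much any number of processors can help.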

Now that we have a few of the fundamental concepts in place, and have a motivator to learn GPU programming, we will spend the next chapter setting up a Linux- or Windows 10-based GPU programming environment. We will then immediately dive into the world of GPU programming in the following chapter, where we will actually write a GPU-based version of the Mandelbrot program that we saw in this chapter.

Authors (1)

Dr. Brian Tuomanen has been working with CUDA and general-purpose GPU programming since 2014. He received his Bachelor of Science in Electrical Engineering from the University of Washington in Seattle, and briefly worked as a software engineer before switching to mathematics for graduate school. He completed his Ph.D. in Mathematics at the University of Missouri in Columbia, where he first encountered GPU programming as a means of studying scientific problems. Dr. Tuomanen has spoken at the US Army Research Lab about general-purpose GPU programming, and has recently led GPU integration and development at a Maryland-based start-up company. He currently lives and works in the Seattle area.
