Hands-On GPU Programming with Python and CUDA

Explore high-performance parallel computing with CUDA

Product type Paperback

Published in Nov 2018

Publisher Packt

ISBN-13 9781788993913

Length 310 pages

Edition 1st Edition

Languages

Python

Tools

CUDA

Concepts

Graphics Programming

Author (1):

Dr. Brian Tuomanen

Profiling your code

We saw in the previous example that we can individually time different functions and components with the standard time function in Python. While this approach works fine for our small example program, it won't always be feasible for larger programs that call many different functions, some of which may not be worth our effort to parallelize, or even to optimize on the CPU. Our goal here is to find the bottlenecks and hotspots of a program. Even if we were feeling energetic and wrapped a call to time around every function call we make, we might miss something, or there might be system or library calls that we never considered that happen to be slowing things down. We should find candidate portions of the code to offload onto the GPU before we even think about rewriting the code to run on the GPU; we must always follow the wise words of the famous American computer scientist Donald Knuth: Premature optimization is the root of all evil.
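The manual timing approach described above can be sketched as follows. This is only an illustration: slow_sum and timed are hypothetical names that do not come from the book, and slow_sum merely stands in for whatever function you suspect is expensive.

```python
from time import time


def slow_sum(n):
    # Hypothetical stand-in for an expensive function in a real program.
    total = 0
    for i in range(n):
        total += i * i
    return total


def timed(func, *args):
    # Run func(*args) and return its result with the elapsed wall-clock seconds.
    t0 = time()
    result = func(*args)
    return result, time() - t0


result, elapsed = timed(slow_sum, 200000)
print('slow_sum took {:.4f} seconds'.format(elapsed))
```

Every function of interest has to be wrapped by hand in this way, and anything left unwrapped stays invisible, which is exactly the limitation a profiler removes.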

We use what is known as a profiler to find these hotspots and bottlenecks in our code. A profiler conveniently shows us where our program spends the most time, so that we can optimize accordingly.

Using the cProfile module

We will primarily be using the cProfile module to check our code. This module is part of the standard library and is included in every modern Python installation. We can run the profiler from the command line with -m cProfile, specify that we want the results sorted by the cumulative time spent in each function with -s cumtime, and then redirect the output into a text file with the > operator.

This will work in both the Linux Bash and Windows PowerShell command lines.

Let's try this now:
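The original command listing is not reproduced here, so what follows is a minimal sketch rather than the book's own invocation. It drives the same command-line flags from Python so that it is self-contained: it writes a small stand-in script, profiles it with -m cProfile -s cumtime, and sends the output to a text file. The names my_program.py and my_profile.txt, and the contents of the stand-in script, are assumptions made for illustration.

```python
import os
import subprocess
import sys
import tempfile

# Hypothetical stand-in script; in practice this would be your own program.
script_source = (
    "def busy():\n"
    "    return sum(i * i for i in range(100000))\n"
    "print('result:', busy())\n"
)

workdir = tempfile.mkdtemp()
script_path = os.path.join(workdir, 'my_program.py')    # hypothetical name
profile_path = os.path.join(workdir, 'my_profile.txt')  # hypothetical name

with open(script_path, 'w') as f:
    f.write(script_source)

# Equivalent to typing the following at a shell prompt:
#   python -m cProfile -s cumtime my_program.py > my_profile.txt
with open(profile_path, 'w') as out:
    subprocess.run(
        [sys.executable, '-m', 'cProfile', '-s', 'cumtime', script_path],
        stdout=out,
        check=True,
    )

# Read the report back; the script's own output comes before the profile.
with open(profile_path) as f:
    profile_text = f.read()

print(profile_text[:500])
```

At an interactive prompt, the single command shown in the comment is all that is needed; the surrounding Python only exists to make the sketch runnable on its own.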

We can now look at the contents of the text file with our favorite text editor. Let's keep in mind that the output of the program will be included at the beginning of the file:

Now, since we didn't remove the references to time in the original example, we see their output in the first two lines at the beginning. We can then see the total number of function calls made in this program, and the total amount of time it took to run.

Subsequently, we have a list of the functions that are called in the program, ordered from the cumulatively most time-consuming to the least; the first line is the program itself, while the second line is, as expected, the simple_mandelbrot function from our program. (Notice that the time here aligns with what we measured with the time function.) After this, we can see many library and system calls that relate to dumping the Mandelbrot graph to a file, all of which take comparatively less time. We use such output from cProfile to infer where the bottlenecks are within a given program.
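The same cumulative-time ordering can also be produced from inside a program with the standard cProfile and pstats modules, which is handy when only one part of a program needs profiling. The sketch below is an assumption-laden illustration: the simple_mandelbrot defined here is a much-simplified, hypothetical stand-in for the chapter's function (its signature and body are not taken from the book), and only the five most expensive entries are printed.

```python
import cProfile
import io
import pstats


def simple_mandelbrot(width, height, max_iters):
    # Hypothetical, simplified stand-in: returns an escape count per pixel.
    counts = []
    for row in range(height):
        y = -1.0 + 2.0 * row / (height - 1)
        for col in range(width):
            x = -2.0 + 3.0 * col / (width - 1)
            c = complex(x, y)
            z = 0j
            n = 0
            while n < max_iters and abs(z) <= 2.0:
                z = z * z + c
                n += 1
            counts.append(n)
    return counts


# Profile only the call we care about.
profiler = cProfile.Profile()
profiler.enable()
simple_mandelbrot(64, 64, 50)
profiler.disable()

# Sort by cumulative time (the same ordering as -s cumtime) and keep 5 rows.
buffer = io.StringIO()
stats = pstats.Stats(profiler, stream=buffer)
stats.sort_stats('cumulative').print_stats(5)
report = buffer.getvalue()
print(report)
```

As in the command-line report, the function with the largest cumulative time is the first candidate to examine when deciding what to offload to the GPU.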

© 2018 Packt Publishing Limited All Rights Reserved

Authors (1)

Dr. Brian Tuomanen has been working with CUDA and general-purpose GPU programming since 2014. He received his Bachelor of Science in Electrical Engineering from the University of Washington in Seattle, and briefly worked as a software engineer before switching to mathematics for graduate school. He completed his Ph.D. in Mathematics at the University of Missouri in Columbia, where he first encountered GPU programming as a means for studying scientific problems. Dr. Tuomanen has spoken at the US Army Research Lab about general-purpose GPU programming, and has recently led GPU integration and development at a Maryland-based start-up company. He currently lives and works in the Seattle area.
