Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On GPU Computing with Python Explore the capabilities of GPUs for solving high performance computational problems

Product type Paperback

Published in May 2019

Publisher Packt

ISBN-13 9781789341072

Length 452 pages

Edition 1st Edition

Languages

Python

Tools

CUDA

Concepts

Data Processing

Author (1):

Avimanyu Bandyopadhyay

View More author details

Table of Contents (17) Chapters

Preface

1. Section 1: Computing with GPUs Introduction, Fundamental Concepts, and Hardware

2. Introducing GPU Computing FREE CHAPTER

3. Designing a GPU Computing Strategy

4. Setting Up a GPU Computing Platform with NVIDIA and AMD

5. Section 2: Hands-On Development with GPU Programming

6. Fundamentals of GPU Programming

7. Setting Up Your Environment for GPU Programming

8. Working with CUDA and PyCUDA

9. Working with ROCm and PyOpenCL

10. Working with Anaconda, CuPy, and Numba for GPUs

11. Section 3: Containerization and Machine Learning with GPU-Powered Python

12. Containerization on GPU-Enabled Platforms

13. Accelerated Machine Learning on GPUs

14. GPU Acceleration for Scientific Applications Using DeepChem

15. Other Books You May Enjoy

Leave a review - let other readers know what you think

Appendix A

Interoperability between CuPy and Numba within a single Python program

In this section, we try to test the interoperability between two different modules within the same Python program, namely, CuPy and Numba. So, we import the cuda module from numba and cupy, as well:

from numba import cuda #Using Numba
import cupy as cp #Using CuPy
from timeit import default_timer as timer

N = 500000000

@cuda.jit
def multiply(p, q):
    # Thread id in a 1D block
    tx = cuda.threadIdx.x
    # Block id in a 1D grid
    ty = cuda.blockIdx.x
    # Number of threads per block
    bw = cuda.blockDim.x
    # Compute flattened index inside the array
    index = tx + ty * bw

Like on our previous program, we compute the product based on a condition, as shown in the following code:

    if index < N: # Check array size limit
        q[index]=p[index]*q[index]

def main():
    a_source = cp.zeros(N, dtype=cp.double...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (1)

Avimanyu Bandyopadhyay

Avimanyu Bandyopadhyay is currently pursuing a PhD degree in Bioinformatics based on applied GPU computing in Computational Biology at Heritage Institute of Technology, Kolkata, India. Since 2014, he developed a keen interest in GPU computing, and used CUDA for his master's thesis. He has experience as a systems administrator as well, particularly on the Linux platform. Avimanyu is also a scientific writer, technology communicator, and a passionate gamer. He has published technical writing on open source computing and has actively participated in NVIDIA's GPU computing conferences since 2016. A big-time Linux fan, he strongly believes in the significance of Linux and an open source approach in scientific research. Deep learning with GPUs is his new passion!

See other products by Avimanyu Bandyopadhyay