- Which programming language is used to define the kernel function using the SourceModule class in PyCUDA? Which compiler will be used to compile this kernel function?
- Write a kernel call function for the myfirst_kernel function used in this chapter, with the number of blocks equal to 1024 x 1024 and threads per block equal to 512 x 512.
- State true or false: The block execution inside PyCUDA program is in sequential order.
- What is the advantage of using the In, Out ,and inout driver class primitives in PyCUDA programs?
- Write a PyCUDA program to add two to every element of a vector with an arbitrary size using the gpuarray class.
- What is the advantage of using CUDA events to measure the time for a kernel execution?
- State true or false: The gpuarray class is the GPU device version of the numpy library in Python.