This modular tutorial contains content on all things related to accelerated Python:
- Notebooks containing lessons and exercises, intended for self-paced or instructor-led learning, which can be run on NVIDIA Brev or Google Colab.
- Slides containing the lecture content for the lessons.
- Syllabi that select a subset of the notebooks for a particular learning objective.
- Docker Images and Docker Compose files for creating Brev Launchables or running locally.
Brev Launchables of this tutorial should use:
- L40S, L4, or T4 instances (for non-distributed notebooks).
- 4xL4 or 2xL4 instances (for distributed notebooks).
- Crusoe or any other provider with Flexible Ports.
- CUDA Python - CuPy, cuDF, CCCL, & Kernels - 8 Hours.
- CUDA Python - cuda.core & CCCL - 2 Hours
- PyHPC - NumPy, CuPy, & mpi4py - 4 Hours
| # | Exercise | Link | Solution |
|---|---|---|---|
| 40 | Kernel Authoring: Copy | ||
| 41 | Kernel Authoring: Book Histogram | ||
| 42 | Kernel Authoring: Gaussian Blur | ||
| 43 | Kernel Authoring: Black and White |
| # | Exercise | Link | Solution |
|---|---|---|---|
| 60 | mpi4py | ||
| 61 | Dask |