FZJ-JSC/tutorial-multi-gpu


ISC25 Tutorial: Efficient Distributed GPU Programming for Exascale

Repository with talks and exercises of our Efficient Distributed GPU Programming for Exascale tutorial, to be held at ISC25.

Coordinates

  • Date: 13 June 2025
  • Occasion: ISC25 Tutorial
  • Tutors: Simon Garcia de Gonzalo (SNL), Andreas Herten (JSC), Lena Oden (Uni Hagen), with support by Markus Hrywniak (NVIDIA) and Jiri Kraus (NVIDIA)

Setup

The tutorial is interactive: introductory lectures alternate with practical exercises that apply the material. The exercises are derived from the Jacobi solver implementations available in NVIDIA/multi-gpu-programming-models.

Walk-through:

Curriculum (Note: square-bracketed sessions are skipped at ISC25 because only ½ day was allocated to the tutorial):

  1. Lecture: Tutorial Overview, Introduction to System + Onboarding (Andreas)
  2. Lecture: MPI-Distributed Computing with GPUs (Simon)
  3. Hands-on: Multi-GPU Parallelization
  4. [Lecture: Performance / Debugging Tools]
  5. Lecture: Optimization Techniques for Multi-GPU Applications (Lena)
  6. Hands-on: Overlap Communication and Computation with MPI
  7. [Lecture: Overview of NCCL and NVSHMEM in MPI]
  8. [Hands-on: Using NCCL and NVSHMEM]
  9. [Lecture: Device-initiated Communication with NVSHMEM]
  10. [Hands-on: Using Device-Initiated Communication with NVSHMEM]
  11. Lecture: Conclusion and Outline of Advanced Topics (Andreas)