IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /docs/manual/basics.md). For the complete Mojo documentation index, see llms.txt.
Version: 1.0.0b1

async_copy_wait_all

async_copy_wait_all()

Waits for completion of all committed cp.async-groups.

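This function blocks the calling thread until all previously committed cp.async-groups have completed their memory transfers. It is a completion point for that thread's own asynchronous copies, not a synchronization between threads: a separate thread barrier is still needed before reading data that other threads copied.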

Notes:

  • Only supported on NVIDIA GPUs.
  • Maps to the cp.async.wait_all PTX instruction.
  • Ensures all outstanding asynchronous transfers are complete.
  • Coarser-grained than async_copy_wait_group(), which can leave a chosen number of the most recent groups pending.
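
Because the notes describe this function in terms of PTX, the ordering it enforces can be illustrated at that level. The following is a minimal CUDA C++ sketch, not Mojo code and not the Mojo library's implementation. It assumes an NVIDIA GPU with compute capability 8.0 or newer, and the names `reverse_within_block`, `reverse_blocks_ok`, and `kThreads` are hypothetical, chosen only for this example. Each thread issues one asynchronous global-to-shared copy, waits with the PTX instruction spelled `cp.async.wait_all` in the PTX ISA, and then reads an element that a different thread copied, which is why a block barrier follows the wait.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int kThreads = 256;  // threads per block (illustrative choice)

// Reverses each block-sized chunk of `in` into `out`, staging through shared memory.
// Assumes the grid covers the input exactly (element count is a multiple of kThreads).
__global__ void reverse_within_block(const float* in, float* out) {
    __shared__ float tile[kThreads];
    const int base = blockIdx.x * blockDim.x;
    const int t = threadIdx.x;

    // cp.async expects a 32-bit shared-state-space address for the destination.
    const unsigned dst = static_cast<unsigned>(__cvta_generic_to_shared(&tile[t]));

    // Issue an asynchronous 4-byte copy from global to shared memory.
    asm volatile("cp.async.ca.shared.global [%0], [%1], 4;\n"
                 :: "r"(dst), "l"(in + base + t) : "memory");

    // Block this thread until every cp.async it has issued is complete.
    asm volatile("cp.async.wait_all;\n" ::: "memory");

    // The wait covers only this thread's copies; the barrier makes the
    // elements copied by the other threads in the block safe to read.
    __syncthreads();

    out[base + t] = tile[kThreads - 1 - t];
}

// Host-side helper: runs the kernel and checks the result on the CPU.
bool reverse_blocks_ok(int n) {
    if (n <= 0 || n % kThreads != 0) return false;
    std::vector<float> h_in(n), h_out(n, 0.0f);
    for (int i = 0; i < n; ++i) h_in[i] = static_cast<float>(i);

    const size_t bytes = static_cast<size_t>(n) * sizeof(float);
    float* d_in = nullptr;
    float* d_out = nullptr;
    if (cudaMalloc(&d_in, bytes) != cudaSuccess) return false;
    if (cudaMalloc(&d_out, bytes) != cudaSuccess) { cudaFree(d_in); return false; }

    bool ok = cudaMemcpy(d_in, h_in.data(), bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    reverse_within_block<<<n / kThreads, kThreads>>>(d_in, d_out);
    ok = ok && cudaGetLastError() == cudaSuccess;
    ok = ok && cudaMemcpy(h_out.data(), d_out, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    cudaFree(d_in);
    cudaFree(d_out);

    // Element t of each chunk should hold element (kThreads - 1 - t) of the same chunk.
    for (int i = 0; ok && i < n; ++i) {
        const int chunk = (i / kThreads) * kThreads;
        ok = (h_out[i] == h_in[chunk + kThreads - 1 - (i % kThreads)]);
    }
    return ok;
}
```

To try the sketch, call `reverse_blocks_ok(1024)` from a `main` function and build with `nvcc -arch=sm_80` or a newer architecture. A Mojo kernel would presumably follow the same order (issue the copies, call `async_copy_wait_all()`, then synchronize the block before reading), but check the signatures against the Mojo API reference for the version in use.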