IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /docs/manual/basics.md). For the complete Mojo documentation index, see llms.txt.
Version: 1.0.0b1

cp_async_bulk_shared_cluster_global

cp_async_bulk_shared_cluster_global[dst_type: AnyType, src_type: AnyType, mbr_type: AnyType, /, *, eviction_policy: CacheEviction = CacheEviction.EVICT_NORMAL](dst_mem: UnsafePointer[dst_type, address_space=AddressSpace.SHARED], src_mem: UnsafePointer[src_type, address_space=src_mem.address_space], size: Int32, mem_bar: UnsafePointer[mbr_type, address_space=AddressSpace.SHARED])

Initiates an asynchronous bulk copy from global memory to shared CTA memory.

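Performs a non-blocking copy of size bytes from global memory to shared memory using the cp.async.bulk PTX instruction. The call returns before the data has arrived; completion is signaled via the mbarrier specified by mem_bar, so the destination should not be read until a wait on that barrier has succeeded.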

Both dst_mem and src_mem must be 16-byte aligned, and size must be a multiple of 16. Requires sm_100 or higher.
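The alignment rules above are easy to get wrong when a tile size is computed from an element count. As a minimal sketch of the arithmetic only, the following Python helpers check the three stated constraints and round a byte count up to the next multiple of 16. The names round_up_to_16 and check_bulk_copy_args are hypothetical and are not part of the Mojo standard library; the addresses are plain integers standing in for pointer values.

```python
ALIGNMENT = 16  # bytes; the alignment stated above for dst_mem, src_mem, and size


def round_up_to_16(nbytes: int) -> int:
    """Round a byte count up to the next multiple of 16 (hypothetical helper)."""
    return (nbytes + ALIGNMENT - 1) // ALIGNMENT * ALIGNMENT


def check_bulk_copy_args(dst_addr: int, src_addr: int, size: int) -> list[str]:
    """Return a list of violated constraints; an empty list means all hold.

    This is a hypothetical host-side check, not an API of the library.
    """
    problems = []
    if dst_addr % ALIGNMENT != 0:
        problems.append("dst_mem is not 16-byte aligned")
    if src_addr % ALIGNMENT != 0:
        problems.append("src_mem is not 16-byte aligned")
    if size % ALIGNMENT != 0:
        problems.append("size is not a multiple of 16")
    return problems


if __name__ == "__main__":
    # 25 float32 values occupy 100 bytes, which is not a multiple of 16.
    tile_bytes = 25 * 4
    print(check_bulk_copy_args(0x1000, 0x2000, tile_bytes))
    print(round_up_to_16(tile_bytes))
```

In this example a tile of 25 float32 values (100 bytes) fails the size constraint, so the buffer would have to be padded to 112 bytes before it could be copied with a single bulk transfer.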

Parameters:

  • dst_type (AnyType): The element type of the destination shared memory.
  • src_type (AnyType): The element type of the source global memory.
  • mbr_type (AnyType): The element type of the mbarrier object in shared memory.
  • eviction_policy (CacheEviction): Cache eviction policy for the L2 cache. Defaults to EVICT_NORMAL.

Args: