Version: Nightly


copy_sram_to_dram

def copy_sram_to_dram[thread_layout: Layout[thread_layout.shape_types, thread_layout.stride_types], *, swizzle: Optional[Swizzle] = None, num_threads: Int = thread_layout.size()](dst: TileTensor[Storage=dst.Storage, linear_idx_type=dst.linear_idx_type], src: TileTensor[Storage=src.Storage, address_space=AddressSpace.SHARED, linear_idx_type=src.linear_idx_type])

Synchronously copies a tile from SRAM (shared memory) to DRAM (generic memory).

Delegates to SharedToGenericTileCopier. The legacy free function's binary_op fusion and fp32-to-half-precision downcast paths are not supported here.

Parameters:

thread_layout (Layout[thread_layout.shape_types, thread_layout.stride_types]): Layout describing how threads are organized over the copy.
swizzle (Optional[Swizzle]): Swizzle that was applied when the shared-memory tile was written; the value passed here must match it. Defaults to None.
num_threads (Int): Total number of threads in the thread block; defaults to thread_layout.size(). Threads beyond thread_layout.size() do not participate in the copy.
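
The requirement that swizzle match the pattern used when the tile was written can be illustrated with a small model. The sketch below is plain Python, not the Mojo API: xor_swizzle, write_tile, and read_tile are hypothetical helpers, and the XOR pattern is an assumed stand-in for whatever Swizzle is actually configured. It only shows that reading with the same mapping recovers the logical tile, while reading without it returns the same values in the wrong positions.

```python
# Conceptual model only: all names here are hypothetical, not Mojo library calls.
ROWS, COLS = 4, 4  # COLS must be a power of two so the XOR stays in range


def xor_swizzle(row: int, col: int) -> int:
    """Assumed swizzle: XOR the column with the row, then linearize row-major."""
    return row * COLS + (col ^ (row % COLS))


def linear(row: int, col: int) -> int:
    """Plain row-major offset (no swizzle)."""
    return row * COLS + col


def write_tile(values, swizzled: bool):
    """Store a logical tile into a flat 'shared memory' buffer."""
    index = xor_swizzle if swizzled else linear
    smem = [None] * (ROWS * COLS)
    for r in range(ROWS):
        for c in range(COLS):
            smem[index(r, c)] = values[r][c]
    return smem


def read_tile(smem, swizzled: bool):
    """Load the logical tile back out of the flat buffer."""
    index = xor_swizzle if swizzled else linear
    return [[smem[index(r, c)] for c in range(COLS)] for r in range(ROWS)]


tile = [[r * COLS + c for c in range(COLS)] for r in range(ROWS)]
smem = write_tile(tile, swizzled=True)

matched = read_tile(smem, swizzled=True)      # same swizzle as the writer
mismatched = read_tile(smem, swizzled=False)  # writer swizzled, reader did not

print(matched == tile)     # True: logical order is recovered
print(mismatched == tile)  # False: elements within rows are permuted
```

In this model a mismatch corrupts the layout silently rather than raising an error, which is why the swizzle has to be carried along with the tile from the point where it is written.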

Args:

dst (TileTensor[Storage=dst.Storage, linear_idx_type=dst.linear_idx_type]): Destination tile in generic memory.
src (TileTensor[Storage=src.Storage, address_space=AddressSpace.SHARED, linear_idx_type=src.linear_idx_type]): Source tile in shared memory.
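
As a rough mental model of how thread_layout and num_threads interact, the following Python sketch simulates the copy on the host. It is not the Mojo implementation: copy_tile_model is a hypothetical function, nested lists stand in for the src and dst tiles, and the element-to-thread assignment (each thread strides over the tile by the thread layout's shape, row-major) is an assumption about how the work is distributed. What it does reflect from the description above is that only the first thread_layout.size() threads take part.

```python
# Conceptual host-side model; the distribution pattern is an assumption.
def copy_tile_model(dst, src, thread_shape, num_threads):
    """Copy src into dst, returning how many elements each thread copied."""
    t_rows, t_cols = thread_shape
    participating = t_rows * t_cols  # plays the role of thread_layout.size()
    rows, cols = len(src), len(src[0])
    work = {}
    for tid in range(num_threads):
        if tid >= participating:
            work[tid] = 0  # threads beyond the layout do not participate
            continue
        # Row-major thread coordinates inside the thread layout.
        tr, tc = divmod(tid, t_cols)
        count = 0
        # Each thread visits the elements at its coordinate, strided by the
        # thread layout's shape in both dimensions.
        for r in range(tr, rows, t_rows):
            for c in range(tc, cols, t_cols):
                dst[r][c] = src[r][c]
                count += 1
        work[tid] = count
    return work


src = [[r * 8 + c for c in range(8)] for r in range(4)]  # 4x8 "shared" tile
dst = [[None] * 8 for _ in range(4)]                     # 4x8 "generic" tile

# A 2x4 thread layout inside a 16-thread block: 8 threads copy, 8 stay idle.
work = copy_tile_model(dst, src, thread_shape=(2, 4), num_threads=16)

print(dst == src)         # True
print(work[0], work[15])  # 4 0
```

With a 4x8 tile and a 2x4 thread layout, each participating thread handles four elements, so the tile is covered exactly once with no overlap between threads.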