Version: 1.0

apple_mma_load

apple_mma_load[dtype: DType](ptr: UnsafePointer[Scalar[dtype], ptr.origin, address_space=ptr.address_space], row_stride: Int, col_stride: Int = 1) -> SIMD[dtype, 8]

Loads the calling thread's portion of a 16x16 matrix fragment. Each thread in the simdgroup receives 8 of the tile's elements.

Parameters:

  • dtype (DType): Element type of the matrix.

Args:

  • ptr (UnsafePointer): Pointer to the top-left corner of the 16x16 tile.
  • row_stride (Int): Distance between consecutive rows in the buffer.
  • col_stride (Int): Distance between consecutive columns within a row.

Returns:

SIMD: A SIMD vector holding the 8 elements of the fragment that belong to the calling thread.
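
The role of `row_stride` and `col_stride` can be illustrated with a small sketch. It assumes both strides are measured in elements (not bytes) and that element (row, col) of the tile sits at offset `row * row_stride + col * col_stride` from `ptr`; the reference above does not state either point explicitly. `element_offset` is a hypothetical helper written for illustration, not part of the API, and the sketch does not show which 8 elements a given thread receives.

```mojo
# Hypothetical helper (not part of the documented API).
# Assumption: strides are counted in elements, and element (row, col)
# of the tile lives at ptr + row * row_stride + col * col_stride.
fn element_offset(row: Int, col: Int, row_stride: Int, col_stride: Int = 1) -> Int:
    return row * row_stride + col * col_stride


fn main():
    # Row-major tile stored contiguously: row_stride = 16, col_stride = 1.
    print(element_offset(2, 3, 16))  # 35

    # Column-major tile: row_stride = 1, col_stride = 16.
    print(element_offset(2, 3, 1, 16))  # 50

    # A 16x16 tile has 256 elements; split across a 32-thread simdgroup,
    # that is 8 elements per thread, matching the SIMD[dtype, 8] return type.
    print((16 * 16) // 32)  # 8
```

Under these assumptions, a tile embedded in a larger row-major matrix would pass the full matrix width as `row_stride` and leave `col_stride` at its default of 1.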