wgmma_fence_aligned
wgmma_fence_aligned()
Inserts a memory fence for warp group matrix multiply-accumulate (WGMMA) operations.
The fence ensures that all prior shared memory accesses are visible before any subsequent WGMMA operation. Call it before starting each new sequence of WGMMA operations, and before the matching commit and wait steps that close that sequence.
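The ordering described above can be sketched in CUDA with inline PTX. This is a minimal sketch, not the library's implementation. It assumes that wgmma_fence_aligned() corresponds to the PTX instruction wgmma.fence.sync.aligned, that the target is a Hopper-class GPU compiled for sm_90a, and that the kernel is launched with at least one full warp group (128 threads) in which every thread reaches the fence. The wrapper and kernel names (fence_aligned, commit_group, wait_group, wgmma_sequence_sketch) are hypothetical, and the actual wgmma.mma_async instructions are left out because their operands depend on tile shape and data type.

```cuda
// Sketch only: build for Hopper, e.g. nvcc -arch=sm_90a.
// Wrapper names are hypothetical; only the PTX mnemonics are real.

__device__ __forceinline__ void fence_aligned() {
    // Assumed equivalent of wgmma_fence_aligned(): orders prior accesses
    // before any later wgmma.mma_async issued by this warp group.
    asm volatile("wgmma.fence.sync.aligned;\n" ::: "memory");
}

__device__ __forceinline__ void commit_group() {
    // Batches all previously issued, uncommitted wgmma.mma_async operations.
    asm volatile("wgmma.commit_group.sync.aligned;\n" ::: "memory");
}

template <int N>
__device__ __forceinline__ void wait_group() {
    // Waits until at most N committed groups are still pending.
    asm volatile("wgmma.wait_group.sync.aligned %0;\n" :: "n"(N) : "memory");
}

__global__ void wgmma_sequence_sketch() {
    // 1. Stage the operand tiles in shared memory (omitted), then make the
    //    writes visible to every thread in the block.
    __syncthreads();

    // 2. Fence before the first WGMMA of a new sequence.
    fence_aligned();

    // 3. Issue one or more wgmma.mma_async instructions here (omitted).

    // 4. Commit the batch and wait for it before reading the accumulators.
    commit_group();
    wait_group<0>();
}
```

The point of the sketch is the order of the calls: fence first, then the asynchronous matrix multiplies, then commit and wait. Because the instruction is .aligned, all threads in a warp must execute it together, so it should not sit inside divergent control flow.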