cluster_sync_relaxed
cluster_sync_relaxed()
Performs a full cluster synchronization with relaxed memory ordering.
This convenience function combines cluster_arrive_relaxed() and cluster_wait() to form a barrier across all thread blocks in the cluster. It synchronizes execution only and provides no memory ordering guarantees. Only supported on NVIDIA SM90+ GPUs.
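As a rough illustration of the description above, the sketch below shows the two-step form next to the one-call form. It assumes the three functions can be imported from a `gpu.cluster` module, which is an assumption, not something this page states. The helper names `sync_cluster_by_hand` and `sync_cluster` are hypothetical. Both helpers are meant to be called from device code launched with a cluster configuration on an SM90+ GPU.

```mojo
# Assumption: these functions live in `gpu.cluster`; adjust the import
# path to match your installed library version.
from gpu.cluster import (
    cluster_arrive_relaxed,
    cluster_sync_relaxed,
    cluster_wait,
)


fn sync_cluster_by_hand():
    """Hypothetical helper: the two-step form described above."""
    # Signal that this thread has reached the barrier (relaxed ordering).
    cluster_arrive_relaxed()
    # Block until the rest of the cluster has arrived as well.
    cluster_wait()


fn sync_cluster():
    """Hypothetical helper: the one-call convenience form."""
    cluster_sync_relaxed()
```

Because the barrier carries no memory ordering guarantees, it should not be relied on to make one block's earlier writes visible to other blocks in the cluster. It fits cases where the blocks only need to agree that everyone has reached the same point.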