jax
69fd2052 - [Pallas:MGPU] Expose multicast TMA stores

Commit
105 days ago
[Pallas:MGPU] Expose multicast TMA stores They use a new MulticastRef transform. Unfortunately we can't use it to implement regular multicast stores, as we always use swap_p to represent stores and there's no good dual to multicast store (ld_reduce is not what we want!). PiperOrigin-RevId: 819742981
References
Author
Parents
Loading