CUDA: add attention sinks for tile and wmma #15178
CUDA: add attention sinks for tile and wmma
4946c199
Review: formatting changes + remove syncthreads from tile + remove wa…
1ef7fd00
am17an
merged
34c9d765
into master 32 days ago
am17an
deleted the cuda_fattn_tile_wmma branch 32 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub