xla
64ac8b68 - Add heuristic default block sizes for different cases in ragged attention kernel

Commit
252 days ago