vllm
ac3dac54 - [Bugfix][Perf] Indexer upcast WK to BF16 for fusion (#38928)

Commit
11 days ago
[Bugfix][Perf] Indexer upcast WK to BF16 for fusion (#38928) Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Author
Parents
Loading