vllm ac3dac54 - [Bugfix][Perf] Indexer upcast WK to BF16 for fusion (#38928)
Commit
11 days ago
[Bugfix][Perf] Indexer upcast WK to BF16 for fusion (#38928)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
References
#38928 - [Bugfix][Perf] Indexer upcast WK to BF16 for fusion
Author
benchislett
Parents
39ac6404