vllm
e3b90c1b
- [Bugfix][Speculative Decoding] Extend Eagle quantization config fix to llama_eagle.py (#26590)
Commit
196 days ago
[Bugfix][Speculative Decoding] Extend Eagle quantization config fix to llama_eagle.py (#26590)

Signed-off-by: Rahul Tuli <rtuli@redhat.com>
References
#26590 - [Bugfix][Speculative Decoding] Extend Eagle quantization config fix to llama_eagle.py
Author
rahul-tuli
Parents
134f70b3