vllm
24575899
- [Bugfix] Rescale NVFP4 weight scales to fix BF16 dequant underflow (#34577)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
36 days ago
[Bugfix] Rescale NVFP4 weight scales to fix BF16 dequant underflow (#34577) Signed-off-by: ricky-chaoju <ricky.chen@infinirc.com> Co-authored-by: Michael Goin <mgoin64@gmail.com>
References
#34577 - [Bugfix] Rescale NVFP4 weight scales to fix BF16 dequant underflow
Author
ricky-chaoju
Parents
1204cf0a
Loading