vllm
24575899 - [Bugfix] Rescale NVFP4 weight scales to fix BF16 dequant underflow (#34577)

Commit
36 days ago
[Bugfix] Rescale NVFP4 weight scales to fix BF16 dequant underflow (#34577)

Signed-off-by: ricky-chaoju <ricky.chen@infinirc.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>