vllm
02c97d9a
- [Quantization] Add compressed-tensors emulations support for NVFP4 (#19879)
Commit
242 days ago
[Quantization] Add compressed-tensors emulations support for NVFP4 (#19879)

Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com>
Signed-off-by: Dipika <dipikasikka1@gmail.com>
References
#19879 - [Quantization] Add compressed-tensors emulations support for NVFP4
Author
dsikka
Parents
e795d723