vllm
06d49028
- [NVFP4][Perf] Tune NVFP4 input quant kernel for small batch size (#30897)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
124 days ago
[NVFP4][Perf] Tune NVFP4 input quant kernel for small batch size (#30897) Signed-off-by: mgoin <mgoin64@gmail.com>
References
#30897 - [NVFP4][Perf] Tune NVFP4 input quant kernel for small batch size
Author
mgoin
Parents
b471092d
Loading