vllm
3bc2734d
- [Kernel] Fuse FP8 output quantization into merge_attn_states (#36518)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
21 days ago
[Kernel] Fuse FP8 output quantization into merge_attn_states (#36518) Signed-off-by: Carl You <4531192+carlyou@users.noreply.github.com>
References
#36518 - [Kernel] Fuse FP8 output quantization into merge_attn_states
Author
carlyou
Parents
1f5ec288
Loading