vllm
077a9a8e
- [torch.compile] Refactor Attention Quant Fusion Pass and Remove Boilerplate (#37373)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
35 days ago
[torch.compile] Refactor Attention Quant Fusion Pass and Remove Boilerplate (#37373) Signed-off-by: BadrBasowid <badr.basowid@gmail.com> Co-authored-by: vllmellm <vllm.ellm@embeddedllm.com>
References
#37373 - [torch.compile] Refactor Attention Quant Fusion Pass and Remove Boilerplate
Author
BadrBasowid
Parents
07edd551
Loading