vllm
179ae7da
- [Revert] Fix performance regression for GLM-4.7-GPTQ decode and MTP acceptance rate (#33771)
Commit
12 days ago
[Revert] Fix performance regression for GLM-4.7-GPTQ decode and MTP acceptance rate (#33771)

Signed-off-by: aabbccddwasd <aabbccddwasd@qq.com>
References
#33771 - [Revert] Fix performance regression for GLM-4.7-GPTQ decode and MTP acceptance rate
Author
aabbccddwasd
Parents
c4df59ad