vllm
b41aeb34
- [Bugfix][ROCm] Fix load issue on deepseek quark quantization when shared expert enabled (#31261)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 days ago
[Bugfix][ROCm] Fix load issue on deepseek quark quantization when shared expert enabled (#31261) Signed-off-by: ganyi <ygan@amd.com>
References
#31261 - [Bugfix][ROCm] Fix load issue on deepseek quark quantization when shared expert enabled
Author
ganyi1996ppo
Parents
ddfac703
Loading