text-generation-inference
ebb26f0c
- [gaudi] Deepseek v2 mla and add ep to unquantized moe (#3287)
Commit
188 days ago
[gaudi] Deepseek v2 mla and add ep to unquantized moe (#3287)

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
References
#3287 - [gaudi] Deepseek v2 mla and add ep to unquantized moe
Author
sywangyi
Parents
778b61c0