text-generation-inference
f08b44ad
- Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
212 days ago
Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
References
#3239 - upgrade to new vllm extension ops(fix issue in exponential bucketing)
Author
sywangyi
Parents
674c514d
Loading