text-generation-inference
f08b44ad - Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239)

Commit
212 days ago
Upgrade to new vllm extension ops for Gaudi backend (fix issue in exponential bucketing) (#3239) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Author
Parents
Loading