vllm
c17231e8 - Fix kv_cache_dtype handling for out-of-tree HPU plugin (#21302)

Commit
165 days ago
Fix kv_cache_dtype handling for out-of-tree HPU plugin (#21302)

Signed-off-by: Konrad Zawora <kzawora@habana.ai>
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
Co-authored-by: Chendi.Xue <chendi.xue@intel.com>