vllm
[Bugfix] Remove hardcoded `head_size=256` for Deepseek v2 and v3
#12067
Merged

[Bugfix] Remove hardcoded `head_size=256` for Deepseek v2 and v3 #12067

Isotr0py
Isotr0py fix deepseek v2 and v3 head dim
e3486379
github-actions
Isotr0py Isotr0py requested a review from simon-mo simon-mo 1 year ago
Isotr0py Isotr0py requested a review from youkaichao youkaichao 1 year ago
mgoin
mgoin approved these changes on 2025-01-15
Isotr0py update test head_sizes
fb21cf0e
Isotr0py Isotr0py requested a review from tlrmchlsmth tlrmchlsmth 1 year ago
Isotr0py Isotr0py requested a review from WoosukKwon WoosukKwon 1 year ago
Isotr0py remove comment
7856937d
Isotr0py Isotr0py added ready
Isotr0py fix test head_sizes
9c9dd006
Isotr0py Isotr0py enabled auto-merge (squash) 1 year ago
Isotr0py Isotr0py merged dd7c9ad8 into main 1 year ago
Isotr0py Isotr0py deleted the fix-deepseek-head branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone