vllm
[Model] Deepseek GGUF support
#13167
Merged

[Model] Deepseek GGUF support #13167

SzymonOzog
SzymonOzog SzymonOzog requested a review from mgoin mgoin 1 year ago
SzymonOzog SzymonOzog requested a review from robertgshaw2-redhat robertgshaw2-redhat 1 year ago
SzymonOzog SzymonOzog requested a review from tlrmchlsmth tlrmchlsmth 1 year ago
github-actions
SzymonOzog SzymonOzog force pushed to 10383803 1 year ago
Isotr0py
Isotr0py commented on 2025-02-13
junuMoon
SzymonOzog
junuMoon
SzymonOzog
chuangzhidan
SzymonOzog
zlh1992
irdbl
seven1122
chuangzhidan
seven1122
huang-junhong
huang-junhong approved these changes on 2025-02-18
z7d1
leolmj
SzymonOzog
z7d1
SzymonOzog
davidsyoung
z7d1
davidsyoung
slr1997
junuMoon
leolmj
SzymonOzog
Isotr0py
Isotr0py commented on 2025-02-20
Isotr0py
Isotr0py commented on 2025-02-20
priscilla-pan
leolmj
SzymonOzog
priscilla-pan
priscilla-pan
priscilla-pan
vip-china
SzymonOzog WIP gguf MoE
d528c245
SzymonOzog Manual mapping of deepseek layers
625b6589
SzymonOzog Proof of concept GGUF MoE
c455901b
SzymonOzog MoE weight initialization for GGUF
a18d44ba
SzymonOzog Expert weight initialization for GGUF
8374b373
Add option to override hf config
011fbfb4
Cleanup
0f1c8ee3
SzymonOzog Fix gguf weight loading for replicated linear
ec4de414
Make hf_config_path optional
1ff2fd59
fix inheritance
ea9eb0eb
lint
58022e93
add hf config path to parser
ee1dc1f8
loading all experts at once
8a7e61fa
SzymonOzog Accept suggestion
da3dff61
SzymonOzog Accept suggestion
ae19a8a2
SzymonOzog Accept suggestion
bb4af454
SzymonOzog Accept suggestion
b92793e0
Incorporate feedback from code review + add assertions and config
0ea9252d
SzymonOzog SzymonOzog force pushed to 0ea9252d 1 year ago
mergify mergify added documentation
SzymonOzog
SzymonOzog
Isotr0py
SzymonOzog
irdbl
SzymonOzog
justinjja
fclearner
SzymonOzog
SzymonOzog
mergify
mergify mergify added needs-rebase
Isotr0py
Isotr0py commented on 2025-02-24
fclearner
merge from main
36f73b59
missing parameters
7caa1f74
mergify mergify removed needs-rebase
Isotr0py
Isotr0py commented on 2025-02-25
Isotr0py
Isotr0py approved these changes on 2025-02-25
remove TP assettion
a264af34
Fix supported dtypes
28baf157
SzymonOzog
SzymonOzog
Isotr0py Isotr0py enabled auto-merge (squash) 1 year ago
github-actions github-actions added ready
cjackal
cjackal commented on 2025-02-26
disabled auto-merge 1 year ago
Manually disabled by user
SzymonOzog SzymonOzog requested a review from DarkLight1337 DarkLight1337 1 year ago
SzymonOzog SzymonOzog requested a review from ywang96 ywang96 1 year ago
SzymonOzog SzymonOzog requested a review from simon-mo simon-mo 1 year ago
SzymonOzog SzymonOzog requested a review from WoosukKwon WoosukKwon 1 year ago
SzymonOzog SzymonOzog requested a review from njhill njhill 1 year ago
SzymonOzog SzymonOzog requested a review from comaniac comaniac 1 year ago
SzymonOzog SzymonOzog requested a review from alexm-redhat alexm-redhat 1 year ago
SzymonOzog SzymonOzog requested a review from zhuohan123 zhuohan123 1 year ago
SzymonOzog SzymonOzog requested a review from youkaichao youkaichao 1 year ago
mergify mergify added ci/build
mergify mergify added frontend
mergify mergify added structured-output
mergify mergify added speculative-decoding
mergify mergify added v1
feedback: add activation parameter
e5473941
SzymonOzog SzymonOzog force pushed to e5473941 1 year ago
SzymonOzog
Isotr0py Isotr0py enabled auto-merge (squash) 1 year ago
ZinonDynn
cjackal
vllm-bot vllm-bot merged 7f0be2aa into main 1 year ago
ZinonDynn
cjackal
vip-china
boywuxu
cjackal
boywuxu
zlh1992
maekawatoshiki
SzymonOzog
maekawatoshiki
iehgit
boywuxu
zlh1992
SzymonOzog
richjjj
richjjj
SzymonOzog
justinjja
SzymonOzog
davidsyoung
cjackal
davidsyoung
cjackal
davidsyoung
davidsyoung
davidsyoung
davidsyoung
SzymonOzog
SzymonOzog
davidsyoung
davidsyoung
SzymonOzog
joshuakoh1
joshuakoh1
SzymonOzog
lv03
zhaotyer
zhaotyer
SzymonOzog
zhaotyer
SzymonOzog
zhaotyer
lv03
SzymonOzog
lv03
SzymonOzog
zhaotyer
zhaotyer
zhaotyer
hahmad2008
ChuanhongLi
SzymonOzog
ChuanhongLi
ChuanhongLi
maekawatoshiki
SzymonOzog
hyunwen
kechengcode

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone