vllm
[Model] Deepseek GGUF support
#13167
Merged
vllm-bot merged 23 commits into vllm-project:main from SzymonOzog:gguf-deepseek
SzymonOzog requested a review from mgoin 1 year ago
SzymonOzog requested a review from robertgshaw2-redhat 1 year ago
SzymonOzog requested a review from tlrmchlsmth 1 year ago
SzymonOzog force pushed to 10383803 1 year ago
Isotr0py commented on 2025-02-13
huang-junhong approved these changes on 2025-02-18
Isotr0py commented on 2025-02-20
Isotr0py commented on 2025-02-20
WIP gguf MoE (d528c245)
Manual mapping of deepseek layers (625b6589)
Proof of concept GGUF MoE (c455901b)
MoE weight initialization for GGUF (a18d44ba)
Expert weight initialization for GGUF (8374b373)
Add option to override hf config (011fbfb4)
Cleanup (0f1c8ee3)
Fix gguf weight loading for replicated linear (ec4de414)
Make hf_config_path optional (1ff2fd59)
fix inheritance (ea9eb0eb)
lint (58022e93)
add hf config path to parser (ee1dc1f8)
loading all experts at once (8a7e61fa)
Accept suggestion (da3dff61)
Accept suggestion (ae19a8a2)
Accept suggestion (bb4af454)
Accept suggestion (b92793e0)
Incorporate feedback from code review + add assertions and config (0ea9252d)
SzymonOzog force pushed to 0ea9252d 1 year ago
mergify added documentation
mergify added needs-rebase
Isotr0py commented on 2025-02-24
merge from main (36f73b59)
missing parameters (7caa1f74)
mergify removed needs-rebase
Isotr0py commented on 2025-02-25
Isotr0py approved these changes on 2025-02-25
remove TP assettion (a264af34)
Fix supported dtypes (28baf157)
Isotr0py enabled auto-merge (squash) 1 year ago
github-actions added ready
cjackal commented on 2025-02-26
disabled auto-merge 1 year ago (Manually disabled by user)
SzymonOzog requested a review from DarkLight1337 1 year ago
SzymonOzog requested a review from ywang96 1 year ago
SzymonOzog requested a review from simon-mo 1 year ago
SzymonOzog requested a review from WoosukKwon 1 year ago
SzymonOzog requested a review from njhill 1 year ago
SzymonOzog requested a review from comaniac 1 year ago
SzymonOzog requested a review from alexm-redhat 1 year ago
SzymonOzog requested a review from zhuohan123 1 year ago
SzymonOzog requested a review from youkaichao 1 year ago
mergify added ci/build
mergify added frontend
mergify added structured-output
mergify added speculative-decoding
mergify added v1
feedback: add activation parameter (e5473941)
SzymonOzog force pushed to e5473941 1 year ago
Isotr0py enabled auto-merge (squash) 1 year ago
vllm-bot merged 7f0be2aa into main 1 year ago
Reviewers
Isotr0py
huang-junhong
cjackal
mgoin
robertgshaw2-redhat
tlrmchlsmth
DarkLight1337
ywang96
simon-mo
WoosukKwon
njhill
comaniac
alexm-redhat
zhuohan123
youkaichao
Assignees
No one assigned
Labels
documentation
structured-output
frontend
speculative-decoding
ready
ci/build
v1
Milestone
No milestone