vllm
[Hardware][NV] Fix Modelopt model loading for k-v-scales for Llama models.
#11787
Merged

[Hardware][NV] Fix Modelopt model loading for k-v-scales for Llama models. #11787

pavanimajety
github-actions
mgoin mgoin requested a review from mgoin mgoin 1 year ago
pavanimajety pavanimajety force pushed 1 year ago
pavanimajety pavanimajety marked this pull request as ready for review 1 year ago
pavanimajety
mgoin
mgoin commented on 2025-01-08
pavanimajety pavanimajety changed the title [Hardware][NV] Fix Modelopt model loading for k-v-scales [Hardware][NV] Fix Modelopt model loading for k-v-scales for Llama models. 1 year ago
pavanimajety pavanimajety force pushed 1 year ago
pavanimajety [Hardware][NV] Fix Modelopt model loading for k-v-scales
16e4650e
pavanimajety Format
c02df146
pavanimajety NFC: remove print
aee253a5
pavanimajety Address Feedback
229ebe8e
pavanimajety Add scales to mixtral models as well
705cf4e6
pavanimajety pavanimajety force pushed to 705cf4e6 1 year ago
pavanimajety pavanimajety requested a review from mgoin mgoin 1 year ago
mgoin Merge branch 'main' into modelopt-k-v-scales
045e2000
mgoin
mgoin approved these changes on 2025-01-27
mgoin mgoin added quantization
mgoin mgoin added ready
mgoin mgoin enabled auto-merge (squash) 1 year ago
pavanimajety
disabled auto-merge 1 year ago
Manually disabled by user
simon-mo simon-mo merged b02fd288 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone