vllm
[Hardware][NV] Fix Modelopt model loading for k-v-scales for Llama models.
#11787
Merged

[Hardware][NV] Fix Modelopt model loading for k-v-scales for Llama models. #11787

pavanimajety
github-actions
mgoin mgoin requested a review from mgoin mgoin 337 days ago
pavanimajety pavanimajety force pushed 337 days ago
pavanimajety pavanimajety marked this pull request as ready for review 337 days ago
pavanimajety
mgoin
mgoin commented on 2025-01-08
pavanimajety pavanimajety changed the title [Hardware][NV] Fix Modelopt model loading for k-v-scales [Hardware][NV] Fix Modelopt model loading for k-v-scales for Llama models. 327 days ago
pavanimajety pavanimajety force pushed 327 days ago
pavanimajety [Hardware][NV] Fix Modelopt model loading for k-v-scales
16e4650e
pavanimajety Format
c02df146
pavanimajety NFC: remove print
aee253a5
pavanimajety Address Feedback
229ebe8e
pavanimajety Add scales to mixtral models as well
705cf4e6
pavanimajety pavanimajety force pushed to 705cf4e6 322 days ago
pavanimajety pavanimajety requested a review from mgoin mgoin 322 days ago
mgoin Merge branch 'main' into modelopt-k-v-scales
045e2000
mgoin
mgoin approved these changes on 2025-01-27
mgoin mgoin added quantization
mgoin mgoin added ready
mgoin mgoin enabled auto-merge (squash) 317 days ago
pavanimajety
disabled auto-merge 315 days ago
Manually disabled by user
simon-mo simon-mo merged b02fd288 into main 315 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone