text-generation-inference
fix(server): fix fp8 weight loading
#2268
Merged
OlivierDehaene merged 4 commits into main from fix/fp8_loading
fix(server): fix fp8 weight loading
119918cc
fixed scales loading
74f1f6a7
update snap
0d68619e
OlivierDehaene force pushed from 44a2784c to 0d68619e 1 year ago
revert default dtype
6d8e3659
danieldk approved these changes on 2024-07-22
OlivierDehaene merged 4844ff79 into main 1 year ago
OlivierDehaene deleted the fix/fp8_loading branch 1 year ago
Reviewers
danieldk