transformers
[Quantization] Add cutlass kernel for FP8
#43304
Merged

[Quantization] Add cutlass kernel for FP8 #43304

ArthurZucker merged 5 commits into main from quantization-kernels
MekkCyber
MekkCyber add cutlass
4dfe49ec
MekkCyber MekkCyber requested a review from SunMarc SunMarc 74 days ago
HuggingFaceDocBuilderDev
sayakpaul
sayakpaul commented on 2026-01-15
sayakpaul
sayakpaul commented on 2026-01-15
SunMarc
SunMarc commented on 2026-01-15
SunMarc Merge branch 'main' into quantization-kernels
6939e63c
MekkCyber feedback
2d15c091
SunMarc
SunMarc approved these changes on 2026-01-16
MekkCyber Merge branch 'main' into quantization-kernels
cfd4b9eb
github-actions
SunMarc Merge branch 'main' into quantization-kernels
503b9a24
SunMarc
ArthurZucker
ArthurZucker approved these changes on 2026-01-28
ArthurZucker ArthurZucker merged 2b7bc596 into main 61 days ago
ArthurZucker ArthurZucker deleted the quantization-kernels branch 61 days ago
ArthurZucker

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone