vllm
fp8 online quant: split out Fp8OnlineLinearMethod
#32189
Merged


vkuzo
vkuzo requested a review from mgoin 31 days ago
vkuzo requested a review from robertgshaw2-redhat 31 days ago
vkuzo requested a review from yewentao256 31 days ago
vkuzo requested a review from pavanimajety 31 days ago
vkuzo requested a review from tlrmchlsmth 31 days ago
vkuzo commented on 2026-01-12
vkuzo commented on 2026-01-12
vkuzo commented on 2026-01-12
gemini-code-assist commented on 2026-01-12
robertgshaw2-redhat
mergify
kylesayrs commented on 2026-01-12
vkuzo force pushed 31 days ago
mergify
vkuzo force pushed 31 days ago
mgoin approved these changes on 2026-01-13
mgoin added the ready label
mgoin added the quantization label
vkuzo
vkuzo force pushed 29 days ago
vkuzo force pushed 29 days ago
vkuzo
vkuzo force pushed 27 days ago
vkuzo
vkuzo force pushed 27 days ago
vkuzo fp8 online quant: split out Fp8OnlineLinearMethod
0ebce01d
vkuzo force pushed to 0ebce01d 23 days ago
vkuzo
mgoin merged d2389c12 into main 23 days ago

Assignees
No one assigned