Route fp16 HQNBIT_CompInt8 (4-bit and 8-bit) through fp32 MLAS path in MatMulNBits #27820
Route fp16 HQNBIT_CompInt8 through fp32 MLAS path for 4-bit and 8-bit
c0c51bed
Remove dead HQ4BitGemm_CompInt8 and HQ8BitGemm_CompInt8 MLAS code
a217afb1
lint
0931a5f2
Fix HQNBIT_CompInt8 PrePack bugs for 4-bit and 8-bit
316b13fa
jambayk marked this pull request as ready for review 89 days ago
jambayk enabled auto-merge (squash) 89 days ago
vraspar dismissed these changes on 2026-03-24
jambayk dismissed their stale review via 23094525 89 days ago
jambayk force pushed from fc606f56 to 23094525 89 days ago
vraspar dismissed these changes on 2026-03-24
Address review: ORT_ENFORCE for scales, move SQNBIT check to GetCompu…
138318a0
jambayk dismissed their stale review via 138318a0 89 days ago
jambayk force pushed from 23094525 to 138318a0 89 days ago
vraspar approved these changes on 2026-03-25
jambayk merged 36242c6c into main 88 days ago
jambayk deleted the jambayk/mnb-4-16 branch 88 days ago
Assignees
No one assigned