Add fp16 support for 8-bit MatMulNBits on ARM64 and fix pre-existing bugs #27692
Add HQNBIT_CompFp16 support for 8-bit MatMulNBits on ARM64 NEON
93529bc8
acc 4 support for HQ
566df8df
fix bias bug, compint8 support for 8 bits
cd67587a
remove MLAS_TARGET_AMD64_IX86 guard on QuantBDataWorkspace
128908ae
fix scale packing
8415246a
only pack for 8 bit
6332d94a
address reviews
218a6f9b
more reviews
ce7d5335
fix DequantB8Bit reference in dequant test
d4064093
Fix Float16_8b_ARM_CompFp16 SIGTRAP in Debug builds
886971a8
Add fp16 CompInt8 8-bit tests and improve N/K/BlockSize coverage
8359fe0b
jambayk
enabled auto-merge (squash) 10 days ago
Increase fp16 8-bit test tolerances for large-K cases
e0a1834d
jambayk
dismissed their stale review
via e0a1834d
10 days ago
jambayk
merged
c1f38c03
into main 10 days ago
jambayk
deleted the jambayk/mnb-arm-16 branch 10 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub