QAttention calls into MatMulIntToFloat instead of Dequantize+GEMM #16851
jeffbloo force-pushed the DmlPrototype branch from eb6222b2 to 0790b051 2 years ago
QAttention calls into MatMulIntToFloat instead of Dequantize+GEMM 23ad7917
rebase DmlPrototype 98bd750b
consistent style f2aff002
zhangxiang1993 deleted the user/xianz/QAttention_v2 branch 2 years ago