onnxruntime
[ARM64] MatMulNBits: use neon instrinsics to convert between fp16 and fp32
#22195
Merged

[ARM64] MatMulNBits: use neon instrinsics to convert between fp16 and fp32 #22195

fajin-corp merged 24 commits into main from fajin/nbmmfp16cvt2
fajin-corp
fajin-corp fajin-corp requested a review 1 year ago
github-advanced-security
github-advanced-security commented on 2024-09-24
fajin-corp added files to support arm fp16 and installed fp convert interface
8aeb6175
fajin-corp added fp16 to fp32 kernel
22365add
fajin-corp added fp32 to fp16 kernel
665e65ab
fajin-corp handling aligned source
804633dc
fajin-corp force align the temp buffer
0e184ba0
fajin-corp added ut
d842cb6d
fajin-corp passed intel build
13ec5fdf
fajin-corp fix type
bae14003
fajin-corp use constant argument
229d5e50
fajin-corp fix ut
909b47bb
fajin-corp fix ut
2df131c9
fajin-corp fixed ut
b7a2e519
fajin-corp added benchmark
286170a9
fajin-corp added benchmarks
76a4172b
fajin-corp reset mem addr aligning
d74cc175
fajin-corp fix linting
0f765edc
fajin-corp fix loop error
66323eb8
fajin-corp fajin-corp force pushed from 2eb9dba9 to 66323eb8 1 year ago
yufenglee
yufenglee commented on 2024-09-24
yufenglee
yufenglee commented on 2024-09-24
yufenglee
yufenglee commented on 2024-09-24
yufenglee
yufenglee commented on 2024-09-24
yufenglee
yufenglee commented on 2024-09-24
fajin-corp resolve comments
a1b86acc
yufenglee
yufenglee commented on 2024-09-25
yufenglee
yufenglee dismissed these changes on 2024-09-25
fajin-corp move scale and bias convert to prepack
2e60b0d7
fajin-corp fajin-corp dismissed their stale review via 2e60b0d7 1 year ago
fajin-corp fix build
097cbab8
fajin-corp fix build
1b7170f2
yufenglee
yufenglee dismissed these changes on 2024-09-25
fajin-corp added fallback
2ccc0476
fajin-corp fajin-corp dismissed their stale review via 2ccc0476 1 year ago
fajin-corp fix build
a0314a48
fajin-corp remove comments
1d7ab168
yufenglee
yufenglee approved these changes on 2024-09-26
fajin-corp fajin-corp merged 1942e40e into main 1 year ago
fajin-corp fajin-corp deleted the fajin/nbmmfp16cvt2 branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone