Rope imbedding kernel to use avx2 #23694
profile init code
4523b3d0
from patch
59e27609
Merge branch 'liqun/GQA' into Intel-ROPE-kernel-to-use-AVX2
1611fccc
Merge branch 'Intel-ROPE-kernel-to-use-AVX2' into liqun/Intel-ROPE-ke…
6e2f414e
liqunfu
requested a review
1 year ago
node_name and remove profiler wrapper
35bf5176
m:erge branch 'main' into liqun/GQA
e1232fc8
Merge branch 'liqun/GQA' into liqun/Intel-ROPE-kernel-to-use-AVX2
b36fa2c8
snnn
commented
on 2025-02-14
remove profiling code
3964acc6
undo test_gqa_cpu.py
40a68542
lint
43bdb444
some edit
46353d84
fix data correctness in interleaved cases
e13ac56d
one more data correctness fix, add MLAS RoPE test to covert all scena…
9867ee78
lint
754be929
fix build - declaration error
4dc471b4
unused RoPERegisterAllShortExecuteTests ci failure
5ecee3c6
Merge branch 'liqun/Intel-ROPE-kernel-to-use-AVX2' of https://github.…
4e128da7
missing implementation
62d5eefb
add benchmark, etc.
d6de70f9
Update onnxruntime/test/mlas/bench/bench_rope.cpp
89e1411f
unaligned store
b46927ef
sin ->sin_data, constexpr
484d1285
yihonglyu
approved these changes
on 2025-02-20
liqunfu
merged
af04b202
into main 1 year ago
liqunfu
deleted the liqun/Intel-ROPE-kernel-to-use-AVX2 branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub