Split KV on MHA and Attention ops #18007
split kv on mha and att ops
e283e013
update num_splits heuristic
72326d8e
Merge branch 'main' into aciddelgado/mha_splitkv
868fb4b0
helper function for split buffers
9401ac9a
faxu
added triage:approved
faxu
added sdxl_llama
Merge branch 'main' into aciddelgado/mha_splitkv
7af984c7
lint
bd4d74c8
tianleiwu
approved these changes
on 2023-11-01
tianleiwu
merged
819b5a3e
into main 2 years ago
tianleiwu
deleted the aciddelgado/mha_splitkv branch 2 years ago
Assignees
No one assigned