LopezCastroRoberto
changed the title from "[Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache" to "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache" 74 days ago
vectorized ops
b5e7b8f9
LopezCastroRoberto
changed the title from "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache" to "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2" 73 days ago
LopezCastroRoberto
changed the title from "[Attention][Perf][WIP] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2" to "[Attention][Perf] Optimize cp_gather_and_upconvert_fp8_kv_cache - DeepSeek-v3.2" 67 days ago
Merge branch 'main' into perf/gather_upconvert
e0635332
Merge branch 'main' into perf/gather_upconvert
53bd6e6a
Merge branch 'main' into perf/gather_upconvert
aa06e41c
Merge branch 'main' into perf/gather_upconvert
5e1d1263
Merge branch 'main' into perf/gather_upconvert
d9c6aeda
fix amd path
fd8e5161
disabled auto-merge 64 days ago: head branch was pushed to by a user without write access