feat(cache): StaticCache uses index_copy_ to avoid useless copy #31857
feat(cache): StaticCache uses index_copy_ to avoid useless copy
d329ad28
feat(cache): SlidingWindowCache uses index_copy_ to avoid useless copy
53e99d18
fix(cache): fallback of index_copy_ when not implemented
8e622ec0
fix(cache): in index_copy_ ensure tensors are on same device
aba28b57
[run slow] llama
02608dd1
gante
approved these changes
on 2024-07-12
fix(cache): add move of cache_position to same device in SlidingWindo…
4c818f31
Revert "[run slow] llama"
a76cfb7f
ArthurZucker
deleted the alvaro/static_cache_xla branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub