Reset schedule earlier to allow overlap with ggml graph computation on device #6933
Reset schedule earlier to allow overlap with graph computation on device
a2beaffe
moved reset to end of llama_decode_internal
34847caa
agray3
marked this pull request as ready for review 1 year ago
slaren
commented
on 2024-04-26
style fix
728562bc
slaren
approved these changes
on 2024-04-26
slaren
merged
928e0b70
into master 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub