Improve support for GPUs with capability < 8 #2575
danieldk
force pushed
from
491c0269
to
5bd79b8a
1 year ago
Improve support for GPUs with capability < 8
bee5ee1f
nix: add flash-attn-v1 to the server environment
8c0f9312
danieldk
force pushed
from
5bd79b8a
to
8c0f9312
1 year ago
danieldk
marked this pull request as ready for review 1 year ago
Narsil
commented
on 2024-09-27
Narsil
commented
on 2024-09-27
Narsil
commented
on 2024-09-27
Narsil
commented
on 2024-09-27
Move disabling prefix caching into the block of exceptions
a29636ee
Capability as `usize`s
3eb68a37
Narsil
approved these changes
on 2024-09-27
danieldk
merged
5b6b74e2
into main 1 year ago
danieldk
deleted the bugfix/pre-cc-8 branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub