llama.cpp
a5114249
- [SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention (#23812)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 day ago
[SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention (#23812) * support Q4_1, Q5_0, Q5_1 * update ut case
References
#23812 - [SYCL] Support Q4_1, Q5_0, Q5_1 in Flash-attention
Author
arthw
Parents
41625226
Loading