llama.cpp
f4837d3e
- CUDA: tune GLM 4.7 Flash FA kernel selection logic (DGX Spark)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
5 days ago
CUDA: tune GLM 4.7 Flash FA kernel selection logic (DGX Spark)
References
#19142 - CUDA: tune GLM 4.7 Flash FA kernel selection logic (DGX Spark)
Author
ggerganov
Committer
ggerganov
Parents
68ac3acb
Loading