llama.cpp
CUDA: don't route RDNA3.5 flash attention to the rocWMMA kernel
#24562
Open

liminfei-amd
liminfei-amd committed b311e833: CUDA: don't route RDNA3.5 flash attention to the rocWMMA kernel
github-actions added the Nvidia GPU label
github-actions added the ggml label
ggml-gh-bot
liminfei-amd marked this pull request as ready for review 2 days ago
liminfei-amd requested a review from IMbackK 2 days ago
liminfei-amd requested a review 2 days ago
liminfei-amd
