PR #17505 CUDA: generalized (mma) FA, add Volta support

CUDA: generalized (mma) FA, add Volta support #17505

JohannesGaessler merged 8 commits into ggml-org:master from JohannesGaessler:cuda-fa-mma-update-5

JohannesGaessler requested a review from

am17an 210 days ago

JohannesGaessler requested a review from

ggerganov 210 days ago

github-actions added Nvidia GPU

github-actions added ggml

JohannesGaessler changed the title ~~CUDA: ganeralized (mma) FA, add Volta support~~ CUDA: generalized (mma) FA, add Volta support 210 days ago

JohannesGaessler force pushed 210 days ago

JohannesGaessler force pushed to 2ef0c5f6 210 days ago

CUDA: generalized (mma) FA, add Volta support

17f191e9

fix const correctness

e2c50b1d

fix turing config lookup

301ae300

JohannesGaessler force pushed from b92e6f86 to 301ae300 207 days ago

refactor template parameters

13500e8c

adjust kernel selection logic

394ced5f

fix trailing whitespace

3e1ca0c6

fix kernel selection logic

ec176eef

am17an commented on 2025-12-02

ggerganov approved these changes on 2025-12-02

am17an approved these changes on 2025-12-02

use struct for MMA FA kernel config

d861a34e

JohannesGaessler force pushed from 323c6830 to d861a34e 202 days ago

JohannesGaessler merged 2e1c9cd8 into master 202 days ago

LostRuins commented on 2025-12-06

Reviewers

am17an

ggerganov

LostRuins

Assignees

No one assigned

Labels

Nvidia GPU ggml

Milestone

No milestone