llama.cpp
HIP: Refactor mma for RDNA and CDNA
#17990
Merged

zhang-hui-yulo
mma.cuh for rdna4
318cb5b8
mma for rdna3
074b9314
mmq for rdna4
98846cb9
mmq for rdna3
62e4954d
align i-major and j-major
8b26bc38
cdna
afb0e3d5
zhang-hui-yulo requested a review from JohannesGaessler 155 days ago
zhang-hui-yulo requested a review from am17an 155 days ago
fix cuda error
6b8ed41f
zhang-hui-yulo marked this pull request as draft 155 days ago
add missing tile of mfma
6acad9c7
JohannesGaessler
JohannesGaessler commented on 2025-12-13
github-actions added the Nvidia GPU label
github-actions added the ggml label
zhang-hui-yulo
JohannesGaessler
zhang-hui-yulo
JohannesGaessler
zhang-hui-yulo
fix j-major wrong ne on CDNA
cffa070b
zhang-hui-yulo
JohannesGaessler
zhang-hui-yulo
zhang-hui-yulo marked this pull request as ready for review 152 days ago
JohannesGaessler
JohannesGaessler approved these changes on 2025-12-16
JohannesGaessler
fix gramma and empty spaces
cad07fa4
zhang-hui-yulo
JohannesGaessler merged acec774e into master 151 days ago
CISC
zhang-hui-yulo deleted the refactor_mma_for_rdna branch 150 days ago
zhang-hui-yulo
MikeLP
zhang-hui-yulo

