llama.cpp
cuda : add half2 __shfl_xor() for ROCm 5.5
#7263
Merged

Engininja2
Engininja2 cuda : add half2 __shfl_xor() for ROCm 5.5
79b044b0
mofosyne added the Nvidia GPU label
mofosyne added the Review Complexity : Medium label
JohannesGaessler approved these changes on 2024-05-16
JohannesGaessler
Engininja2
JohannesGaessler merged d233b507 into master 1 year ago
