PR #7263 cuda : add half2 __shfl_xor() for ROCm 5.5

cuda : add half2 __shfl_xor() for ROCm 5.5 #7263

JohannesGaessler merged 1 commit into ggml-org:master from Engininja2:fix-rocm-5.5

cuda : add half2 __shfl_xor() for ROCm 5.5

79b044b0

mofosyne added Nvidia GPU

mofosyne added Review Complexity : Medium

JohannesGaessler approved these changes on 2024-05-16

JohannesGaessler merged d233b507 into master 1 year ago

Reviewers

JohannesGaessler

Assignees

No one assigned

Labels

Nvidia GPU Review Complexity : Medium

Milestone

No milestone