llvm-project
[flang][cuda] Use PTX instruction for atomicAdd with 4xf32
#169581
Merged

clementval
clementval [flang][cuda] Use libdevice for atomicAdd with 4xf32
9695a91b
clementval requested a review from wangzpgi 79 days ago
llvmbot added flang
llvmbot added flang:fir-hlfir
clementval changed the title from "[flang][cuda] Use libdevice for atomicAdd with 4xf32" to "[flang][cuda] Use PTX instruction for atomicAdd with 4xf32" 79 days ago
wangzpgi
wangzpgi approved these changes on 2025-11-25
clementval enabled auto-merge (squash) 79 days ago
clementval merged f7a9fcad into main 79 days ago

