llvm-project
[AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce excessive codegen
#171253
Merged

[AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce excessive codegen #171253

choikwa merged 13 commits into llvm:main from choikwa:alloca-bitcast
choikwa
choikwa choikwa requested a review from arsenm arsenm 21 days ago
choikwa choikwa requested a review from nhaehnle nhaehnle 21 days ago
choikwa choikwa requested a review from ruiling ruiling 21 days ago
choikwa choikwa requested a review from Pierre-vh Pierre-vh 21 days ago
choikwa choikwa requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 21 days ago
llvmbot llvmbot added backend:AMDGPU
llvmbot
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2025-12-09
github-actions
choikwa
ruiling
choikwa
ruiling
nhaehnle
nhaehnle commented on 2025-12-12
choikwa [AMDGPU] In promote-alloca, if index is dynamic, sandwich load with b…
55266345
choikwa Address feedback, add unaligned testcase. Now checks if VectorTy is m…
8ae99ccd
choikwa choikwa force pushed from fffe8614 to 8ae99ccd 15 days ago
github-actions
github-actions
nhaehnle
nhaehnle commented on 2025-12-15
choikwa don't use gep, compare against SubVecTy and Load's alignment
25b622f7
choikwa formatting
0ef9d81c
choikwa more formatting
9b60c8e3
nhaehnle
nhaehnle commented on 2025-12-16
ruiling
ruiling commented on 2025-12-16
choikwa Address feedback, use LShr instead of UDiv, simplify checks
2bd4cd6d
choikwa space
a5f1a4b6
ruiling
ruiling approved these changes on 2025-12-18
ruiling
ruiling commented on 2025-12-18
choikwa choikwa changed the title [AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce number of extractelements as they have large expansion in the backend. [AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce excessive codegen 12 days ago
choikwa add check for pow2 on TotalNumElts, needed for correct shift amt calc…
2eee4b64
choikwa fmt
aada9522
choikwa add non-pow2 alloca NumElts testcase
a6521f5d
choikwa TotalNumElts / NewNumElts = SubVecTy->getNumElts
4fb61fd2
arsenm
arsenm commented on 2025-12-18
arsenm
arsenm commented on 2025-12-18
choikwa Use TypeSize
dcfba6e9
ruiling
ruiling approved these changes on 2025-12-19
choikwa add tests varying NumElts
6133aed4
choikwa choikwa merged 5897f276 into main 10 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone