[AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce excessive codegen #171253
[AMDGPU] In promote-alloca, if index is dynamic, sandwich load with b…
55266345
Address feedback, add unaligned testcase. Now checks if VectorTy is m…
8ae99ccd
choikwa
force pushed
from
fffe8614
to
8ae99ccd
15 days ago
don't use gep, compare against SubVecTy and Load's alignment
25b622f7
formatting
0ef9d81c
more formatting
9b60c8e3
Address feedback, use LShr instead of UDiv, simplify checks
2bd4cd6d
space
a5f1a4b6
ruiling
approved these changes
on 2025-12-18
choikwa
changed the title [AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce number of extractelements as they have large expansion in the backend. [AMDGPU] In promote-alloca, if index is dynamic, sandwich load with bitcasts to reduce excessive codegen 12 days ago
add check for pow2 on TotalNumElts, needed for correct shift amt calc…
2eee4b64
fmt
aada9522
add non-pow2 alloca NumElts testcase
a6521f5d
TotalNumElts / NewNumElts = SubVecTy->getNumElts
4fb61fd2
arsenm
commented
on 2025-12-18
arsenm
commented
on 2025-12-18
Use TypeSize
dcfba6e9
ruiling
approved these changes
on 2025-12-19
add tests varying NumElts
6133aed4
choikwa
merged
5897f276
into main 10 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub