[CIR][ABI][AArch64][Lowering] Support calls for struct types > 128 bits #1074

bruteforceboy · 2024-11-07T09:58:32Z

As the title says, this PR adds support for calls with struct types > 128 bits, building upon this PR.

The idea is gotten from the original Codegen, and I have added a couple of tests.

smeenai · 2024-11-07T18:57:20Z

CC @gitoleg for recent work in this area and @sitio-couto for ABI.

bcardosolopes

Thanks for working on this!

clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp

bcardosolopes

Mostly nits and good to go

clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp

@foo

…1335) In [PR#1074](#1074) we introduced calls for struct types > 128 bits, but there's is an issue here. [This](https://github.com/llvm/clangir/blob/3e17e7b9404e1a28bf33bdd5943f4a208134d479/clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp#L1169) is meant to be a `memcpy` of the alloca instead of directly passing the alloca, just like in the [OG](https://github.com/llvm/clangir/blob/3e17e7b9404e1a28bf33bdd5943f4a208134d479/clang/lib/CodeGen/CGCall.cpp#L5323). The PR was meant to use a `memcpy` and later handle cases where we don't need the `memcpy`. For example, running the following code snippet `tmp.c` using `bin/clang tmp.c -o tmp -Xclang -fclangir -Xclang -fclangir-call-conv-lowering --target=aarch64-none-linux-gnu`: ``` #include <stdio.h> typedef struct { int a, b, c, d, e; } S; void change(S s) { s.a = 10; } void foo(void) { S s; s.a = 9; change(s); printf("%d\n", s.a); } int main(void) { foo(); return 0; } ``` gives 10 instead of 9, because we pass the pointer instead of a copy. Relevant part of the OG LLVM output: ``` @foo() %s = alloca %struct.S, align 4 %byval-temp = alloca %struct.S, align 4 %a = getelementptr inbounds nuw %struct.S, ptr %s, i32 0, i32 0 store i32 9, ptr %a, align 4 call void @llvm.memcpy.p0.p0.i64(ptr align 4 %byval-temp, ptr align 4 %s, i64 20, i1 false) call void @change(ptr noundef %byval-temp) ``` Current LLVM output through CIR: ``` @foo() %1 = alloca %struct.S, i64 1, align 4 %2 = getelementptr %struct.S, ptr %1, i32 0, i32 0 store i32 9, ptr %2, align 4 %3 = load %struct.S, ptr %1, align 4 call void @change(ptr %1) ``` So, there should be a memcpy. This PR fixes this, and adds a comment/note for the future cases where we need to check if the copy is not needed. I have also updated the old test with structs having size > 128.

…ts (#1074) As the title says, this PR adds support for calls with struct types > 128 bits, building upon this [PR](#1068). The idea is gotten from the original Codegen, and I have added a couple of tests.

@foo

…1335) In [PR#1074](#1074) we introduced calls for struct types > 128 bits, but there's is an issue here. [This](https://github.com/llvm/clangir/blob/3e17e7b9404e1a28bf33bdd5943f4a208134d479/clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp#L1169) is meant to be a `memcpy` of the alloca instead of directly passing the alloca, just like in the [OG](https://github.com/llvm/clangir/blob/3e17e7b9404e1a28bf33bdd5943f4a208134d479/clang/lib/CodeGen/CGCall.cpp#L5323). The PR was meant to use a `memcpy` and later handle cases where we don't need the `memcpy`. For example, running the following code snippet `tmp.c` using `bin/clang tmp.c -o tmp -Xclang -fclangir -Xclang -fclangir-call-conv-lowering --target=aarch64-none-linux-gnu`: ``` #include <stdio.h> typedef struct { int a, b, c, d, e; } S; void change(S s) { s.a = 10; } void foo(void) { S s; s.a = 9; change(s); printf("%d\n", s.a); } int main(void) { foo(); return 0; } ``` gives 10 instead of 9, because we pass the pointer instead of a copy. Relevant part of the OG LLVM output: ``` @foo() %s = alloca %struct.S, align 4 %byval-temp = alloca %struct.S, align 4 %a = getelementptr inbounds nuw %struct.S, ptr %s, i32 0, i32 0 store i32 9, ptr %a, align 4 call void @llvm.memcpy.p0.p0.i64(ptr align 4 %byval-temp, ptr align 4 %s, i64 20, i1 false) call void @change(ptr noundef %byval-temp) ``` Current LLVM output through CIR: ``` @foo() %1 = alloca %struct.S, i64 1, align 4 %2 = getelementptr %struct.S, ptr %1, i32 0, i32 0 store i32 9, ptr %2, align 4 %3 = load %struct.S, ptr %1, align 4 call void @change(ptr %1) ``` So, there should be a memcpy. This PR fixes this, and adds a comment/note for the future cases where we need to check if the copy is not needed. I have also updated the old test with structs having size > 128.

@foo

…lvm#1335) In [PR#1074](llvm/clangir#1074) we introduced calls for struct types > 128 bits, but there's is an issue here. [This](https://github.com/llvm/clangir/blob/3e17e7b9404e1a28bf33bdd5943f4a208134d479/clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp#L1169) is meant to be a `memcpy` of the alloca instead of directly passing the alloca, just like in the [OG](https://github.com/llvm/clangir/blob/3e17e7b9404e1a28bf33bdd5943f4a208134d479/clang/lib/CodeGen/CGCall.cpp#L5323). The PR was meant to use a `memcpy` and later handle cases where we don't need the `memcpy`. For example, running the following code snippet `tmp.c` using `bin/clang tmp.c -o tmp -Xclang -fclangir -Xclang -fclangir-call-conv-lowering --target=aarch64-none-linux-gnu`: ``` typedef struct { int a, b, c, d, e; } S; void change(S s) { s.a = 10; } void foo(void) { S s; s.a = 9; change(s); printf("%d\n", s.a); } int main(void) { foo(); return 0; } ``` gives 10 instead of 9, because we pass the pointer instead of a copy. Relevant part of the OG LLVM output: ``` @foo() %s = alloca %struct.S, align 4 %byval-temp = alloca %struct.S, align 4 %a = getelementptr inbounds nuw %struct.S, ptr %s, i32 0, i32 0 store i32 9, ptr %a, align 4 call void @llvm.memcpy.p0.p0.i64(ptr align 4 %byval-temp, ptr align 4 %s, i64 20, i1 false) call void @change(ptr noundef %byval-temp) ``` Current LLVM output through CIR: ``` @foo() %1 = alloca %struct.S, i64 1, align 4 %2 = getelementptr %struct.S, ptr %1, i32 0, i32 0 store i32 9, ptr %2, align 4 %3 = load %struct.S, ptr %1, align 4 call void @change(ptr %1) ``` So, there should be a memcpy. This PR fixes this, and adds a comment/note for the future cases where we need to check if the copy is not needed. I have also updated the old test with structs having size > 128.

…ts (llvm#1074) As the title says, this PR adds support for calls with struct types > 128 bits, building upon this [PR](llvm#1068). The idea is gotten from the original Codegen, and I have added a couple of tests.

bruteforceboy added 5 commits November 7, 2024 11:25

support return and calls for struct size > 128 bits

0918a4f

add tests

2183744

update tests

02810d0

add assignment

2407385

clean up

3edead1

bruteforceboy requested review from bcardosolopes and lanza as code owners November 7, 2024 09:58

use findAlloca

7c7034f

smeenai requested a review from gitoleg November 7, 2024 18:57

smeenai requested a review from sitio-couto November 7, 2024 18:57

bcardosolopes requested changes Nov 8, 2024

View reviewed changes

bruteforceboy added 3 commits November 13, 2024 12:02

Merge branch 'llvm:main' into aarch64-struct-128

d396669

fix review comments

d2ebca5

more cleanup

1ae1c90

bcardosolopes requested changes Nov 13, 2024

View reviewed changes

clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp Outdated Show resolved Hide resolved

clang/lib/CIR/Dialect/Transforms/TargetLowering/LowerFunction.cpp Outdated Show resolved Hide resolved

fix review comments

0dced91

bcardosolopes approved these changes Nov 14, 2024

View reviewed changes

bcardosolopes merged commit 1b97f92 into llvm:main Nov 14, 2024
6 checks passed

bruteforceboy mentioned this pull request Feb 11, 2025

[CIR][ABI][AArch64][Lowering] Fix calls for struct types > 128 bits #1335

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CIR][ABI][AArch64][Lowering] Support calls for struct types > 128 bits #1074

[CIR][ABI][AArch64][Lowering] Support calls for struct types > 128 bits #1074

Uh oh!

bruteforceboy commented Nov 7, 2024

Uh oh!

smeenai commented Nov 7, 2024 •

edited

Loading

Uh oh!

bcardosolopes left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bcardosolopes left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[CIR][ABI][AArch64][Lowering] Support calls for struct types > 128 bits #1074

[CIR][ABI][AArch64][Lowering] Support calls for struct types > 128 bits #1074

Uh oh!

Conversation

bruteforceboy commented Nov 7, 2024

Uh oh!

smeenai commented Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bcardosolopes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bcardosolopes left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

smeenai commented Nov 7, 2024 •

edited

Loading