[Bugfix][TOPI] Fix a bug in arm_cpu int8 conv2d i8mm schedule #15484

Anndrey24 · 2023-08-04T12:55:18Z

topi.arm_cpu.schedule_conv2d_NHWC_quantized_interleaved was failing compilation with the +i8mm extension enabled (as done in #14888) whenever the output height and output width were both equal to 1, such that OH x OW = 1.

Padding was being removed during the tir.BufferShapeLegalize pass, causing an error in the tir.BufferBindUnwrapper pass. Some of the removed padding was necessary for tensorize (using the gemm_acc_2x2_int8_int8_int32 intrinsic), which expects 2x2 output tiles. However, because of the optimisations mentioned above, the output tensor C_interleaved was reduced to having 1x2 tiles instead.

e.g. for A = [1x1x1x8], W = [1x1x8x24], C = [1x1x1x24]:

Before fix: C_interleaved = T.Buffer((1, 1, 2, 1, 6, 1, 2), "int32”)
After fix: C_interleaved = T.Buffer((1, 1, 2, 1, 6, 2, 2), "int32”)

To make sure the required padding is left untouched, while the rest of it is still removed, a dummy reference to the needed axis is declared.

In the end, the leftover padding is still disregarded when computing the final output tensor C.

`topi.arm_cpu.schedule_conv2d_NHWC_quantized_interleaved` was failing compilation with the `+i8mm` extension enabled whenever the output height and output width were both equal to 1, such that OH x OW = 1. Padding was being removed during the `tir.BufferShapeLegalize` pass, causing an error in the `tir.BufferBindUnwrapper` pass. Some of the removed padding was necessary for tensorize (using the `gemm_acc_2x2_int8_int8_int32` intrinsic), which expects 2x2 output tiles. However, because of the optimisations mentioned above, the output tensor `C_interleaved` was reduced to having 1x2 tiles instead. e.g. for A = [1x1x1x8], W = [1x1x8x24], C = [1x1x1x24]: - Before fix: `C_interleaved = T.Buffer((1, 1, 2, 1, 6, 1, 2), "int32”)` - After fix: `C_interleaved = T.Buffer((1, 1, 2, 1, 6, 2, 2), "int32”)` To make sure the required padding is left untouched, while the rest of it is still removed, a dummy reference to the needed axis is declared. Finally, the leftover padding is still disregarded when computing the final output tensor `C`.

tvm-bot · 2023-08-04T12:55:21Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

No users to tag found in teams: bugfix, topi _{See #10317 for details}

_{Generated by tvm-bot}

Anndrey24 · 2023-08-04T14:20:59Z

cc @ekalda @neildhickey @lhutton1 @leandron

ekalda

Thanks @Anndrey24, LGTM, great work!

ekalda · 2023-08-07T08:40:06Z

Thanks @Anndrey24!

lhutton1 mentioned this pull request Aug 4, 2023

[TEST] Enable arm_cpu qnn.conv2d i8mm test #14888

Closed

ekalda approved these changes Aug 4, 2023

View reviewed changes

ekalda merged commit 93356cd into apache:main Aug 7, 2023

Anndrey24 deleted the i8mm-schedule-bugfix branch August 7, 2023 08:58

ysh329 mentioned this pull request Oct 18, 2023

[Release] v0.14.0 Release Candidate Notes #15948

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Bugfix][TOPI] Fix a bug in arm_cpu int8 conv2d i8mm schedule #15484

[Bugfix][TOPI] Fix a bug in arm_cpu int8 conv2d i8mm schedule #15484

Uh oh!

Anndrey24 commented Aug 4, 2023

Uh oh!

tvm-bot commented Aug 4, 2023

Uh oh!

Anndrey24 commented Aug 4, 2023

Uh oh!

ekalda left a comment

Uh oh!

ekalda commented Aug 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Bugfix][TOPI] Fix a bug in arm_cpu int8 conv2d i8mm schedule #15484

[Bugfix][TOPI] Fix a bug in arm_cpu int8 conv2d i8mm schedule #15484

Uh oh!

Conversation

Anndrey24 commented Aug 4, 2023

Uh oh!

tvm-bot commented Aug 4, 2023

Uh oh!

Anndrey24 commented Aug 4, 2023

Uh oh!

ekalda left a comment

Choose a reason for hiding this comment

Uh oh!

ekalda commented Aug 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants