[Quant][PT2E][X86] Enable annotation of aten.mul.tensor with X86InductorQuantizer #150831

Xia-Weiwen · 2025-04-08T07:37:23Z

Stack from ghstack (oldest at bottom):

Summary
This PR adds support for annotating aten.mul.Tensor in X86InductorQuantizer.
mul is not annotated by default. To enable it, set a quantization config for torch.mul on the quantizer:

quantizer.set_function_type_qconfig(
    torch.mul, quantizer.get_global_quantization_config()
)

After convert_pt2e, the converted graph contains patterns like:

quantize_per_tensor_default = torch.ops.quantized_decomposed.quantize_per_tensor.default(x, ...)
dequantize_per_tensor_default = torch.ops.quantized_decomposed.dequantize_per_tensor.default(quantize_per_tensor_default, ...)
quantize_per_tensor_default_1 = torch.ops.quantized_decomposed.quantize_per_tensor.default(y, ...)
dequantize_per_tensor_default_1 = torch.ops.quantized_decomposed.dequantize_per_tensor.default(quantize_per_tensor_default_1, ...)
mul = torch.ops.aten.mul.Tensor(dequantize_per_tensor_default, dequantize_per_tensor_default_1)
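
As a rough illustration of what the quantize/dequantize/mul pattern above computes, here is a minimal pure-Python sketch. It does not call PyTorch; the helper names mirror the op names but are stand-ins written for this example, plain lists stand in for tensors, and the uint8 range [0, 255] and the scale/zero-point values are assumptions chosen for illustration.

```python
def quantize_per_tensor(x, scale, zero_point, qmin=0, qmax=255):
    """Affine-quantize floats to integers, saturating to [qmin, qmax]."""
    return [min(max(round(v / scale) + zero_point, qmin), qmax) for v in x]


def dequantize_per_tensor(q, scale, zero_point):
    """Map quantized integers back to floats."""
    return [(v - zero_point) * scale for v in q]


def fake_quant_mul(x, y, x_qparams, y_qparams):
    """Quantize and dequantize each input, then multiply elementwise in float."""
    x_dq = dequantize_per_tensor(quantize_per_tensor(x, *x_qparams), *x_qparams)
    y_dq = dequantize_per_tensor(quantize_per_tensor(y, *y_qparams), *y_qparams)
    return [a * b for a, b in zip(x_dq, y_dq)]


# Assumed (scale, zero_point) pairs; every input below is an exact multiple of the scale.
x = [0.5, -1.0, 2.0]
y = [1.5, 0.25, -0.75]
out = fake_quant_mul(x, y, (0.25, 128), (0.25, 128))
print(out)
```

Because both inputs pass through a quantize/dequantize pair before the multiply, a backend can later match this pattern and replace it with a quantized kernel; the sketch only shows the reference numerics.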

Test plan

pytest test/quantization/pt2e/test_x86inductor_quantizer.py -k test_annotate_mul_tensor

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

[ghstack-poisoned]

pytorch-bot · 2025-04-08T07:37:26Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150831


✅ You can merge normally! (1 Unrelated Failure)

As of commit 71e079b with merge base 01f226b:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / linux-jammy-py3-clang12-executorch / test (executorch, 1, 1, lf.ephemeral.linux.2xlarge) (gh) (#144480)
examples/models/llama3_2_vision/text_decoder/test/test_text_decoder.py::TextDecoderTest::test_llama3_2_text_decoder_aoti

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…torQuantizer ghstack-source-id: b336298 Pull Request resolved: #150831

Copilot

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (1)

test/quantization/pt2e/test_x86inductor_quantizer.py:2877

[nitpick] Avoid using 'type' as a variable name because it shadows the built-in function. Consider renaming it to 'model_type' or a similar descriptive name.

for type in [0, 1, 2]:

[ghstack-poisoned]

…torQuantizer ghstack-source-id: 23478a7 Pull Request resolved: #150831

leslie-fang-intel

LGTM. A small comment.

test/quantization/pt2e/test_x86inductor_quantizer.py

Xia-Weiwen · 2025-04-11T02:35:46Z

Hi @jerryzh168 Could you please review this PR? Thanks.

jerryzh168 · 2025-04-11T03:26:55Z

test/quantization/pt2e/test_x86inductor_quantizer.py

+                else:
+                    return x * y.sum().item()
+
+        for type in [0, 1, 2, 3]:


nit: I think we can improve [0, 1, 2, 3] by defining different classes for each test case here

Xia-Weiwen (Apr 11, 2025): Thanks. Updated.
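
The suggestion above (one class per test case instead of an integer switch) could look roughly like the following sketch. This is not the code that landed in the PR: the class names and the run_cases helper are hypothetical, and plain Python lists stand in for tensors so the example runs without PyTorch. It also avoids the loop variable named type that Copilot flagged.

```python
class MulTensor:
    """Hypothetical case: elementwise product of two inputs."""

    def forward(self, x, y):
        return [a * b for a, b in zip(x, y)]


class MulScalar:
    """Hypothetical case: multiply by a scalar, in the spirit of x * y.sum().item()."""

    def forward(self, x, y):
        s = sum(y)
        return [a * s for a in x]


def run_cases(cases, x, y):
    """Instantiate each case class and collect its output, keyed by class name."""
    return {case.__name__: case().forward(x, y) for case in cases}


results = run_cases([MulTensor, MulScalar], [1.0, 2.0], [3.0, 4.0])
print(results)
```

Each case now has a descriptive name in test output, and adding a case means adding a class rather than another branch in forward.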

[ghstack-poisoned]

…torQuantizer ghstack-source-id: dbd0974 Pull Request resolved: #150831

Xia-Weiwen · 2025-04-18T08:02:14Z

Closing this PR because the work needs to move to torchao.

Update

9614697

[ghstack-poisoned]

Xia-Weiwen requested a review from jerryzh168 as a code owner April 8, 2025 07:37

Xia-Weiwen mentioned this pull request Apr 8, 2025

[Quant][PT2E][X86] enable qconv1d-relu fusion #150751

Closed

pytorch-bot bot added the "release notes: quantization" label (release notes category) Apr 8, 2025

Xia-Weiwen added a commit that referenced this pull request Apr 8, 2025

[Quant][PT2E][X86] Enable annotation of aten.mul.tensor with X86Induc…

b818f27

…torQuantizer ghstack-source-id: b336298 Pull Request resolved: #150831

Xia-Weiwen marked this pull request as draft April 8, 2025 07:38

pytorchbot added the open source label Apr 8, 2025

Xia-Weiwen requested review from leslie-fang-intel and Copilot April 8, 2025 11:24

Copilot AI reviewed Apr 8, 2025

Xia-Weiwen added the "intel" label (This tag is for PR from Intel) Apr 8, 2025

Update

9d31e46

[ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Apr 9, 2025

[Quant][PT2E][X86] Enable annotation of aten.mul.tensor with X86Induc…

b0f7d64

…torQuantizer ghstack-source-id: 23478a7 Pull Request resolved: #150831

leslie-fang-intel approved these changes Apr 9, 2025

test/quantization/pt2e/test_x86inductor_quantizer.py (outdated, resolved)

Xia-Weiwen marked this pull request as ready for review April 10, 2025 02:29

jerryzh168 reviewed Apr 11, 2025

jerryzh168 approved these changes Apr 11, 2025

Update

a83b7c3

[ghstack-poisoned]

Xia-Weiwen mentioned this pull request Apr 11, 2025

[Quant][X86] add an op to compute uint8 pointwise mul #151112

Closed

Update

a52dfd2

[ghstack-poisoned]

Update

71e079b

[ghstack-poisoned]

Xia-Weiwen added a commit that referenced this pull request Apr 14, 2025

[Quant][PT2E][X86] Enable annotation of aten.mul.tensor with X86Induc…

3b72757

…torQuantizer ghstack-source-id: dbd0974 Pull Request resolved: #150831

Xia-Weiwen closed this Apr 18, 2025

Xia-Weiwen mentioned this pull request Apr 18, 2025

[Quant][PT2E][X86] Enable annotation of aten.mul.tensor with X86InductorQuantizer pytorch/ao#2075

Merged

github-actions bot deleted the gh/Xia-Weiwen/36/head branch May 25, 2025 02:21
