Arm backend: Add 16A8W support and test for slice operation #13798

Ninja91 · 2025-08-29T06:42:54Z

Stack from ghstack (oldest at bottom):

Add 16A8W quantization support and test for the slice operation in ExecutorTorch ARM backend.

This follows the pattern established for linear, mul, sigmoid, and tanh operations, extending int16 support to slice operations.

Changes:

Add INT16 dtype validation support in op_slice.py
Add test_slice_tensor_16a8w_tosa_INT test function
Enable test_slice.py in test targets configuration

The 16A8W configuration uses 16-bit activations with 8-bit weights, enabling higher precision for activations while maintaining weight efficiency.

Differential Revision: D80511095

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218

Add 16A8W quantization support and test for the slice operation in ExecutorTorch ARM backend. This follows the pattern established for linear, mul, sigmoid, and tanh operations, extending int16 support to slice operations. Changes: - Add INT16 dtype validation support in op_slice.py - Add test_slice_tensor_16a8w_tosa_INT test function - Enable test_slice.py in test targets configuration The 16A8W configuration uses 16-bit activations with 8-bit weights, enabling higher precision for activations while maintaining weight efficiency. Differential Revision: [D80511095](https://our.internmc.facebook.com/intern/diff/D80511095/) [ghstack-poisoned]

pytorch-bot · 2025-08-29T06:42:57Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13798

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 7 Unrelated Failures

As of commit ea012f7 with merge base 1d37845 ():

NEW FAILURES - The following jobs have failed:

trunk / test-qnn-optimum-model (fp32, mobilevit_v2) / linux-job (gh)
RuntimeError: Command docker exec -t 74931f959187d14a489aab80178c1f23cfc2d2cea031e417eef176d1e75dc6b2 /exec failed with exit code 92
trunk / test-qnn-optimum-model (fp32, roberta) / linux-job (gh)
RuntimeError: Command docker exec -t d73bc781b99aa212579a372ed6f0dd613ab69291f85b129469abcff95047b9f8 /exec failed with exit code 92

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Build Windows Wheels / pytorch/executorch / build-wheel-py3_10-cpu (gh) (trunk failure)
RuntimeError: Failed to install QNN SDK. Please check the logs above.
Build Windows Wheels / pytorch/executorch / upload / upload-wheel-py3_10-cpu (gh) (trunk failure)
pull / test-binary-size-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-moshi-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-openvino-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-samsung-models-linux / linux-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / test-setup-linux-gcc / linux-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-08-29T06:43:36Z