Arm backend: Cleanup dim-order and permute handling by AdrianLundell · Pull Request #19278 · pytorch/executorch

AdrianLundell · 2026-05-04T11:45:25Z

Replace u55 permute dimension check with a u55-only pass decomposing large permutes. This pass checks for support by compiling targeted permutes using Vela to ensure alignment between Executorch and Vela.
Remove passes and testing not required anymore after dim-order update.
Remove all outdated mention of dim-order in the arm backend.

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell

- Replace u55 permute dimension check with a u55-only pass decomposing large permutes. This pass checks for support by compiling targeted permutes using Vela to ensure alignment between Executorch and Vela. - Remove passes and testing not required anymore after dim-order update. - Remove all outdated mention of dim-order in the arm backend. Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I098db4539179cb223b5c76683720e68c1bbecb8f

pytorch-bot · 2026-05-04T11:45:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19278

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Ubuntu services are down

✅ You can merge normally! (3 Unrelated Failures)

As of commit a606e58 with merge base 69989b7 ():

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.
trunk / unittest-release / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

AdrianLundell · 2026-05-04T11:45:45Z

No buck2 changes required according to codex

Copilot

Pull request overview

This PR removes the old Arm backend dim-order/memory-format machinery and shifts permute handling toward explicit graph rewrites, including a new U55-specific large-permute decomposition pass. It mainly refactors TOSA lowering/serialization paths, operator support checks, and the associated Arm backend tests.

Changes:

Replace dim-order-based TOSA shape/constant handling with direct shape normalization and remove the old transpose/memory-format infrastructure.
Add DecomposePermuteForU55Pass and update U55 permute/view/select expectations and tests around the new behavior.
Delete legacy passes/operators/tests that were only needed for the previous dim-order approach.

Reviewed changes

Copilot reviewed 33 out of 33 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`backends/arm/tosa/utils.py`	Renames shape helper to `normalize_symint`.
`backends/arm/tosa/mapping.py`	Drops dim-order from extracted tensor metadata and `TosaArg`.
`backends/arm/tosa/dialect/ops/transpose.py`	Deletes fake TOSA transpose dialect op.
`backends/arm/tosa/dialect/__init__.py`	Removes transpose dialect registration.
`backends/arm/test/tester/arm_tester.py`	Updates tester helper for new `extract_tensor_meta` signature.
`backends/arm/test/passes/test_to_tosa_memory_format.py`	Deletes tests for removed memory-format pass.
`backends/arm/test/passes/test_decompose_int16_activation_conv_pass.py`	Deletes tests for removed int16 conv decomposition pass.
`backends/arm/test/ops/test_view.py`	Renames test data sets and removes old U55 non-delegation cases.
`backends/arm/test/ops/test_select.py`	Removes old U55 non-delegation coverage.
`backends/arm/test/ops/test_permute.py`	Expands U55 permute coverage for large-shape decomposition path.
`backends/arm/test/misc/test_transpose_counts.py`	Updates expected transpose count for grouped conv channels-last case.
`backends/arm/process_node.py`	Removes dim-order-aware tensor serialization and uses normalized shapes directly.
`backends/arm/operators/op_while.py`	Stops consulting output dim-order when creating dummy loop outputs.
`backends/arm/operators/op_tosa_transpose.py`	Deletes backend visitor for removed TOSA transpose op.
`backends/arm/operators/op_tosa_shapes.py`	Serializes shape constants without dim-order remapping.
`backends/arm/operators/op_sum.py`	Uses raw reduction axis directly.
`backends/arm/operators/op_permute.py`	Removes dim-order permutation remapping logic.
`backends/arm/operators/op_cat.py`	Uses raw concat dimension directly.
`backends/arm/operators/op_any.py`	Uses raw reduction axis directly.
`backends/arm/operators/op_amin.py`	Uses raw reduction axis directly.
`backends/arm/operators/op_amax.py`	Uses raw reduction axis directly.
`backends/arm/operators/__init__.py`	Unregisters removed transpose visitor.
`backends/arm/operator_support/tosa_supported_operators.py`	Removes old U55 transpose/view support checks from factory.
`backends/arm/operator_support/ethos_u55_support.py`	Deletes legacy U55 view/permute support-check implementations.
`backends/arm/operator_support/convolution_support.py`	Simplifies transpose-conv U55 shape handling.
`backends/arm/_passes/to_tosa_memory_format_pass.py`	Deletes old dim-order/memory-format pass.
`backends/arm/_passes/insert_data_layout_casts_pass.py`	Removes dependency on deleted backend transpose op.
`backends/arm/_passes/decompose_permute_for_u55_pass.py`	Adds new U55 large-permute decomposition/probing pass.
`backends/arm/_passes/decompose_int16_activation_conv_pass.py`	Deletes old int16 activation conv decomposition pass.
`backends/arm/_passes/arm_pass_utils.py`	Removes output dim-order helper.
`backends/arm/_passes/arm_pass_manager.py`	Inserts new U55 permute pass and reorders slice rewriting.
`backends/arm/_passes/annotate_output_dim_order_pass.py`	Deletes old output dim-order annotation pass.
`backends/arm/_passes/__init__.py`	Updates exported pass list for removed/added passes.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

    dtype = map_dtype(val.dtype)
    shape = tuple(val.size())

-    dim_order = tuple(range(len(shape)))
-    return (dtype, shape, dim_order)
+    return (dtype, shape)


+        # This is a quick check to avoid the overhead of the Vela compilation in 99% of cases.
+        if not self._violates_u55_worst_case_constraint(input_shape):
+            return super().call_operator(op, args, kwargs, meta)
+


+    assert isinstance(tensor, torch.Tensor), (
+        f"Expected lifted tensor constant '{node.name}' to be a torch.Tensor, got "
+        f"{type(tensor).__name__}"
    )


    """Extract dtype, shape, and dimension order from FX metadata.

    Args:
        meta (dict): FX node ``meta`` containing a ``val`` FakeTensor (or tuple).

    Returns:
        tuple[ts.DType, tuple[int, ...], tuple[int, ...]]: Tuple containing
-        tensor dtype, shape, and dimension order.
+        tensor dtype and shape.


+        permutation and dtype to check wheter it is supported.
+        """
+
+        if dtype not in (torch.int8, torch.bool, torch.int16):


AdrianLundell added partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm ciflow/trunk release notes: none Do not include this in the release notes labels May 4, 2026

Copilot AI review requested due to automatic review settings May 4, 2026 11:45

AdrianLundell requested a review from digantdesai as a code owner May 4, 2026 11:45

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 4, 2026

github-actions Bot added the module: arm Issues related to arm backend label May 4, 2026

Copilot started reviewing on behalf of AdrianLundell May 4, 2026 11:46 View session

Copilot AI reviewed May 4, 2026

View reviewed changes

zingo approved these changes May 4, 2026

View reviewed changes

AdrianLundell merged commit 8e653a6 into pytorch:main May 5, 2026
444 of 458 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Arm backend: Cleanup dim-order and permute handling#19278

Arm backend: Cleanup dim-order and permute handling#19278
AdrianLundell merged 1 commit intopytorch:mainfrom
AdrianLundell:change-1249697

AdrianLundell commented May 4, 2026 •

edited by pytorch-bot Bot

Loading

Uh oh!

pytorch-bot Bot commented May 4, 2026 •

edited

Loading

Uh oh!

AdrianLundell commented May 4, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AdrianLundell commented May 4, 2026 • edited by pytorch-bot Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19278

❗ 1 Active SEVs

✅ You can merge normally! (3 Unrelated Failures)

Uh oh!

AdrianLundell commented May 4, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AdrianLundell commented May 4, 2026 •

edited by pytorch-bot Bot

Loading

pytorch-bot Bot commented May 4, 2026 •

edited

Loading