Add coreml_compute_plan.py: report which CoreML ops dispatch to ANE / GPU / CPU by john-rocky · Pull Request #19252 · pytorch/executorch

john-rocky · 2026-05-01T05:53:25Z

Summary

CoreML decides at compile/load time which device each MIL operation will
execute on, and coremltools 9.0+ exposes that through MLComputePlan.
The recurring question on the issue tracker is "why isn't my model
running fully on the ANE?" — for example:

llama model is not fully lowered to ANE (coreml backend) #4091 — llama model is not fully lowered to ANE
CoreML model is crashing on iPhone GPU, but not on iPhone CPU or macOS GPU #11541 — CoreML model is crashing on iPhone GPU, but not on iPhone CPU or macOS GPU
ANE compile OOMs on certain input shapes #8439 — ANE compile OOMs on certain input shapes
CPU Overhead After ANE Execution #8445 — CPU Overhead After ANE Execution

Today the only way for an ExecuTorch user to answer it is to break out
Swift / Xcode. This PR adds a Python wrapper around MLComputePlan so
the answer is one shell command:

$ python coreml_compute_plan.py --model_path my_model.mlpackage \
      --compute_units cpu_and_ne --show_non_ane

=== my_model.mlpackage ===
  ANE:   412 / 480 ( 85.8%)
  CPU:    68 / 480 ( 14.2%)

  Non-ANE op types:
       32  ios17.cast
       18  ios17.gather
       12  ios17.reshape
        6  ios17.constexpr_blockwise_shift_scale

Inputs supported:

Input	Behavior
`.pte`	Extract every Core ML partition into a tempdir, then analyze each.
`.mlpackage`	Compile to `.mlmodelc` in a tempdir, then analyze.
`.mlmodelc`	Analyze directly.

The PTE path reuses the same JSON/named-data extraction logic that
extract_coreml_models.py uses, and is inlined into the script so it can
be run against a plain CoreML model without depending on the executorch
package.

Test plan

Added test_coreml_compute_plan.py covering:

_device_name(...) for None and a stub MLNeuralEngineComputeDevice.
_COMPUTE_UNIT_CHOICES mapping (cpu_and_ne / all).
analyze_one(...) end-to-end on a tiny relu(x @ x.T) + x.sum()
mlpackage built with coremltools.convert(...): returns rows for
every dispatched op, with a main function and the expected MIL op
types (matmul, relu, add, reduce_sum).

$ python -m pytest examples/apple/coreml/scripts/test_coreml_compute_plan.py -v
============================== 7 passed in 3.68s ===============================

I also ran the script against a few hand-built .mlpackage and
.mlmodelc files on macOS 26 with coremltools 9.0 and verified the
output matches what MLComputePlan returns directly.

Authored with Claude.

cc @kimishpatel @YifanShenSZ @cymbalrush @metascroy

CoreML decides at compile/load time which device each MIL operation will execute on; that decision is exposed through MLComputePlan in coremltools 9.0+. This script wraps it so users can answer 'why isn't my model running on the ANE?' without writing Swift, which is the recurring question behind issues like pytorch#4091, pytorch#11541, and pytorch#8439. Inputs supported: * .pte — extracts every Core ML partition first. * .mlpackage — compiles to .mlmodelc in a tempdir. * .mlmodelc — analyzed directly. Reports per-op dispatch (ANE / GPU / CPU), an aggregate breakdown, and optionally the op types that did not get assigned to the ANE (--show_non_ane). Authored with Claude.

pytorch-bot · 2026-05-01T05:53:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19252

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

⚠️ 11 Awaiting Approval

As of commit 387d074 with merge base 94d2881 ():

AWAITING APPROVAL - The following workflows need approval before CI can run:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

john-rocky · 2026-05-02T15:38:09Z

@pytorchbot label "release notes: apple"

kimishpatel · 2026-05-04T20:13:21Z

I thought what op runs where is decided at compile time. Is this being extracted from AOT compile or just from lowering?

john-rocky · 2026-05-04T20:33:23Z

@kimishpatel Compile time, not runtime — MLComputePlan is a static analysis hook that coremltools 9.0+ exposes around the same dispatch decision the CoreML runtime would make. It loads from a compiled .mlmodelc (the script compiles a .mlpackage to a tempdir if needed) and reports preferred_compute_device per MIL operation without actually running predictions.

Quoting coremltools' own docs:

Represents the plan for executing a model. The application can use the plan to estimate the necessary cost and resources of the model before running the predictions.

So no AOT extraction or runtime profiling involved; just a wrapper that walks MLModelStructureProgram and calls get_compute_device_usage_for_mlprogram_operation per op. That's why this works against any of .pte, .mlpackage, or .mlmodelc — they all reduce to a compiled program the framework can plan against.

john-rocky requested a review from metascroy as a code owner May 1, 2026 05:53

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 1, 2026

john-rocky mentioned this pull request May 1, 2026

Add Gemma 4 text-decoder export to CoreML #19253

Open

pytorch-bot Bot added the release notes: apple Changes to the Apple backend delegate label May 2, 2026

nil-is-all added the module: coreml Issues related to Apple's Core ML delegation and code under backends/apple/coreml/ label May 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add coreml_compute_plan.py: report which CoreML ops dispatch to ANE / GPU / CPU#19252

Add coreml_compute_plan.py: report which CoreML ops dispatch to ANE / GPU / CPU#19252
john-rocky wants to merge 1 commit intopytorch:mainfrom
john-rocky:coreml/compute-plan-analyzer

john-rocky commented May 1, 2026 •

edited by pytorch-bot Bot

Loading

Uh oh!

pytorch-bot Bot commented May 1, 2026 •

edited

Loading

Uh oh!

john-rocky commented May 2, 2026

Uh oh!

kimishpatel commented May 4, 2026

Uh oh!

john-rocky commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

john-rocky commented May 1, 2026 • edited by pytorch-bot Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

pytorch-bot Bot commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19252

⚠️ 11 Awaiting Approval

Uh oh!

john-rocky commented May 2, 2026

Uh oh!

kimishpatel commented May 4, 2026

Uh oh!

john-rocky commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

john-rocky commented May 1, 2026 •

edited by pytorch-bot Bot

Loading

pytorch-bot Bot commented May 1, 2026 •

edited

Loading