Skip to content

release: 0.33.0 — GRU, upsample2d Bilinear export, autodiff coverage fix#775

Merged
michalharakal merged 2 commits into
developfrom
release/0.33.0
Jun 29, 2026
Merged

release: 0.33.0 — GRU, upsample2d Bilinear export, autodiff coverage fix#775
michalharakal merged 2 commits into
developfrom
release/0.33.0

Conversation

@michalharakal

Copy link
Copy Markdown
Contributor

Release 0.33.0 — bundles everything merged to develop since 0.32.4.

Highlights (user value)

⚠️ Breaking

TensorOps.sin / cos / convTranspose1d are now abstract (were default-throwing) so they trace/export and become differentiable. Anyone implementing TensorOps directly must override them — both bundled backends (DefaultCpuOpsBase, VoidTensorOps) already do. Hence the minor bump.

Validation

Full downstream conformance suite — 12/12 models + 33/33 ops — exports, compiles, runs, and validates green end-to-end on IREE (llvm-cpu) against this exact tree (driven via a local-maven 0.32.5-localdev1 build of this branch). Includes the new GRU probe (max_abs_err 5.96e-08 vs the SKaiNET CPU oracle) and upsample2d (nearest 0.0, bilinear 1.19e-07 vs numpy).

Contents (gradle.properties 0.32.4→0.33.0, CHANGELOG.md, README "What's New")

This branch only bumps the version + docs. The code changes are already on develop via #769, #771, #772, #774.

Reviewer notes

  • develop also carries perf(native-cpu): Q6_K NEON matmul kernel #768 (Q6_K NEON matmul kernel) from a separate workstream — not authored here; please confirm its changelog wording if you want it called out.
  • GitHub reports Dependabot alerts on the default branch (mostly transitive/JS toolchain) — worth a glance but not release-blocking.
  • Not tagged / not published — left for the release owner's go.

🤖 Generated with Claude Code

Bumps VERSION_NAME 0.32.4 -> 0.33.0. Bundles the develop changes since 0.32.4:
GRU layer (#772/#217), upsample2d Bilinear + StableHLO export (#771), the
autodiff dispatch correctness fix + 7 newly-differentiable ops + KSP coverage
guard (#774), and norm converters lowering to real stablehlo.reduce (#769).

Minor bump (not patch): TensorOps.sin/cos/convTranspose1d became abstract, a
source/binary-incompatible change for downstream TensorOps implementers.

Validated: full conformance suite (12/12 models + 33/33 ops) green end-to-end
on IREE llvm-cpu against this tree (via local-maven 0.32.5-localdev1).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@michalharakal michalharakal merged commit 0d4d962 into develop Jun 29, 2026
4 checks passed
@michalharakal michalharakal deleted the release/0.33.0 branch June 29, 2026 10:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant