Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
14 changes: 14 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,20 @@ version line is kept in lock-step with the underlying SKaiNET engine
The format roughly follows [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).

## [0.33.0] — 2026-06-29

Ships against **SKaiNET engine 0.33.0**. No transformers API changes — this release adopts the new
engine line and routine dependency updates.

### Changed

- **Engine → 0.33.0.** Transformer models authored with this layer inherit the engine's 0.33.0 work;
most relevant here, `layerNorm` / `rmsNorm` now lower to real `stablehlo.reduce`, so transformer
exports compile and run on stock IREE (engine #769). The engine also fixes a silent autodiff
gradient-drop (`elu`/`leakyRelu`/`permute`) and adds new differentiable ops (`cos`/`sin`/`gather`/…),
available to model authors. (engine 0.33.0)
- **Dependencies:** Ktor client `3.5.1` (#198), Logback `1.5.36` (#199).

## [0.32.1] — 2026-06-26

Fixes streaming detokenization — generated text no longer runs words together
Expand Down
10 changes: 6 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -103,10 +103,12 @@ Honest status — see the project-status note at the top of this README.

## Current release

The current release is **0.32.1** (against **SKaiNET 0.32.4**). It fixes streaming
detokenization — per-token decode now keeps each word's leading space, so generated text
no longer runs together (`"the process"` not `"theprocess"`). On top of the **0.32.0**
real-GGUF **Llama** eager + StableHLO/IREE export work:
The current release is **0.33.0** (against **SKaiNET 0.33.0**). It adopts the new engine line —
no transformers API changes — so transformer models inherit the engine's 0.33.0 work: `layerNorm`/
`rmsNorm` now lower to real `stablehlo.reduce` (exports compile on stock IREE), a silent autodiff
gradient-drop is fixed, and new differentiable ops (`cos`/`sin`/`gather`/…) are available. On top of
the **0.32.1** streaming-detokenization fix and the **0.32.0** real-GGUF **Llama** eager +
StableHLO/IREE export work:

- The eager **`NATIVE_OPTIMIZED` path now works for Llama** (`Q4_K`/`Q6_K`): weights stay
packed and `LlamaNetworkLoader.fromGguf(NATIVE_OPTIMIZED) + OptimizedLLMRuntime` decodes
Expand Down
2 changes: 1 addition & 1 deletion gradle.properties
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
GROUP=sk.ainet.transformers
VERSION_NAME=0.32.1
VERSION_NAME=0.33.0

POM_DESCRIPTION=SKaiNET-transformers

Expand Down
2 changes: 1 addition & 1 deletion gradle/libs.versions.toml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
[versions]
skainet = "0.32.4"
skainet = "0.33.0"
agp = "9.2.1"
jacksonDatabind = "2.22.0"
jsonSchemaValidator = "3.0.5"
Expand Down
Loading