Skip to content

Releases: d9d-project/d9d

v0.13.1

23 Apr 00:03

Choose a tag to compare

v0.13.1 (2026-04-23)

This release is published under the Apache-2.0 License.

Bug Fixes

  • Correct GQA/MQA head expansion logic for Q and K (62b73bd)

  • Introduce RoutingResult and fix probability renormalization (a1dabe3)

  • Remove GPU-CPU sync that was happening during bincount() call in MoE layer (534facc)

  • Remove MoE statistics interface from qwen3 decoder layer and module (99e0379)


Detailed Changes: v0.13.0...v0.13.1

v0.13.0

20 Apr 13:21

Choose a tag to compare

v0.13.0 (2026-04-20)

This release is published under the Apache-2.0 License.

Features

  • Migrate to Flash Attention 4 (19f8483)

Detailed Changes: v0.12.0...v0.13.0

v0.12.0

14 Apr 21:33

Choose a tag to compare

v0.12.0 (2026-04-14)

This release is published under the Apache-2.0 License.

Features

  • Add partial RoPE to GQA (4dcfe7e)

Detailed Changes: v0.11.0...v0.12.0

v0.11.0

13 Apr 23:21

Choose a tag to compare

v0.11.0 (2026-04-13)

This release is published under the Apache-2.0 License.

Features

  • Add Gated DeltaNet layer that use fla-core kernels (e0e466f)

  • Add shared expert to MoE block (0e93f45)


Detailed Changes: v0.10.0...v0.11.0

v0.10.0

13 Apr 00:56

Choose a tag to compare

v0.10.0 (2026-04-13)

This release is published under the Apache-2.0 License.

Features

  • Add custom RMSNorm kernel supporting zero-centered weight (365ad1f)

  • Add optional output gating for grouped query attention (af3e411)


Detailed Changes: v0.9.0...v0.10.0

v0.9.0

08 Apr 01:53

Choose a tag to compare

v0.9.0 (2026-04-08)

This release is published under the Apache-2.0 License.

Bug Fixes

  • Allow using MoE parallelization function with [EP]=1 (fdc7d6d)

  • Fix huggingface mappers for classification models (caa2e63)

Features

  • Add model state mappers that convert HuggingFace model states into d9d ones and vice versa (47bae88)

  • Make HF model state mappers for Qwen3 MoE support new fused experts format from transformers v5 (cdecc29)


Detailed Changes: v0.8.0...v0.9.0

v0.8.0

22 Mar 00:25

Choose a tag to compare

v0.8.0 (2026-03-22)

This release is published under the Apache-2.0 License.

Features

  • Add training hooks system based on event bus along with a set of predefined events (13a76e2)

Detailed Changes: v0.7.0...v0.8.0

v0.7.0

18 Mar 23:43

Choose a tag to compare

v0.7.0 (2026-03-18)

This release is published under the Apache-2.0 License.

Features

  • Add Multi-Latent Attention implementation along with support for different RoPE layouts (56684bc)

  • Add Qwen3 Dense model (08047bd)


Detailed Changes: v0.6.0...v0.7.0

v0.6.0

11 Mar 18:11

Choose a tag to compare

v0.6.0 (2026-03-11)

This release is published under the Apache-2.0 License.

Bug Fixes

  • Render multi-process model state loading progress using multiple progress bars (fixes #1) (defea97)

Features

  • Implement composition-based classificaiton metrics API (7d7476c)

Detailed Changes: v0.5.4...v0.6.0

v0.5.4

10 Mar 00:30

Choose a tag to compare

v0.5.4 (2026-03-10)

This release is published under the Apache-2.0 License.

Bug Fixes

  • Reorder contents in README (973c7cb)

Documentation

  • Migrate from mkdocs-shadcn to zensical (7c131c4)

Detailed Changes: v0.5.3...v0.5.4