Releases · d9d-project/d9d

23 Apr 00:03

github-actions

v0.13.1

474fb20

v0.13.1 Latest


v0.13.1 (2026-04-23)

This release is published under the Apache-2.0 License.

Bug Fixes

Correct GQA/MQA head expansion logic for Q and K (62b73bd)
Introduce RoutingResult and fix probability renormalization (a1dabe3)
Remove the GPU-CPU sync triggered by the bincount() call in the MoE layer (534facc)
Remove MoE statistics interface from qwen3 decoder layer and module (99e0379)
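
For readers unfamiliar with the renormalization mentioned in a1dabe3, the sketch below shows the general idea behind top-k routing in a MoE layer: after only k experts are kept, their probabilities are rescaled so they sum to 1 again. This is an illustration in plain Python under stated assumptions, not d9d's code; `route_top_k` and its arguments are hypothetical names, and the actual `RoutingResult` type may look quite different.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(logits, k):
    """Hypothetical helper: pick the k most probable experts for one token
    and renormalize their probabilities so the kept weights sum to 1."""
    probs = softmax(logits)
    # Indices of the k largest probabilities, most probable first.
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    kept_mass = sum(probs[i] for i in top)
    weights = [probs[i] / kept_mass for i in top]
    return top, weights


# One token, four experts, two experts kept.
experts, weights = route_top_k([2.0, 1.0, 0.5, -1.0], k=2)
```

Without the division by `kept_mass`, the selected weights would sum to less than 1, which scales down the combined expert output.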

Detailed Changes: v0.13.0...v0.13.1


20 Apr 13:21

github-actions

v0.13.0

e82b5db

v0.13.0

v0.13.0 (2026-04-20)

This release is published under the Apache-2.0 License.

Features

Migrate to Flash Attention 4 (19f8483)

Detailed Changes: v0.12.0...v0.13.0


14 Apr 21:33

github-actions

v0.12.0

f978ac1

v0.12.0

v0.12.0 (2026-04-14)

This release is published under the Apache-2.0 License.

Features

Add partial RoPE to GQA (4dcfe7e)
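
As background for this feature, partial RoPE generally means that only the first part of each head vector is rotated by the position-dependent angles, while the remaining dimensions pass through unchanged. The sketch below is a plain-Python illustration that assumes the half-split pairing (element i is paired with element i + rotary_dim / 2) and a base of 10000; `partial_rope` and `rotary_dim` are hypothetical names, and the layout d9d actually uses may differ.

```python
import math


def partial_rope(x, position, rotary_dim, base=10000.0):
    """Rotate the first `rotary_dim` entries of one head vector `x`;
    leave the remaining entries untouched (hypothetical helper)."""
    if rotary_dim % 2 != 0 or rotary_dim > len(x):
        raise ValueError("rotary_dim must be even and at most len(x)")
    half = rotary_dim // 2
    out = list(x)
    for i in range(half):
        # Per-pair rotation angle; the frequency decays with the pair index.
        theta = position * base ** (-2.0 * i / rotary_dim)
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        a, b = x[i], x[i + half]
        out[i] = a * cos_t - b * sin_t
        out[i + half] = a * sin_t + b * cos_t
    return out


vec = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
rotated = partial_rope(vec, position=3, rotary_dim=4)
```

Because each pair is rotated in its own plane, the vector norm is preserved, and the last two entries of `vec` come out unchanged.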

Detailed Changes: v0.11.0...v0.12.0


13 Apr 23:21

github-actions

v0.11.0

9410e9f

v0.11.0

v0.11.0 (2026-04-13)

This release is published under the Apache-2.0 License.

Features

Add Gated DeltaNet layer that uses fla-core kernels (e0e466f)
Add shared expert to MoE block (0e93f45)

Detailed Changes: v0.10.0...v0.11.0


13 Apr 00:56

github-actions

v0.10.0

1784409

v0.10.0

v0.10.0 (2026-04-13)

This release is published under the Apache-2.0 License.

Features

Add custom RMSNorm kernel supporting zero-centered weight (365ad1f)
Add optional output gating for grouped query attention (af3e411)
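
For context on the zero-centered weight mentioned above: in this RMSNorm variant the learned parameter is commonly stored as an offset from 1, so the effective scale is (1 + weight) and a zero-initialized weight gives an identity scale. The following is a reference sketch in plain Python under that assumption, not the custom kernel itself; `rms_norm_zero_centered` and `eps` are hypothetical names.

```python
import math


def rms_norm_zero_centered(x, weight, eps=1e-6):
    """RMSNorm over one vector with a zero-centered scale:
    y_i = x_i / sqrt(mean(x^2) + eps) * (1 + weight_i)."""
    if len(x) != len(weight):
        raise ValueError("x and weight must have the same length")
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * (1.0 + w) for v, w in zip(x, weight)]


# A zero weight vector corresponds to a plain RMSNorm with unit scale.
normed = rms_norm_zero_centered([3.0, 4.0], weight=[0.0, 0.0])
# A weight of 1.0 doubles the normalized value in that position.
scaled = rms_norm_zero_centered([3.0, 4.0], weight=[1.0, 0.0])
```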

Detailed Changes: v0.9.0...v0.10.0


08 Apr 01:53

github-actions

v0.9.0

6f12520

v0.9.0

v0.9.0 (2026-04-08)

This release is published under the Apache-2.0 License.

Bug Fixes

Allow using MoE parallelization function with [EP]=1 (fdc7d6d)
Fix Hugging Face mappers for classification models (caa2e63)

Features

Add model state mappers that convert Hugging Face model states into d9d ones and vice versa (47bae88)
Make HF model state mappers for Qwen3 MoE support new fused experts format from transformers v5 (cdecc29)

Detailed Changes: v0.8.0...v0.9.0


22 Mar 00:25

github-actions

v0.8.0

220077f

v0.8.0

v0.8.0 (2026-03-22)

This release is published under the Apache-2.0 License.

Features

Add training hooks system based on event bus along with a set of predefined events (13a76e2)

Detailed Changes: v0.7.0...v0.8.0


18 Mar 23:43

github-actions

v0.7.0

2d694dd

v0.7.0

v0.7.0 (2026-03-18)

This release is published under the Apache-2.0 License.

Features

Add Multi-Latent Attention implementation along with support for different RoPE layouts (56684bc)
Add Qwen3 Dense model (08047bd)

Detailed Changes: v0.6.0...v0.7.0


11 Mar 18:11

github-actions

v0.6.0

46c712f

v0.6.0

v0.6.0 (2026-03-11)

This release is published under the Apache-2.0 License.

Bug Fixes

Render multi-process model state loading progress using multiple progress bars (fixes #1) (defea97)

Features

Implement composition-based classification metrics API (7d7476c)

Detailed Changes: v0.5.4...v0.6.0


10 Mar 00:30

github-actions

v0.5.4

1d3e775

v0.5.4

v0.5.4 (2026-03-10)

This release is published under the Apache-2.0 License.

Bug Fixes

Reorder contents in README (973c7cb)

Documentation

Migrate from mkdocs-shadcn to zensical (7c131c4)

Detailed Changes: v0.5.3...v0.5.4

