Skip to content

Improve coverage for newer Qwen models and task profiles #24

Description

@Andyyyy64

Problem

Some newer model families are present in the cache but do not rank well because metadata or benchmark evidence is missing. Examples seen locally: Qwen/Qwen3.6-35B-A3B is not detected as MoE, and Qwen/Qwen3.5-9B / newer coder models have weak or missing benchmark evidence.

This also affects task profiles: --profile math is currently very sparse, and --profile coding still falls back to older Qwen2.5-Coder models quickly.

Done when

  • Qwen/Qwen3.6-35B-A3B is parsed with correct MoE/active-parameter metadata or an equivalent curated override.
  • Newer Qwen3.5/Qwen3.6 coder/math-relevant models have reasonable direct, inherited, or curated benchmark evidence.
  • --profile coding and --profile math return current, defensible recommendations on common simulated GPUs.
  • Tests cover the new metadata and ranking behavior.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions