    Repositories list

    • RiT

      Public
      RiT: Rubrics-in-Thinking Reinforcement Learning for Improved Reasoning in Large Language Models
      Apache License 2.0
      0000 Updated Apr 15, 2026
    • CLIPO

      Public
      CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
      Python
      21910 Updated Apr 7, 2026
    • ATP-Bench

      Public
      0220 Updated Mar 30, 2026
    • MARCH

      Public
      42211 Updated Mar 26, 2026
    • STAR

      Public
      STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models
      Python
      Apache License 2.0
      04400 Updated Mar 23, 2026
    • Proxy-GRM

      Public
      Python
      Apache License 2.0
      0420 Updated Mar 19, 2026
    • EVPV-PRM

      Public
      Python
      Apache License 2.0
      0700 Updated Mar 19, 2026
    • OpenRS

      Public
      Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
      Python
      Apache License 2.0
      11010 Updated Mar 5, 2026
    • Code of the research paper "SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm"
      Python
      751800 Updated Feb 27, 2026
    • DIR

      Public
      Python
      Apache License 2.0
      11710 Updated Feb 14, 2026
    • SSP

      Public
      Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
      Python
      Apache License 2.0
      81700 Updated Dec 30, 2025