Skip to content
#

deepcoder

Here are 6 public repositories matching this topic...

Language: All
Filter by language

借鉴《Learning to Reason in 13 parameters》 TinyLoRA方法,用极低参微调 Qwen2.5-Coder-3B-Instruct 进行OI。 Inspired by 《Learning to Reason in 13 parameters》, use Extreme parameter-efficient Reinforcement Learning to fine-tune Qwen2.5-Coder-3B-Instruct.

  • Updated Mar 5, 2026
  • Python

Improve this page

Add a description, image, and links to the deepcoder topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepcoder topic, visit your repo's landing page and select "manage topics."

Learn more