Roadmap for LightRFT v0.1.1

# 🗺️ Roadmap for LightRFT v0.1.1
**Expected Release:** January 2026

### ✨ New Features
*   **Multimodal Support**
    *   Add video support for reinforcement finetuning (#4)
  
*   **Training & Evaluation**
    *   Implement and optimize evaluation for SRM (Step-wise Reward Model) and GRM (Generative Reward Model) trainers (#12)
    *   Add high entropy token selection mechanism (#6)
*   **Analysis**
    *   Add analysis metrics to saved trajectories (#5)


### ⚙️ Compatibility & Dependencies
*   **Library Updates**
    *   [WIP] Adapt LightRFT to latest versions of `sglang`, `vllm`, and `deepspeed` (#24)
*   **Framework Compatibility**
    *   Rename `dtype` to `torch_dtype` for better `transformers` compatibility (#7)

### 🐛 Bug Fixes & Maintenance
*   **Fixes**
    *   Fix bug in GRM dataset message formatting and evaluation logic (#8)
    *   Remove redundant tuple nesting in `prepare_reward_model` return when using FSDP (#15)
*   **Code Style**
    *   Fix `make fcheck` in `lightrft/datasets` for linting errors (#10)

### 📚 Documentation
*   **Deployment**
    *   Setup documentation deploy actions (#18)
*   **Content Updates**
    *   Update Python typing lint (#26)
    *   Polish API comment doc details (#21)
    *   Update GRM on T2I benchmark results and analysis in best practices (#9)
    *   Update general documentation and README for v0.1.1 (#2)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Roadmap for LightRFT v0.1.1 #19

🗺️ Roadmap for LightRFT v0.1.1

✨ New Features

⚙️ Compatibility & Dependencies

🐛 Bug Fixes & Maintenance

📚 Documentation

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Roadmap for LightRFT v0.1.1 #19

Description

🗺️ Roadmap for LightRFT v0.1.1

✨ New Features

⚙️ Compatibility & Dependencies

🐛 Bug Fixes & Maintenance

📚 Documentation

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions