ChenmienTan

Follow

谭三爷 ChenmienTan

Follow

Engineer @ ByteDance Seed

83 followers · 3 following

Hangzhou, Zhejiang, China
chenmientan.github.io

Achievements

Achievements

Pinned Loading

RL2 RL2 Public

Python 1.3k 130
OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9.3k 917
THUDM/slime THUDM/slime Public

slime is an LLM post-training framework for RL Scaling.

Python 5.3k 711