Agents lose track during long runs. They forget earlier instructions, drift off task, or even lose their working directory.
Tasks
Research questions
- What information matters most across phases?
- How to detect context drift during execution?
- What is the right tradeoff between context breadth and depth?
Agents lose track during long runs. They forget earlier instructions, drift off task, or even lose their working directory.
Tasks
STATE.mdfile in the workspace that the agent updates at each phase (current phase, what is done, key findings, next steps)Research questions