Responses from the training engines include chat templates and CoT. OpenClaw does not filter these out for every model, so unwanted text leaks into responses. To address this, the inference backends strip CoT and templates. But training needs the exact-token rollouts to remain on-policy, so we added a hacky cache in the inference backends that maps each filtered response back to its unfiltered original. We should replace this hack with a more robust approach.
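To make the hack concrete, here is a minimal sketch of what such a cache looks like. All names (`RolloutCache`, its methods) are hypothetical, not taken from the OpenClaw codebase; it only illustrates the filtered-to-raw mapping described above.

```python
# Hypothetical illustration of the cache described above; names are
# made up for the sketch and do not come from the OpenClaw codebase.
class RolloutCache:
    """Maps a filtered response string back to its raw, exact-token rollout."""

    def __init__(self) -> None:
        self._raw_by_filtered: dict[str, str] = {}

    def put(self, raw_response: str, filtered_response: str) -> None:
        # Store the unfiltered rollout under the filtered text the caller sees.
        self._raw_by_filtered[filtered_response] = raw_response

    def get_raw(self, filtered_response: str) -> str:
        # Recover the exact tokens needed to stay on-policy during training.
        return self._raw_by_filtered[filtered_response]


cache = RolloutCache()
raw = "<think>chain of thought</think>Final answer."
filtered = "Final answer."
cache.put(raw, filtered)
assert cache.get_raw(filtered) == raw
```

The fragility is visible even in this toy version: the mapping is keyed on post-filter text, so two distinct rollouts that filter to the same string collide, which is one reason a more robust design is worth building.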