Proposal
At some point, perhaps we should define a "slave mode" for the runner whereby a dedicated slave system can continually pull individual model test requests from the bakeoff-results live site to run the model on the runner's hardware, store the results locally, and then push the results record upstream for incorporation into the global dataset. This would make it easy to coordinate dedicated runner systems which can keep running tests to contribute to the global dataset indefinitely. If there are no tests for this runner, it should just sleep for some time before checking again to avoid busy-looping load.
I believe we should create a companion ticket in bakeoff-results for the UI and queue API aspects of this.
Touches benchmark invariants?
Proposal
At some point, perhaps we should define a "slave mode" for the runner whereby a dedicated slave system can continually pull individual model test requests from the bakeoff-results live site to run the model on the runner's hardware, store the results locally, and then push the results record upstream for incorporation into the global dataset. This would make it easy to coordinate dedicated runner systems which can keep running tests to contribute to the global dataset indefinitely. If there are no tests for this runner, it should just sleep for some time before checking again to avoid busy-looping load.
I believe we should create a companion ticket in bakeoff-results for the UI and queue API aspects of this.
Touches benchmark invariants?