it seems run_evalution only resets before `for batch in loader`. If batch['user'] changes, memory returned by env would be invalid.
it seems run_evalution only resets before
for batch in loader. If batch['user'] changes, memory returned by env would be invalid.