Hi,
I got an error exclusively when using PPO2 with full tensorboard logs = true. It happens after around 300 steps:

Here are my settings:

It trains successfully while full tensorboard logs = false.
The agent also trains successfully using other algorithms with full tensorboard logs = true. Not sure why PPO2 only have this problem.
Note that I still get tensorboard logs just not the full data, so it's not a major issue.