Basic implementation of the Proximal Policy Optimization (PPO) algorithm for solving the Bipedal Walker environment from the Gymnasium library.
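
For orientation, the core of Proximal Policy Optimization (PPO) is its clipped surrogate objective. The snippet below is a minimal sketch of that objective in plain Python and is not taken from this repository's code. The function name `ppo_clip_loss`, its argument names, and the default `clip_eps=0.2` are assumptions chosen for illustration. A full implementation would usually add a value-function loss and often an entropy bonus.

```python
import math


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Return the negated clipped surrogate objective, averaged over samples.

    Hypothetical helper for illustration only:
    - new_log_probs: log-probabilities of the taken actions under the current policy
    - old_log_probs: log-probabilities of the same actions under the policy that collected the data
    - advantages:    advantage estimates for those actions
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the current policy and the data-collecting policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    # Negate so that minimizing this value maximizes the surrogate objective.
    return -total / len(advantages)
```

Clipping removes the incentive to move the policy far from the one that collected the data: once the ratio leaves the clipping interval in the direction favored by the advantage, the objective stops improving.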