10-703 Homework 3 Part 1 Problem 1 and Problem 2

Implementation of Reinforce, Actor-Critic

Prerequisites

Python 3.6
Pytorch
OpenAI Gym
numpy
scipy
pybox2d
gym[box2d]
matplotlib
pyglet
h5py

Installation

Please follow the instructions on the homework handout.

Testing

python reinforce.py --num-episodes --lr

hw3_part1_plotter.py

For example,

python reinforce.py --num-episodes 50000 --lr 5e-4

The above will run the REINFORCE algorithm for 50000 training episodes and for every 200 training episodes it will output the average test reward (over 100 episodes). The reward is outputted to console (and can be redirected to a file), and can be plotted with hw3_part1_plotter.py.

python a2c.py --num-episodes --lr --critic-lr --n

hw3_part2_plotter.py

For example,

python a2c.py --num-episodes 50000 --lr 5e-4 --critic-lr 1e-4 --n 20

The above will run the advantage-actor critic algorithm for 50000 training episodes and for every 500 training episodes it will output the average test reward (over 100 episodes). The reward is outputted to console (and can be redirected to a file), and can be plotted with hw3_part2_plotter.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

10-703 Homework 3 Part 1 Problem 1 and Problem 2

Prerequisites

Installation

Testing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
a2c.py		a2c.py
hw3_part1_plotter.py		hw3_part1_plotter.py
hw3_part2_plotter.py		hw3_part2_plotter.py
reinforce.py		reinforce.py

Folders and files

Latest commit

History

Repository files navigation

10-703 Homework 3 Part 1 Problem 1 and Problem 2

Prerequisites

Installation

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages