🤖 Reinforcement Learning Methods

This repository collects implementations and notes for several reinforcement learning algorithms, organized by method and environment.
Most of these implementations originate from coursework and projects for the Deep Reinforcement Learning course at Sharif University of Technology.

🛰️ DQN

The DQN folder contains an implementation of the Deep Q‑Network algorithm applied to the LunarLander environment.

🐆 PPO

The PPO folder implements Proximal Policy Optimization for the continuous‑control HalfCheetah environment.

📚 PPO_SAC_DDPG_theory

The PPO_SAC_DDPG_theory folder provides concise theoretical notes and derivations for PPO, SAC, and DDPG. It focuses on objective functions, policy/value updates, and the main intuitions behind on‑policy and off‑policy actor–critic methods.

🔥 SAC_DDPG

The SAC_DDPG folder includes implementations of:

SAC on CartPole and HalfCheetah, highlighting entropy‑regularized learning and stochastic policies.
DDPG on HalfCheetah, demonstrating deterministic policy gradients for continuous action spaces.

🌉 TD&MC

The TD&MC folder implements Temporal Difference and Monte Carlo methods on the CliffWalking environment. It compares on‑policy TD learning with episodic Monte Carlo returns, making the trade‑offs between bias, variance, and exploration more concrete.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Reinforcement Learning Methods

🛰️ DQN

🐆 PPO

📚 PPO_SAC_DDPG_theory

🔥 SAC_DDPG

🌉 TD&MC

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
DQN		DQN
PPO		PPO
PPO_SAC_DDPG_theory		PPO_SAC_DDPG_theory
SAC_DDPG		SAC_DDPG
TD&MC		TD&MC
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

🤖 Reinforcement Learning Methods

🛰️ DQN

🐆 PPO

📚 PPO_SAC_DDPG_theory

🔥 SAC_DDPG

🌉 TD&MC

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages