Deep Reinforcement Learning for 2D Racing (DQN + Q-Table)

This repository contains a 2D driving simulation built with Pygame + Gymnasium, where an autonomous agent learns using radar sensor readings to race around a curved racetrack using RL. Two RL approaches are explored:

Deep Q-Network (DQN) — neural network agent using PyTorch
Q-Table Learning — tabular RL baseline

The simulation environment is implemented as custom Gym environment (gym_race), and gameplay is rendered using Pygame. The environment supports real-time rendering for visualization and non-render mode for faster training.

Repository Structure

├── gym_race/                 # Gym environment
│   └── envs/
│       ├── pyrace_2d.py
│       ├── race_env.py
│       └── utils.py
├── models_DQN_v01/           # Saved DQN models
│   ├── best_dqn_model.pth
│   └── dqn_model_0.pth
├── models_QT_v02/            # Saved QTable memory, tables
│   ├── memory_3500.npy
│   └── q_table_3500.npy
├── Pyrace_RL_QTable.py                    # Main RL training/testing script
├── Pyrace_performance_analysis.ipynb      #Training analysis notebook
└── *.png                     # Racing environment visual assets

Environment Overview

The Pyrace-v1 environment simulates a top-down 2D vehicle navigating a track using:

Ray based sensor inputs
Discrete actions (accelerate, turn left/right, ...)
Reward shaping for progress and collision penalties

Algorithms Implmented

Algorithm	File	Description
Deep QNetwork (DQN)	Pyrace_RL_QTable.py	NN approximates QValues using PyTorch
Q-Table RL	Pyrace_RL_QTable.py (legacy section & saved tables)	Baseline QLearning for comparison

Observation Space

The state consists of 5 radar sensor distances, normalized within [0,10]. Sensors are angled across the front of the car, with higher values meaning further distances.

[dist_1, dist_2, dist_3, dist_4, dist_5]

Action Space

Action	Effect
0	Accelerate
1	Turn left
2	Turn right
3	Brake (available in core env)

Reward Structure

The agent is encouraged to move foward & pass the checkpoint (full lap around track), and avoid walls.

Event	Reward
Checkpoint progress	+ distance-based reward
Crash -10000	+ distance traveled
Lap complete	+10000 bonus

Running the Code

Install Dependencies

pip install -r requirements.txt

Training Agent

Within Pyrace_RL_QTable.py, change this line of code:

#simulate()
load_and_play("best", learning=True)

to:

simulate()
# load_and_play("best", learning=True)

Run Trained Agent

Run the load_and_play function (and turn training off) to run the previously (best) trained agent:

#simulate()
load_and_play("best", learning=False)

Performance Analysis (Pyrace_performance_analysis.ipynb)

Performance of the agents was evaluated using two approaches:

DQN Learning Curves

Tracked episodic rewards, the number of steps/episode during training.
To assess improvement over time, visualized averaged rewards and rewards per step trends.

This shows how efficiently the agent learns to navigate the track, avoid collisions, and complete laps.

Q-Table Policy Interpretation

For tabular Q-learning agents, aggregated Q-values across radar sensor states to determine any preferred actions.
Normalized and scaled aggregated values to produce a visual “policy fingerprint” showing which actions the agent favors based on obstacle direction/distance.

Both analyses provided insight into both the overall learning progress and action selection behavior, helping compare DQN and QTable approaches (and guiding further improvements).

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
gym_race		gym_race
models_DQN_v01		models_DQN_v01
models_QT_v02		models_QT_v02
.gitignore		.gitignore
Pyrace_RL_QTable.py		Pyrace_RL_QTable.py
Pyrace_performance_analysis.ipynb		Pyrace_performance_analysis.ipynb
README.md		README.md
car.png		car.png
car_green.png		car_green.png
car_red.png		car_red.png
race_track_ie.png		race_track_ie.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Reinforcement Learning for 2D Racing (DQN + Q-Table)

Repository Structure

Environment Overview

Algorithms Implmented

Observation Space

Action Space

Reward Structure

Running the Code

Install Dependencies

Training Agent

Run Trained Agent

Performance Analysis (Pyrace_performance_analysis.ipynb)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Deep Reinforcement Learning for 2D Racing (DQN + Q-Table)

Repository Structure

Environment Overview

Algorithms Implmented

Observation Space

Action Space

Reward Structure

Running the Code

Install Dependencies

Training Agent

Run Trained Agent

Performance Analysis (Pyrace_performance_analysis.ipynb)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages