Dice Recognition 🎲🤖

Computer Vision project made with Python, YOLO and OpenCV to recognize dice, count them, and sum their values in real time using a webcam or video as input.

dice_recognition.mp4

Installation

1. Create a Virtual Environment

python3 -m venv .venv
source .venv/bin/activate  # On Windows use: .venv\Scripts\activate

2. Install Dependencies

Note

Before installing the dependencies, if you want to use CUDA for better performance, you should install the appropriate CUDA versions of torch and torchvision:

pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu126

The example above uses cu126 (CUDA 12.6). However, you must ensure that:

Your system has a compatible NVIDIA GPU.
You have the correct CUDA drivers installed.

If you're using an older GPU or have a lower CUDA version installed (e.g., CUDA 11.8), use the matching packages:

pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu118

For more options and compatibility information, check the official PyTorch installation guide.

pip3 install -r requirements.txt

Running the Model

Option 1: Use Pretrained Model

The pretrained model is already included in this repository at:

runs/detect/train/weights/best.pt

You can run the application directly:

python3 main.py

Option 2: Train Your Own Model

The dataset is already included in this repository at:

datasets/dices

To train the model you can use the YOLO CLI:

yolo task=detect mode=train model=yolov8n.pt data=datasets/dices/data.yaml epochs=50 plots=True

Note

You can customize the training process by for example modifying these opions:

model: YOLO model to use (e.g., yolov8n.pt, yolov8s.pt, etc.).
epochs: Max training cycles.
patience: Stop early if no improvement after this many epochs.

Once training is complete, the best-trained model will be stored at:

runs/detect/train2/weights/best.pt

You can use this model by modifying main.py:

model = YOLO("runs/detect/train2/weights/best.pt")

Note

If you train multiple times, new training folders (e.g., train2, train3, etc.) will be created, so you can choose the best model from any of them by modifying the path in main.py.

Usage

Simply run:

python3 main.py

The script will detect dice, count them, and display the total sum in real-time.

Project Structure

.
├── datasets/                # Contains the datasets for training
│   └── dices/
│       ├── data.yaml        # Dataset configuration file
│       ├── test/            # Test dataset
│       ├── train/           # Training dataset
│       └── valid/           # Validation dataset
├── main.py                  # Runs the dice recognition model
├── requirements.txt         # Project dependencies
├── runs/                    # Training outputs
│   └── detect/
│       ├── train/           # First training session
│       │   ├── weights/
│       │   │   ├── best.pt  # Best-trained model
└── yolov8n.pt               # Base YOLO model for training

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
datasets/dices		datasets/dices
runs/detect/train		runs/detect/train
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt
yolov8n.pt		yolov8n.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dice Recognition 🎲🤖

Installation

1. Create a Virtual Environment

2. Install Dependencies

Running the Model

Option 1: Use Pretrained Model

Option 2: Train Your Own Model

Usage

Project Structure

License

About

Uh oh!

Releases 2

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Dice Recognition 🎲🤖

Installation

1. Create a Virtual Environment

2. Install Dependencies

Running the Model

Option 1: Use Pretrained Model

Option 2: Train Your Own Model

Usage

Project Structure

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Contributors

Uh oh!

Languages