Skip to content

MouYongli/DeepHallu

Repository files navigation

DeepHallu

On the anaylsis and mitigation of hallucinations in Vision Language Models (VLMs).

About

DeepHallu is a research project focused on developing advanced techniques for analysing and mitigating hallucinations in VLMs.

Features

  • Hallucination Analysis: Advanced algorithms to analyse the hallucination in VLMS, for example, identifying hallucinated content in model outputs and the patterns inside the VLMs.
  • Mitigation Strategies: Techniques to reduce hallucination rates in VLMs.
  • Evaluation Metrics: Comprehensive benchmarks for measuring hallucination rates and model reliability

Datasets and Benchmarks

The following datasets and benchmarks are used in the project:

  1. MME
  2. VQA v2.0
  3. CHAIR
  4. POPE
  5. Llava Bench in the Wild

Details of the datasets and benchmarks are in the data/datasets directory.

Installation

  1. Clone the repository
# Clone the repository
git clone https://github.com/MouYongli/DeepHallu.git
cd DeepHallu
export PROJECT_ROOT=$(pwd)
  1. Setup the environment
conda create -n deephallu python=3.12
conda activate deephallu
  1. Install PyTorch according to your own compute configuration.
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu129
  1. Install the package locally
pip install -e .  

Quick Start

Coming soon! This project is currently in the initial setup phase.

Project Structure

DeepHallu/
├── data/               # Sample datasets and benchmarks
├── docs/               # Documentation
├── examples/           # Example scripts and notebooks
├── notebooks/          # Notebooks
├── scripts/            # Scripts
├── src/                # Main package source code
|   └── deephallu/
|       ├── __init__.py
|       ├── data/
|       └── models/
├── tests/              # Unit tests
├── README.md           # README
├── pyproject.toml      # Project configuration
├── requirements.txt    # Requirements
├── .gitignore          # Git ignore
└── LICENSE             # License

Contributing

We welcome contributions to DeepHallu! Please see our Contributing Guidelines for details on how to get started.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact


Note: This project is currently in active development. Features and API may change.

About

This is the official repo for "DeepHallu: On the Analysis and Mitigation of Hallucination in Vision Language Models"

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages