On the anaylsis and mitigation of hallucinations in Vision Language Models (VLMs).
DeepHallu is a research project focused on developing advanced techniques for analysing and mitigating hallucinations in VLMs.
- Hallucination Analysis: Advanced algorithms to analyse the hallucination in VLMS, for example, identifying hallucinated content in model outputs and the patterns inside the VLMs.
- Mitigation Strategies: Techniques to reduce hallucination rates in VLMs.
- Evaluation Metrics: Comprehensive benchmarks for measuring hallucination rates and model reliability
The following datasets and benchmarks are used in the project:
- MME
- VQA v2.0
- CHAIR
- POPE
- Llava Bench in the Wild
Details of the datasets and benchmarks are in the data/datasets directory.
- Clone the repository
# Clone the repository
git clone https://github.com/MouYongli/DeepHallu.git
cd DeepHallu
export PROJECT_ROOT=$(pwd)- Setup the environment
conda create -n deephallu python=3.12
conda activate deephallu- Install PyTorch according to your own compute configuration.
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu129- Install the package locally
pip install -e . Coming soon! This project is currently in the initial setup phase.
DeepHallu/
├── data/ # Sample datasets and benchmarks
├── docs/ # Documentation
├── examples/ # Example scripts and notebooks
├── notebooks/ # Notebooks
├── scripts/ # Scripts
├── src/ # Main package source code
| └── deephallu/
| ├── __init__.py
| ├── data/
| └── models/
├── tests/ # Unit tests
├── README.md # README
├── pyproject.toml # Project configuration
├── requirements.txt # Requirements
├── .gitignore # Git ignore
└── LICENSE # License
We welcome contributions to DeepHallu! Please see our Contributing Guidelines for details on how to get started.
This project is licensed under the MIT License - see the LICENSE file for details.
- Author: Yongli Mou, Er Jin, Johannes Stegmaier, Shin'ichi Satoh, Stefan Decker
- Email: mou@dbis.rwth-aachen.de
- GitHub: @MouYongli
- Website: https://mouyongli.github.io/
Note: This project is currently in active development. Features and API may change.