Welcome to BaMCo, a novel framework for multimodal, knowledge-driven biomedical Visual Question Answering (VQA). This repository contains the implementation of the BaMCo paper, accepted to MICCAI 2025.
```bash
git clone https://github.com/yaziciz/BaMCo.git
cd BaMCo
conda env create -f environment.yml
conda activate bamco
```

- Place your datasets under the appropriate folders in `KSpace/Datasets/`, or use the predefined datasets: Slake, PathVQA, and VQA-RAD (a quick folder check is sketched below).
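As a rough sanity check (an assumption about the layout, not code from the repository), the snippet below reports whether the predefined dataset folders are present under `KSpace/Datasets/`; the exact folder names expected by the data loaders may differ.

```python
# Hypothetical sanity check: confirm the dataset folders referenced above exist.
# The folder names below are assumptions; consult the dataset loaders for the exact layout.
from pathlib import Path

datasets_root = Path("KSpace/Datasets")
for name in ("Slake", "PathVQA", "VQA-RAD"):
    status = "found" if (datasets_root / name).is_dir() else "missing"
    print(f"{name}: {status}")
```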
- **VQA Model:** Download `pytorch_model_best.bin` from the Hugging Face BaMCo Collection and place it in `VQA/src/checkpoints/`.
- **Knowledge Encoder:** Download `<Dataset>_KnowledgeSpace.pt` from the Google Drive Knowledge Space Weights and place it in `KSpace/src/checkpoints/`.
- Edit `main.py` in both `VQA/src/` and `KSpace/src/` to point to the correct checkpoint files, as described in the respective `Readme.md` files in each `checkpoints/` directory (a loading sanity check is sketched after this list).
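Once the weights are in place, they should deserialize with `torch.load`. The snippet below is a minimal sketch assuming the paths above and the Slake knowledge space; the actual loading logic lives in the two `main.py` files and may differ.

```python
# Minimal sketch (not the repository's loading code): verify that the downloaded
# checkpoints deserialize from the locations described above.
import torch

vqa_ckpt_path = "VQA/src/checkpoints/pytorch_model_best.bin"
# "Slake" is an example; use the knowledge space file matching your dataset.
kspace_ckpt_path = "KSpace/src/checkpoints/Slake_KnowledgeSpace.pt"

vqa_state = torch.load(vqa_ckpt_path, map_location="cpu")
kspace_state = torch.load(kspace_ckpt_path, map_location="cpu")

print(f"VQA checkpoint entries: {len(vqa_state)}")
print(f"Knowledge encoder checkpoint entries: {len(kspace_state)}")
```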
- **KSpace:** Scripts for constructing and encoding biomedical knowledge sources.
- **VQA:** End-to-end VQA pipeline, including data loading, model training, evaluation, and inference.
- **Checkpoints:** Store and manage pretrained model weights for both the knowledge encoders and the VQA models.
We appreciate your interest! If you use or refer to BaMCo in your research, please cite us. The citation will be updated soon!
```bibtex
@inproceedings{BaMCo_MICCAI2025,
  title     = {BaMCo: Balanced Multimodal Contrastive Learning for Knowledge-Driven Medical VQA},
  author    = {Ziya Ata Yazici and Hazım Kemal Ekenel},
  booktitle = {International Conference on Medical Image Computing and Computer-Assisted Intervention},
  year      = {2025}
}
```

For questions, issues, or contributions, please open an issue or pull request on GitHub.