GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain Tumor Segmentation on mp-MRI

Official repository for the paper: "GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain Tumor Segmentation on mp-MRI".

Overview

GBT-SAM adapts the Segment Anything Model (SAM) to volumetric multi-parametric MRI (mp-MRI) data for brain tumor segmentation. Vision foundation models such as SAM typically accept 3-channel inputs and process each 2D slice of a 3D volume in isolation, ignoring inter-slice spatial continuity. GBT-SAM addresses both limitations with a 4-channel multi-modal patch embedding layer and a lightweight Depth-Condition module that captures inter-slice dependencies efficiently.
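To make the 4-channel patch embedding concrete, here is a minimal pure-Python sketch of a patch projection over a multi-channel slice. It is not the repository's implementation: the function name `patch_embed`, the toy sizes, and the all-ones weights are assumptions chosen for illustration. It relies only on the fact that a patch embedding is a linear projection of each non-overlapping patch, flattened across all input channels.

```python
# Illustrative only: names and sizes are assumptions, not taken from the repository.

def patch_embed(image, weight, bias, patch):
    """Project non-overlapping patches of a multi-channel slice to tokens.

    image:  [C][H][W] nested lists (C = 4 for T1, T2, T1c, T2-FLAIR)
    weight: [E][C * patch * patch] projection matrix
    bias:   [E]
    Returns a list of tokens in row-major patch order, each of length E.
    """
    channels, height, width = len(image), len(image[0]), len(image[0][0])
    tokens = []
    for top in range(0, height - patch + 1, patch):
        for left in range(0, width - patch + 1, patch):
            # Flatten the patch across ALL channels so every modality contributes.
            flat = [
                image[c][top + i][left + j]
                for c in range(channels)
                for i in range(patch)
                for j in range(patch)
            ]
            tokens.append([
                sum(w * v for w, v in zip(row, flat)) + b
                for row, b in zip(weight, bias)
            ])
    return tokens


# Toy 4-channel 4x4 slice: channel c is filled with the constant c + 1.
image = [[[float(c + 1)] * 4 for _ in range(4)] for c in range(4)]
weight = [[1.0] * (4 * 2 * 2)] * 3  # embedding dimension 3, all-ones weights
bias = [0.0, 0.0, 0.0]
tokens = patch_embed(image, weight, bias, patch=2)
```

With all-ones weights, each token entry is the sum of its patch over all four channels, which shows that no modality is discarded before the transformer sees the input.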

Figure 1: Comparison of state-of-the-art brain tumor segmentation frameworks based on accuracy (Dice Score) and number of trainable parameters. GBT-SAM delivers top-tier performance with the lowest parameter footprint among SAM-based alternatives.

Key Features

High-Performance Efficiency: The framework trains only 9.97M parameters while reaching a Dice Score of 92.66 on the in-domain BraTS Adult Glioma test split.
True Multi-modal Processing: Modifies SAM's patch embedding layer to natively accept all four mp-MRI sequences (T1, T2, T1c, and T2-FLAIR) simultaneously, preserving information from every modality.
Depth-Conditioned Context: Incorporates a parallel linear mixing and Multi-Layer Perceptron (MLP) module across the depth dimension to efficiently capture inter-slice volumetric dependencies without heavy 3D operations.
Robust Domain Generalization: Evaluated on four datasets: in-domain on Adult Glioma and zero-shot on Meningioma, Pediatric Glioma, and Sub-Saharan Glioma, with a mean zero-shot Dice Score of 91.50.
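The depth-conditioned mixing described in the features above can be sketched as follows. This is one possible reading of "parallel linear mixing and MLP across the depth dimension", not the repository's code: the residual sum of the two branches, the ReLU activation, and all names (`depth_condition`, `mix`, `w1`, `w2`) are assumptions.

```python
# Interpretation sketch: branch layout, ReLU, and residual sum are assumptions.

def matvec(matrix, vector):
    """Multiply a nested-list matrix by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def depth_condition(features, mix, w1, w2):
    """Mix information across slices for every feature position.

    features: [D][N], one feature vector per slice (D slices, N features)
    mix:      [D][D] linear mixing over depth
    w1, w2:   [H][D] and [D][H] weights of a two-layer MLP over depth
    Returns [D][N]: the input plus both depth branches.
    """
    depth, width = len(features), len(features[0])
    out = [row[:] for row in features]  # residual path
    for n in range(width):
        column = [features[d][n] for d in range(depth)]  # same feature, all slices
        linear = matvec(mix, column)
        hidden = [max(0.0, h) for h in matvec(w1, column)]  # ReLU
        mlp = matvec(w2, hidden)
        for d in range(depth):
            out[d][n] += linear[d] + mlp[d]
    return out


features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]  # 3 slices, 2 features
neighbours = [[0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 0.0]]
w1 = [[0.0] * 3 for _ in range(2)]  # MLP branch switched off for this demo
w2 = [[0.0] * 2 for _ in range(3)]
mixed = depth_condition(features, neighbours, w1, w2)
```

Because the weights act only along the depth axis, their count in this sketch depends on the number of slices and the hidden width, not on the spatial resolution, which is consistent with the README's point about avoiding heavy 3D operations.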

Architecture Overview

Figure 2: Overview of the GBT-SAM architecture and two-step training pipeline.

The network is fine-tuned in two stages:

Stage 1 (Patch Embedding Optimization): Freezes the entire architecture except for the modified 4-channel patch embedding layer, so that the new layer adapts to the multi-modal input.
Stage 2 (Joint Fine-Tuning): Introduces Low-Rank Adaptation (LoRA) modules alongside the Depth-Condition (D.C.) blocks within the Vision Transformer (ViT) layers for specialized domain transfer.
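The two stages above amount to choosing which parameters receive gradients. A minimal sketch of that selection, assuming hypothetical parameter names (`patch_embed`, `lora`, `depth_condition`) that may not match the repository's modules:

```python
# Hypothetical parameter names; the repository's actual module names may differ.

def trainable_parameters(names, stage):
    """Return the subset of parameter names that stay trainable in a stage."""
    if stage == 1:
        keys = ("patch_embed",)  # only the 4-channel patch embedding
    elif stage == 2:
        keys = ("lora", "depth_condition")  # LoRA modules and D.C. blocks
    else:
        raise ValueError(f"unknown stage: {stage}")
    return [name for name in names if any(key in name for key in keys)]


names = [
    "image_encoder.patch_embed.proj.weight",
    "image_encoder.blocks.0.attn.qkv.weight",
    "image_encoder.blocks.0.attn.lora_a.weight",
    "image_encoder.blocks.0.depth_condition.mix.weight",
    "mask_decoder.output.weight",
]
stage1 = trainable_parameters(names, stage=1)
stage2 = trainable_parameters(names, stage=2)
```

In a PyTorch training script the returned names would typically be used to set `requires_grad` on each parameter. Whether the patch embedding remains trainable during Stage 2 is not stated here; the sketch freezes it, which is an assumption.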

During training, only a small subset of consecutive slices is sampled from each volume to limit compute and memory cost. At inference time, a sliding window over the slices produces a dense prediction for the full 3D volume.
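The slice handling can be sketched with two small helpers: one that draws a random run of consecutive slices for training, and one that lists window start positions covering a whole volume at inference. The function names, the window length `k`, and the `stride` parameter are assumptions. How overlapping window predictions are merged is not described here, so the sketch stops at computing the windows.

```python
import random


def sample_consecutive_slices(depth, k, rng=random):
    """Pick k consecutive slice indices from a volume with `depth` slices."""
    if k > depth:
        raise ValueError("window longer than volume")
    start = rng.randrange(depth - k + 1)
    return list(range(start, start + k))


def sliding_windows(depth, k, stride):
    """Start indices of length-k windows that together cover every slice."""
    if k > depth:
        raise ValueError("window longer than volume")
    if not 0 < stride <= k:
        raise ValueError("stride must be in 1..k to avoid gaps")
    starts = list(range(0, depth - k + 1, stride))
    if starts[-1] != depth - k:
        starts.append(depth - k)  # final window clamped to the end of the volume
    return starts


train_indices = sample_consecutive_slices(depth=10, k=4, rng=random.Random(0))
window_starts = sliding_windows(depth=10, k=4, stride=4)
```

For a 10-slice volume with 4-slice windows and stride 4, the windows start at slices 0, 4, and 6; the last one overlaps its predecessor so that the final slices are still covered.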

Experimental Results

The model is trained exclusively on the BraTS Adult Glioma dataset. It is evaluated in-domain on the Adult Glioma test split and zero-shot on three unseen domains (Meningioma, Pediatric Glioma, and Sub-Saharan Glioma) to assess generalizability.

Domain / Evaluation Dataset	Evaluation Type	Dice Score (DS, %)
Adult Glioma ($DS_1$)	In-Domain (Test Split)	92.66
Meningioma ($DS_2$)	Zero-Shot Cross-Domain	91.90
Pediatric Glioma ($DS_3$)	Zero-Shot Cross-Domain	91.40
Sub-Saharan Glioma ($DS_4$)	Zero-Shot Cross-Domain	91.19
Cross-Domain Mean ($DS_{234}$)	Combined Unseen Average	91.50
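For readers unfamiliar with the metric, the sketch below computes a Dice score on two toy binary masks and reproduces the cross-domain mean from the three zero-shot rows of the table. The `dice_score` helper and its empty-mask convention are assumptions for illustration. The repository's evaluation protocol (for example, per-volume versus per-region averaging) is not described here.

```python
def dice_score(pred, target):
    """Dice = 2 * |intersection| / (|pred| + |target|) for flat 0/1 masks."""
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


pred = [1, 1, 0, 0, 1]
target = [1, 0, 0, 1, 1]
example = dice_score(pred, target)  # 2 * 2 / (3 + 3)

# Zero-shot scores copied from the table (reported on a 0-100 scale).
cross_domain = {
    "Meningioma": 91.90,
    "Pediatric Glioma": 91.40,
    "Sub-Saharan Glioma": 91.19,
}
mean_ds = sum(cross_domain.values()) / len(cross_domain)
```

The unweighted mean of the three zero-shot scores is about 91.497, which rounds to the 91.50 reported as $DS_{234}$.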

Setup and Usage

All requirements, environment configurations, dataset preparation guidelines, and execution commands for training, validation, and inference are detailed in: INSTRUCTIONS.md

Citation

If you find this repository or our architectural approach useful for your research, please cite our work:

@article{diana2025gbtsam,
  title={GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain Tumor Segmentation on mp-MRI},
  author={Diana-Albelda, Cecilia and Alcover-Couso, Roberto and Garc{\'\i}a-Mart{\'\i}n, {\'A}lvaro and Bescos, Jesus and Escudero-Vi{\~n}olo, Marcos},
  journal={arXiv preprint arXiv:2503.04325}, 
  year={2025}
}
