SleepGPT: A Unified Time-Frequency Foundation Model for Sleep Decoding

SleepGPT is a foundation model for comprehensive sleep decoding. It is implemented with PyTorch Lightning and trained with generative pretraining, and it generalizes across multiple sleep-related tasks and heterogeneous polysomnography (PSG) datasets. The model integrates time- and frequency-domain information in a unified transformer framework and adapts dynamically to varying EEG channel configurations.

🚀 Key Features

Pretrained on over 86,000 hours of PSG recordings from 8,377 subjects
Supports multiple tasks: sleep staging, spindle detection, apnea classification, and signal generation
Unified time-frequency transformer architecture
Channel-adaptive fusion mechanism for diverse PSG configurations
Compatible with more than 10 public PSG datasets
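
As a rough illustration of what adapting to diverse PSG configurations involves on the input side, the sketch below maps a recording with only some channels onto a fixed channel order and records which slots are filled. The channel names, the canonical order, and both helper functions are hypothetical; the actual fusion mechanism is defined in `backbone.py` and is not reproduced here.

```python
# Hypothetical canonical channel order (an assumption for illustration only).
CANONICAL_CHANNELS = ["C3", "C4", "EOG", "EMG", "F3", "Fpz", "O1", "Pz"]


def build_channel_mask(available):
    """Return one flag per canonical channel: 1 if the recording has it, else 0."""
    present = set(available)
    unknown = present - set(CANONICAL_CHANNELS)
    if unknown:
        raise ValueError(f"unknown channels: {sorted(unknown)}")
    return [1 if ch in present else 0 for ch in CANONICAL_CHANNELS]


def pad_to_canonical(signals, n_samples):
    """Place each available channel in its canonical slot; zero-fill missing ones."""
    return [
        list(signals[ch]) if ch in signals else [0.0] * n_samples
        for ch in CANONICAL_CHANNELS
    ]


# Usage: a two-channel recording with three samples per channel.
recording = {"Fpz": [0.1, 0.2, 0.3], "Pz": [0.0, -0.1, 0.1]}
mask = build_channel_mask(recording)
padded = pad_to_canonical(recording, n_samples=3)
```

A mask like this could then tell an attention layer which channel slots to ignore, so the same model accepts recordings with different montages.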

📦 Repository Structure

🧠 Model Components

File	Description
`backbone.py`	Main model with time-frequency fusion and attention handling
`multiway_transformer.py`	Core domain-aware transformer encoder
`Swin_transformer.py`	Global-context encoder based on Swin Transformer
`backbone_pretrain.py`	Self-supervised pretraining variant
`heads.py`	Pooling and projection heads for classification tasks
`objectives.py`	Implements contrastive, classification, and reconstruction losses
`get_optm.py`	Optimizer and learning rate scheduler setup

📚 Dataset and DataModule

File/Folder	Description
`*_datamodule.py`	Lightning DataModules for datasets like MASS, SHHS, EDF, ISRUC, Apnea, etc.
`*_dataset.py`	Dataset implementations with task-specific processing
`BaseDataModule.py`	Base class for all DataModules
`new_base_dataset.py`	Base class for all Datasets

🧰 Utilities

File	Description
`my_metrics.py`	Custom metrics: Accuracy, Scalar, etc.
`transform.py`	Frequency-based and dual-stream data augmentation
`others.py`	Loss functions: Focal, Dice, Weighted BCE
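
For readers unfamiliar with the losses named above, here is a minimal sketch of the standard binary focal loss and soft Dice loss in plain Python. It follows the textbook definitions; the function names, default hyperparameters, and list-based inputs are assumptions and may differ from what `others.py` implements.

```python
import math


def binary_focal_loss(probs, targets, gamma=2.0, alpha=0.25, eps=1e-7):
    """Mean binary focal loss: -alpha_t * (1 - p_t)**gamma * log(p_t)."""
    total = 0.0
    for p, y in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)             # clamp to avoid log(0)
        p_t = p if y == 1 else 1.0 - p              # probability of the true class
        alpha_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
        total += -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(probs)


def soft_dice_loss(probs, targets, smooth=1.0):
    """1 - Dice coefficient, computed on soft (probabilistic) predictions."""
    intersection = sum(p * y for p, y in zip(probs, targets))
    denom = sum(probs) + sum(targets)
    return 1.0 - (2.0 * intersection + smooth) / (denom + smooth)


# Usage: a confident correct prediction contributes less than an uncertain one.
easy = binary_focal_loss([0.9], [1])
hard = binary_focal_loss([0.6], [1])
```

The `(1 - p_t)**gamma` factor is what down-weights easy examples, which is useful when one class (for example, spindle versus background) is rare.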

📊 Visualization

Scripts under Visualization/ include:

visual_mask.py: attention mask heatmaps
visual_fft.py: frequency domain plots
visual_umap.py: embedding space visualization
visual_spindles.py: spindle detection overlays
visual_portion.py: per-epoch prediction visualization
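
As a rough idea of what a frequency-domain plot of an EEG epoch is built from, the sketch below computes a one-sided amplitude spectrum with a naive discrete Fourier transform. It is an illustration only: the function name, the 100 Hz sampling rate, and the synthetic 12 Hz test signal are assumptions, and `visual_fft.py` may compute its spectra differently.

```python
import cmath
import math


def amplitude_spectrum(signal, fs):
    """Return (freqs_hz, amplitudes) for the non-negative-frequency DFT bins."""
    n = len(signal)
    freqs, amps = [], []
    for k in range(n // 2 + 1):
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)
        )
        # DC and (for even n) Nyquist bins have no mirrored partner, so no doubling.
        scale = 1.0 if k == 0 or (n % 2 == 0 and k == n // 2) else 2.0
        freqs.append(k * fs / n)
        amps.append(scale * abs(coeff) / n)
    return freqs, amps


fs = 100  # Hz, assumed sampling rate
signal = [math.sin(2 * math.pi * 12 * t / fs) for t in range(fs)]  # 1 s at 12 Hz
freqs, amps = amplitude_spectrum(signal, fs)
peak_hz = freqs[max(range(len(amps)), key=amps.__getitem__)]
```

In practice a fast Fourier transform from a numerical library would replace the quadratic-time loop; the naive form is used here only to keep the example dependency-free.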

🧹 Preprocessing

File/Folder	Description
`preprocessing.py`	Preprocesses raw PSG into HDF5 (h5) or PyArrow format and normalizes channels
`generate_list.py`	Generate index and dataset split metadata
`cap/`, `edf/`, ...	Subdirectories for dataset-specific preprocessing scripts

🧪 Training & Evaluation

File	Purpose
`main.py`	Launch main training procedure
`main_kfold.py`	K-fold training
`main_test_kfold.py`	K-fold evaluation
`main_test_kfold_persub.py`	Per-subject evaluation mode
`.sh` files	Slurm / shell job scripts
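
The README does not state how the k-fold scripts draw their folds. Assuming folds are split by subject, so that no subject appears in both training and test data, a minimal sketch could look like this. The function name and subject identifiers are hypothetical.

```python
import random


def subject_kfold(subject_ids, k=5, seed=0):
    """Yield (train_subjects, test_subjects) pairs with no subject in both sets."""
    subjects = sorted(set(subject_ids))
    if k < 2 or k > len(subjects):
        raise ValueError("k must be between 2 and the number of subjects")
    random.Random(seed).shuffle(subjects)       # deterministic shuffle for a given seed
    folds = [subjects[i::k] for i in range(k)]  # round-robin: sizes differ by at most 1
    for i in range(k):
        test = folds[i]
        train = [s for j, fold in enumerate(folds) if j != i for s in fold]
        yield train, test


# Usage: 10 hypothetical subjects, 5 folds.
splits = list(subject_kfold([f"sub-{n:02d}" for n in range(10)], k=5))
```

Splitting by subject rather than by epoch avoids leaking a subject's recordings across the train/test boundary, which would otherwise inflate evaluation scores.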

⚙️ Getting Started

1. Install Dependencies

conda create -n sleepgpt python=3.8
conda activate sleepgpt
pip install -r requirements.txt

2. Preprocess Your Dataset

python preprocessing/dataset/preprocessing.py

3. Launch Training

To run experiments, use the provided SLURM scripts. All configurations are managed using Sacred, allowing you to define experiments by name.

🔧 Pretraining

Pretraining runs use main.py. You can launch it with SLURM like this:

sbatch Pt_unify_slurm.sh

Internally, it uses:

srun python3 main.py with pretrain_shhs_stage2 SHHS1_WM_datasets

pretrain_shhs_stage2: pretraining mode configuration
SHHS1_WM_datasets: dataset loader setup
Additional arguments (e.g. mask_ratio, loss_function, model_arch) are passed via CLI.
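
To make the layering concrete, the sketch below mimics how the arguments after `with` could resolve into one configuration: defaults first, then named configurations, then `key=value` overrides. It is a plain-Python approximation, not Sacred's API, and every value in `DEFAULTS` and `NAMED_CONFIGS` is hypothetical; the real definitions live in `config.py`.

```python
# Hypothetical values for illustration; the real ones are defined in config.py.
DEFAULTS = {"mode": "pretrain", "mask_ratio": 0.5, "datasets": []}
NAMED_CONFIGS = {
    "pretrain_shhs_stage2": {"mode": "pretrain", "mask_ratio": 0.75},
    "SHHS1_WM_datasets": {"datasets": ["SHHS1"]},
}


def parse_value(text):
    """Interpret a CLI value as int or float, falling back to a string."""
    for cast in (int, float):
        try:
            return cast(text)
        except ValueError:
            pass
    return text


def resolve(args):
    """Merge defaults, then named configs, then key=value overrides."""
    config = dict(DEFAULTS)
    overrides = {}
    for arg in args:
        if "=" in arg:                      # key=value override, applied last
            key, value = arg.split("=", 1)
            overrides[key] = parse_value(value)
        elif arg in NAMED_CONFIGS:          # named configuration
            config.update(NAMED_CONFIGS[arg])
        else:
            raise KeyError(f"unknown named config: {arg}")
    config.update(overrides)
    return config


# Usage: equivalent in spirit to
#   python3 main.py with pretrain_shhs_stage2 SHHS1_WM_datasets mask_ratio=0.6
cfg = resolve(["pretrain_shhs_stage2", "SHHS1_WM_datasets", "mask_ratio=0.6"])
```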

💾 Pretrained Checkpoint

We provide a pretrained checkpoint that can be used for downstream tasks such as sleep staging and spindle detection.

Download link: Google Drive

To use the checkpoint, specify the load_path in your training or fine-tuning script:

load_path=/your/path/to/ModelCheckpoint.ckpt

🧪 Fine-tuning (K-Fold)

Fine-tuning runs use main_kfold.py, usually with k-fold evaluation and resume support.

Launch with, for example, the MASS SS2 dataset:

sbatch Start_ft_mass_stage_p.sh

Internally:

srun python3 main_kfold.py with finetune_mass_stage MASS2_datasets

finetune_mass_stage: fine-tuning mode configuration (e.g. lr schedule, decoder head)
MASS2_datasets: MASS dataset loader with augmentation & label mapping

All configurations are defined in config.py, so you don't need to modify code; just pass the configuration names.

📂 Supported Tasks

💤 Sleep staging
⚡ Sleep signal generation
🫁 Sleep-related pathology classification
🌙 Sleep spindle detection

🔍 Demo: Visualizing Masked Reconstruction

See masked_reconstruction_demo.md for a full explanation and how to run the visualization script.
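
For orientation, the general idea behind masked reconstruction can be sketched as follows: hide a fraction of signal patches, then score a reconstruction only on the hidden ones. This is a generic illustration under assumed names (`random_patch_mask`, `masked_mse`) and an assumed uniform random masking strategy; the repository's own masking and loss may differ.

```python
import random


def random_patch_mask(n_patches, mask_ratio, seed=0):
    """Return a list of booleans; True marks a patch hidden from the encoder."""
    n_masked = int(round(n_patches * mask_ratio))
    masked = set(random.Random(seed).sample(range(n_patches), n_masked))
    return [i in masked for i in range(n_patches)]


def masked_mse(target, prediction, mask):
    """Mean squared error averaged over masked patches only."""
    errors = [
        sum((t - p) ** 2 for t, p in zip(tp, pp)) / len(tp)
        for tp, pp, m in zip(target, prediction, mask)
        if m
    ]
    return sum(errors) / len(errors) if errors else 0.0


# Usage: 8 patches of 4 samples each, 75% of them masked.
patches = [[float(i + j) for j in range(4)] for i in range(8)]
mask = random_patch_mask(len(patches), mask_ratio=0.75)
zeros = [[0.0] * 4 for _ in patches]   # a deliberately poor "reconstruction"
loss = masked_mse(patches, zeros, mask)
```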

📝 Citation

If you use SleepGPT in your research, please cite:

@article{huang2026unified,
  title={A unified time-frequency foundation model for sleep decoding},
  author={Huang, Weixuan and Wang, Yan and Cheng, Hanrong and Xu, Wei and Li, Tingyue and Wu, Xiuwen and Xu, Hui and Liao, Pan and Cui, Zaixu and Zou, Qihong and others},
  journal={Nature Communications},
  year={2026},
  publisher={Nature Publishing Group UK London}
}

📬 Contact

Maintainer: Weixuan Huang
Institution: Peking University
