[CVPR 2025] SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation

Training

To train on KITTI, run:

python train.py ./args_files/hisfog/kitti/cvnXt_H_320x1024.txt

For instructions on downloading the KITTI dataset, see Monodepth2.
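The training command above takes one argument: a text file of options. As a minimal sketch of how such a file can be read with Python's argparse, assuming the file lists flags in `--name value` form, the example below expands an arguments file into ordinary command-line options. The option names (`--height`, `--width`, `--dataset`) and the `ArgsFileParser` class are hypothetical and chosen only for illustration. They are not taken from this repository's options.py.

```python
import argparse
import tempfile
from pathlib import Path


class ArgsFileParser(argparse.ArgumentParser):
    """ArgumentParser that accepts several tokens per line in an @file."""

    def convert_arg_line_to_args(self, arg_line):
        # Skip blank lines and comment lines; split everything else on whitespace
        # so that a line such as "--height 320" yields two separate tokens.
        stripped = arg_line.strip()
        if not stripped or stripped.startswith("#"):
            return []
        return stripped.split()


def parse_args_file(path):
    # Hypothetical options, for illustration only.
    parser = ArgsFileParser(fromfile_prefix_chars="@")
    parser.add_argument("--height", type=int, default=192)
    parser.add_argument("--width", type=int, default=640)
    parser.add_argument("--dataset", type=str, default="kitti")
    # Prefixing the path with "@" tells argparse to read arguments from the file.
    return parser.parse_args(["@" + str(path)])


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        args_file = Path(tmp) / "example_args.txt"
        args_file.write_text("--height 320\n--width 1024\n# a comment\n")
        opts = parse_args_file(args_file)
        print(opts.height, opts.width, opts.dataset)
```

Keeping every option in a file makes a run reproducible: the same file can be passed to training and to evaluation.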

To finetune on KITTI, run:

python ./finetune/train_ft_SQLdepth.py ./conf/cvnXt.txt ./finetune/txt_args/train/inc_kitti.txt

To train on Cityscapes, run:

python train.py ./args_files/args_cityscapes_train.txt

To finetune on Cityscapes, run:

python train.py ./args_files/args_cityscapes_finetune.txt

To prepare the Cityscapes dataset, use SfMLearner's prepare_train_data.py script. We used the following command:

python prepare_train_data.py \
    --img_height 512 \
    --img_width 1024 \
    --dataset_dir <path_to_downloaded_cityscapes_data> \
    --dataset_name cityscapes \
    --dump_root <your_preprocessed_cityscapes_path> \
    --seq_length 3 \
    --num_threads 8

Pretrained weights and evaluation

You can download weights for some pretrained models here:

To evaluate a model on KITTI, run:

python evaluate_depth_config.py args_files/hisfog/kitti/cvnXt_H_320x1024.txt

Make sure you have first run export_gt_depth.py to extract ground truth files.

To evaluate a model on Cityscapes, run:

python ./tools/evaluate_depth_cityscapes_config.py args_files/args_cvnXt_H_cityscapes_finetune_eval.txt

The ground truth depth files can be found HERE. Download the archive and unzip it into splits/cityscapes.
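The unzip step can also be done from Python. The sketch below uses only the standard library. The archive name `gt_depths.zip` and the helper `extract_ground_truth` are hypothetical; substitute the name of the file you actually downloaded.

```python
import tempfile
import zipfile
from pathlib import Path


def extract_ground_truth(archive_path, target_dir="splits/cityscapes"):
    """Extract a downloaded archive into target_dir and list what landed there."""
    target = Path(target_dir)
    # Create the target directory if it does not exist yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target)
    # Return the top-level entries so the result is easy to inspect.
    return sorted(p.name for p in target.iterdir())


if __name__ == "__main__":
    # Self-contained demo: build a tiny stand-in archive, then extract it.
    with tempfile.TemporaryDirectory() as tmp:
        archive_path = Path(tmp) / "gt_depths.zip"  # hypothetical archive name
        with zipfile.ZipFile(archive_path, "w") as archive:
            archive.writestr("gt_depths/example.npy", b"placeholder")
        print(extract_ground_truth(archive_path, Path(tmp) / "splits" / "cityscapes"))
```

In a real checkout, call `extract_ground_truth` from the repository root so that the default `splits/cityscapes` resolves to the right place.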

Inference with your own images

python test_simple_SQL_config.py ./conf/cvnXt.txt

In ./conf/cvnXt.txt, you can set --image_path to a single image or a directory of images.
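To illustrate the "single image or directory" behaviour, here is a minimal sketch of how a script could turn one `--image_path` value into a list of files to process. It is an assumption about a reasonable implementation, not a copy of test_simple_SQL_config.py. The helper `collect_image_paths` and the accepted extensions are hypothetical.

```python
import tempfile
from pathlib import Path

# Assumed set of image extensions; adjust to match your data.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def collect_image_paths(image_path):
    """Return a list of image files for a path that is a file or a directory."""
    path = Path(image_path)
    if path.is_file():
        # A single image: process just that file.
        return [path]
    if path.is_dir():
        # A directory: process every image directly inside it, in sorted order.
        return sorted(
            p for p in path.iterdir()
            if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
        )
    raise FileNotFoundError(f"No such file or directory: {image_path}")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("b.png", "a.jpg", "notes.txt"):
            (Path(tmp) / name).write_bytes(b"")
        print([p.name for p in collect_image_paths(tmp)])
```

Sorting the directory listing keeps the processing order stable between runs, which makes outputs easier to compare.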

Citation

If you find this project useful for your research, please consider citing:

@InProceedings{Lavreniuk_2025_CVPR,
    author    = {Lavreniuk, Mykola and Lavreniuk, Alla},
    title     = {SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation},
    booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Workshops},
    month     = {June},
    year      = {2025},
    pages     = {874-884}
}

Acknowledgement

This project is built on top of SQLdepth, and we are grateful for their outstanding contributions.
