To train on KITTI, run:
python train.py ./args_files/hisfog/kitti/cvnXt_H_320x1024.txtFor instructions on downloading the KITTI dataset, see Monodepth2
To finetune on KITTI, run:
python ./finetune/train_ft_SQLdepth.py ./conf/cvnXt.txt ./finetune/txt_args/train/inc_kitti.txtTo train on CityScapes, run:
python train.py ./args_files/args_cityscapes_train.txtTo finetune on CityScapes, run:
python train.py ./args_files/args_cityscapes_finetune.txtFor preparing cityscapes dataset, please refer to SfMLearner's prepare_train_data.py script. We used the following command:
python prepare_train_data.py \
--img_height 512 \
--img_width 1024 \
--dataset_dir <path_to_downloaded_cityscapes_data> \
--dataset_name cityscapes \
--dump_root <your_preprocessed_cityscapes_path> \
--seq_length 3 \
--num_threads 8You can download weights for some pretrained models here:
To evaluate a model on KITTI, run:
python evaluate_depth_config.py args_files/hisfog/kitti/cvnXt_H_320x1024.txtMake sure you have first run export_gt_depth.py to extract ground truth files.
And to evaluate a model on Cityscapes, run:
python ./tools/evaluate_depth_cityscapes_config.py args_files/args_cvnXt_H_cityscapes_finetune_eval.txtThe ground truth depth files can be found at HERE,
Download this and unzip into splits/cityscapes.
python test_simple_SQL_config.py ./conf/cvnXt.txtIn ./conf/cvnXt.txt, you can set --image_path to a single image or a directory of images.
If you find this project useful for your research, please consider citing:
@InProceedings{Lavreniuk_2025_CVPR,
author = {Lavreniuk, Mykola and Lavreniuk, Alla},
title = {SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation},
booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Workshops},
month = {June},
year = {2025},
pages = {874-884}
}
This project is built on top of SQLdepth, and we are grateful for their outstanding contributions.