Skip to content

ICTIR 2025 - Unidentified and Confounded? Understanding Two-Tower Models for Unbiased Learning to Rank

License

Notifications You must be signed in to change notification settings

philipphager/two-tower-confounding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

337 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Understanding Two-Tower Models for Unbiased Learning to Rank

Repository for the paper Unidentified and Confounded? Understanding Two-Tower Models for Unbiased Learning to Rank accepted at ICTIR 2025.

Setup:

This project uses Mamba for environment management. To set up a Python environment, run:

mamba env create -f environment.yml
mamba activate two-tower-confounding

Data

The project uses classic LTR datasets as the foundation for its click simulation. The code supports: MSLR30K, and Yahoo! Webscope, which have to be manually downloaded as of 2025. By default, our code expects raw .zip files under ~/ltr_datasets/download/. But you can change the directory to your preference under: config/config.yaml.

Experiments

We manage our experiments with Hydra, with all code configuations under config/. We also provide scripts for each experiment under scripts/. To begin, make sure all bash scripts are executable:

chmod +x scripts/*.sh  

To run, e.g., a well-specified linear two-tower model trained on users following a linear relevance behavior, you can use:

./scripts/2_model_fit_linear.sh

Optionally, you can launch the job on a SLURM cluster to distribute training jobs:

./scripts/2_model_fit_linear.sh +launcher=slurm

You can edit the launch parameters for SLURM under: config/launcher/slurm.yaml.

Results

We publish all simulation results under results/, orgainzed by the experimental script that created the results. All code for our visualizations is under notebooks/.

Reference

@inproceedings{Hager2025TwoTowers,
  author = {Philipp Hager and Onno Zoeter and Maarten de Rijke},
  title = {Unidentified and Confounded? Understanding Two-Tower Models for Unbiased Learning to Rank},
  booktitle = {Proceedings of the 11th ACM SIGIR / 15th International Conference on Innovative Concepts and Theories in Information Retrieval (ICTIR`25)},
  organization = {ACM},
  year = {2025},
}

License

This repository uses the MIT License.

About

ICTIR 2025 - Unidentified and Confounded? Understanding Two-Tower Models for Unbiased Learning to Rank

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published