Data and Code for Ranking Machine Translation Systems from Human Assessments

This directory contains code and data used to generate different rankings of machine translation systems from human assessment data, as described in:

Adam Lopez. Putting Human Assessments of Machine Translation Systems in Order. Proceedings of the Workshop on Statistical Machine Translation (WMT) 2012.

The data are derived from several past editions of the Workshop on Machine Translation organized by the ACL Special Interest Group on Machine Translation. Note that only data from the 2010 and 2011 editions were used for the paper, although data from the last five workshops are included.

Ranking the Systems

Run the command generate_rankings.sh. This will extract pairwise comparisons from the raw data and run the various ranking algorithms. Most of the code is either in simple bash or python scripts.

Directory structure

raw_data contains the raw assessment data from five incarnations of the workshop, obtained from these public URLs:
bin contains utility scripts in python and bash to extract pairwise rankings from the raw data, compute rankings from tournaments, compute the cost of a feedback arc sets, and compute Spearman's rho.
data contains rankings and intermediate data produced by the scripts. This directory is produced by the top-level script generate_rankings.sh

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
bin		bin
data		data
raw_data		raw_data
.gitignore		.gitignore
README.markdown		README.markdown
do_simulation.sh		do_simulation.sh
generate_rankings.sh		generate_rankings.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Data and Code for Ranking Machine Translation Systems from Human Assessments

Ranking the Systems

Directory structure

About

Uh oh!

Releases

Packages

Languages

alopez/wmt-ranking

Folders and files

Latest commit

History

Repository files navigation

Data and Code for Ranking Machine Translation Systems from Human Assessments

Ranking the Systems

Directory structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages