numeralbank/googleuninum

CLDF dataset derived from Ritchie et al.'s "UniNum: A Database of Number Names for 186 Languages" from 2019

How to cite

If you use these data, please cite

  • the original source

    Ritchie, S., Sproat, R., Gorman, K., van Esch, D., Schallhart, C., Bampounis, N., Brard, B., Mortensen, J. F., Holt, M., and Mahon, E. 2019. Unified verbalization for speech recognition & synthesis across languages. In Proc. INTERSPEECH, pages 3530-3534.

  • the derived dataset, using the DOI of the particular released version you used

Description

A collection of numerals ranging between 0 and 100000000000 (inclusive), provided by Google and language experts.

This dataset is licensed under a CC-BY-4.0 license.

Available online at https://github.com/google/uninum

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 100% Concepticon: 100% Source: 100%

  • Varieties: 182
  • Concepts: 111
  • Lexemes: 19,877
  • Sources: 1
  • Synonymy: 1.01
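
Figures like these can be recomputed from a CLDF form table. The following is a minimal sketch, not the tooling that produced the numbers above. It assumes the rows use the default CLDF column names `Language_ID` and `Parameter_ID`, and it assumes that synonymy means the average number of forms per (variety, concept) pair. The helper names `wordlist_stats` and `read_forms` are hypothetical.

```python
import csv
from collections import defaultdict


def wordlist_stats(rows):
    """Summarize form rows given as dicts with Language_ID and Parameter_ID keys."""
    # Count how many forms each (variety, concept) pair has.
    forms_per_pair = defaultdict(int)
    for row in rows:
        forms_per_pair[(row["Language_ID"], row["Parameter_ID"])] += 1

    lexemes = sum(forms_per_pair.values())
    return {
        "varieties": len({language for language, _ in forms_per_pair}),
        "concepts": len({concept for _, concept in forms_per_pair}),
        "lexemes": lexemes,
        # Assumed definition: mean number of forms per (variety, concept) pair.
        "synonymy": lexemes / len(forms_per_pair) if forms_per_pair else 0.0,
    }


def read_forms(path):
    """Read a CSV form table into a list of dicts (the path is supplied by the caller)."""
    with open(path, encoding="utf-8", newline="") as handle:
        return list(csv.DictReader(handle))
```

Assuming the form table lives at `cldf/forms.csv` (the conventional location, not confirmed by this README), `wordlist_stats(read_forms("cldf/forms.csv"))` would return the four counts as a dictionary.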

Contributors

Name | GitHub user | Description | Role
--- | --- | --- | ---
Christoph Rzymski | @chrzyki | patron, code | Other

CLDF Datasets

The following CLDF datasets are available in cldf:
