festvox_frontend_label

This is a minimal example of how to use Festvox (http://festvox.org/) to create labels in .utt format for your data. You must provide your utterances in .wav format, their transcripts in txt.done.data format, your lexicon in Festival lex.scm format, and your phoneset in Festival phoneset.scm format. You can find examples of these files in the voice directories of existing voices, or read about the formats in the Festival/Festvox documentation.

If you are only using the Festival frontend to create labels for use with a different backend, your phoneset.scm does not need to define most features (nasalized, etc.); it can effectively be just a list of phones in that .scm format. However, make sure that vowels and consonants are marked correctly (vc + -), because the conversion to HTS-style labels uses this information to populate the "syllable vowel" feature.

If you will eventually use the .utt output with HTS or Merlin, avoid numerals as phoneme names (for HTS) and avoid symbols that are used as delimiters in the HTS label file format (for both). It is easier to do this phone mapping at this stage than further down the pipeline.
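The phone naming advice above can be checked before any labels are generated. The following is a minimal sketch, not part of the label script: the function names are hypothetical, the delimiter set is an assumption based on characters commonly seen in HTS full-context labels, and the numeral check is deliberately conservative (it flags any digit anywhere in a name). Adjust both to match your own setup.

```python
# Assumption: characters commonly used as field delimiters in HTS
# full-context labels. Edit this set to match your label format.
HTS_DELIMITERS = set("^-+=@_/:&#$!;|")


def check_phone_name(name):
    """Return a list of problems found in one phone name (empty if none)."""
    problems = []
    # Conservative reading of "avoid numerals": flag any digit in the name.
    if any(ch.isdigit() for ch in name):
        problems.append("contains a numeral")
    bad = sorted(set(name) & HTS_DELIMITERS)
    if bad:
        problems.append("contains delimiter character(s): " + " ".join(bad))
    return problems


def check_phones(names):
    """Map each problematic phone name to its list of problems."""
    report = {}
    for name in names:
        problems = check_phone_name(name)
        if problems:
            report[name] = problems
    return report


if __name__ == "__main__":
    # Hypothetical phone list, for illustration only.
    for phone, problems in check_phones(["a", "sh", "1", "t_h"]).items():
        print(phone, "->", "; ".join(problems))
```

Any phone reported here is a candidate for renaming in phoneset.scm and lex.scm before running the label script.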

To use this script:

1. Create a directory for your voice: mkdir yourvoicename
2. Edit the label script and fill in the global variables with your values.
3. Change into the voice directory: cd yourvoicename
4. Run the label script: ./label

The label files in .utt format will be written under yourvoicename/festival/utt/.
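After a run, it can be useful to confirm that every transcript produced a label file. The sketch below is an illustration under stated assumptions: it assumes each txt.done.data line has the usual Festvox form `( utt_id "transcript" )` and that each output file is named after its utterance ID with a .utt extension. The function names are hypothetical and the paths are passed in by the caller.

```python
import re
from pathlib import Path

# Assumed line form in txt.done.data: ( utt_id "transcript text" )
LINE_RE = re.compile(r'^\(\s*(\S+)\s+"(.*)"\s*\)\s*$')


def read_utt_ids(txt_done_data):
    """Return the utterance IDs listed in a txt.done.data file, in order."""
    ids = []
    text = Path(txt_done_data).read_text(encoding="utf-8")
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            ids.append(match.group(1))
    return ids


def missing_utts(txt_done_data, utt_dir):
    """Return IDs that have a transcript but no <id>.utt file in utt_dir."""
    utt_dir = Path(utt_dir)
    return [
        utt_id
        for utt_id in read_utt_ids(txt_done_data)
        if not (utt_dir / f"{utt_id}.utt").is_file()
    ]
```

For example, `missing_utts("yourvoicename/etc/txt.done.data", "yourvoicename/festival/utt")` would list utterances without labels, assuming the transcripts live under etc/ as in a typical Festvox voice directory. An empty list means every utterance was labeled.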
