New Words From Text Files #2

@rhdunn

This covers automatically adding words from a source text (e.g. a Project Gutenberg eText) that are not in the dictionary. It includes automatically generating the pronunciations from a source (e.g. a Text-to-Speech language) and reviewing those pronunciations.

This covers the following command-line script using the cainteoir-engine dictionary command:

cat FILENAME | sed -e 's/-/ /g' | dictionary -d DICTIONARY -np

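NOTE: The sed -e 's/-/ /g' part replaces every hyphen with a space, so hyphenated words are split into their component words instead of being handled as a unit. This should be optional, and possibly more sophisticated.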
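
One way the hyphen handling could be made optional is sketched below. This is only an illustration of the preprocessing step: the `preprocess` function and the `SPLIT_HYPHENS` variable are hypothetical names introduced here, not part of cainteoir-engine, and the `dictionary` invocation in the comment is the one quoted above.

```shell
#!/bin/sh
# Hypothetical wrapper around the hyphen-splitting step.
# SPLIT_HYPHENS is an assumed environment variable (default: 1).
preprocess() {
    if [ "${SPLIT_HYPHENS:-1}" = "1" ]; then
        # Replace every hyphen with a space, as in the original pipeline.
        sed -e 's/-/ /g'
    else
        # Pass the text through unchanged, keeping hyphenated words intact.
        cat
    fi
}

# Intended use, with the command from this issue:
#   preprocess < FILENAME | dictionary -d DICTIONARY -np
```

Setting `SPLIT_HYPHENS=0` before running the pipeline would leave hyphenated words as they are, while the default keeps the current behaviour.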
