Failure downloading Seamless Align data #41

@lzl-mt

Description

When I follow https://github.com/facebookresearch/seamless_communication/blob/main/docs/m4t/seamless_align_README.md and try to download the dataset with

zcat seamless.dataset.metadata.public.arb-enA.tsv.gz | egrep ^crawl-data | tr '\t' ' ' | build/bin/wet_lines

it raises an error:

[image: error screenshot]

and no wav files are saved.

By the way, this script spends a long time processing, but I can't find anything downloaded in my workspace. Is there a way to save each wav or text file as it is produced during the whole processing stage? Thanks a lot.
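
One way to see intermediate results is to redirect the pipeline's output to files instead of letting it scroll past in the terminal. The sketch below wraps the same command in a shell function. It rests on assumptions that are not confirmed in this issue: that `wet_lines` prints the extracted text lines to stdout and its diagnostics to stderr, and that it only covers the text side of the dataset (so it would not be expected to write wav files itself). The function name `extract_text`, the `WET_LINES` variable, and the optional line limit are hypothetical helpers for illustration, not part of the toolkit.

```shell
#!/usr/bin/env bash
# Sketch: run the README pipeline while saving output as it is produced.
# ASSUMPTION: wet_lines writes extracted lines to stdout and errors to stderr.

# Usage: extract_text METADATA.tsv.gz OUT.txt LOG.txt [MAX_LINES]
#   MAX_LINES > 0 limits the run to the first MAX_LINES crawl-data rows,
#   which is handy for a quick smoke test before the full (slow) run.
extract_text() {
  local meta="$1" out="$2" log="$3" max_lines="${4:-0}"
  # Path to the binary as used in the issue; override via WET_LINES (hypothetical).
  local wet_lines="${WET_LINES:-build/bin/wet_lines}"

  gzip -dc "$meta" \
    | grep -E '^crawl-data' \
    | tr '\t' ' ' \
    | { if [ "$max_lines" -gt 0 ]; then head -n "$max_lines"; else cat; fi; } \
    | "$wet_lines" 2>"$log" \
    > "$out"
}
```

For example, `extract_text seamless.dataset.metadata.public.arb-enA.tsv.gz text.txt wet_lines.log 100` would try only the first 100 rows, after which `tail -f text.txt` in a second terminal shows whether lines are arriving and `wet_lines.log` keeps the error text for the report. If the tool buffers its stdout, lines may show up in chunks rather than one at a time.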
