Objective evaluation #3

@yxlu-0102

I synthesised waveforms with your official checkpoint on the test set of VCTK-Corpus-0.92, which consists of the audio clips from the last 8 speakers.

I then calculated the LSD and SNR scores between the generated and reference audio, but the results are worse than those reported in your paper.

Additionally, the LSD calculation in util.util.compute_metrics looks strange: I would expect n_fft to be 2048, but your default setting is 1024.
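For reference, here is a minimal sketch of how I understand the two metrics, so the comparison is reproducible. It is not the code from util.util.compute_metrics. The function names (`stft_power`, `lsd`, `snr`), the Hann window, and the hop size of 512 are my own assumptions. LSD is taken as the frame-averaged root-mean-square difference of log10 power spectra, and SNR treats the reference minus the generated signal as noise.

```python
import numpy as np


def stft_power(x, n_fft=2048, hop=512):
    """Power spectrogram from a plain framed FFT (hypothetical helper).

    The Hann window and hop=512 are assumptions, not values taken from the repo.
    Requires len(x) >= n_fft.
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack(
        [x[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1)) ** 2


def lsd(ref, gen, n_fft=2048, hop=512, eps=1e-10):
    """Log-spectral distance: RMS over frequency, then mean over frames."""
    n = min(len(ref), len(gen))  # align lengths before comparing
    p_ref = stft_power(ref[:n], n_fft, hop)
    p_gen = stft_power(gen[:n], n_fft, hop)
    diff = np.log10(p_ref + eps) - np.log10(p_gen + eps)
    return float(np.mean(np.sqrt(np.mean(diff ** 2, axis=1))))


def snr(ref, gen, eps=1e-10):
    """Signal-to-noise ratio in dB, with (ref - gen) treated as noise."""
    n = min(len(ref), len(gen))
    noise = ref[:n] - gen[:n]
    return float(10 * np.log10(np.sum(ref[:n] ** 2) / (np.sum(noise ** 2) + eps)))
```

Calling `lsd(ref, gen, n_fft=1024)` and `lsd(ref, gen, n_fft=2048)` on the same pair of clips shows how much the FFT size alone shifts the score, which may account for part of the gap.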
