Hi,
I want to try fastspeech on different dataset. therefore, can you share how to extract alignment from tacotron2?
I tried this code, but get bad result for synthesis when inference long sentence .
_, _, _, alignments = model.inference(sequence)
d = alignments.float().data.cpu().numpy()[0].T
x = np.zeros(d.shape[0])
for i,y in enumerate(d):
x[i] = y.sum()
np.save("path_to_save_folder"+name+".npy",x.astype(np.dtype('i4')))
Thank you.
Hi,
I want to try fastspeech on different dataset. therefore, can you share how to extract alignment from tacotron2?
I tried this code, but get bad result for synthesis when inference long sentence .
_, _, _, alignments = model.inference(sequence)d = alignments.float().data.cpu().numpy()[0].Tx = np.zeros(d.shape[0])for i,y in enumerate(d):x[i] = y.sum()np.save("path_to_save_folder"+name+".npy",x.astype(np.dtype('i4')))Thank you.