In paper, they use `Soft Dynamic Time Warping` in KL loss. In your code, I didn't find it. So, is the code in the progress? or any other reason?