Decoding in particular wastes a lot of time waiting for parts of the data that happen to contain very long utterances. The simpler alternative is to shuffle the data, though this has its own problems.
Decoding in particular wastes a lot of time waiting for parts of the data that happen to contain very long utterances.
The simpler alternative is to shuffle the data, though this has its own problems.