Use Generator, not Decoder. Use forward_batch(..., false), not generate_batch(...). Extract the last token, not the first token.
Use Generator, not Decoder.
Use forward_batch(..., false), not generate_batch(...).
Extract the last token, not the first token.