Code for the paper "It Ain't That Bad: Understanding the Mysterious Performance Drop in OOD Generalization for Generative Transformer Models".
Remark: The mingpt code is cloned from https://github.com/karpathy/minGPT