Skip to content

Latest commit

Β 

History

History
79 lines (57 loc) Β· 2.83 KB

File metadata and controls

79 lines (57 loc) Β· 2.83 KB

T5 From Scratch: A Complete PyTorch Implementation

Work in progress!! (filled with errors rn)

This implementation strictly follows the original T5 paper:

Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., ... & Liu, P. J. (2020). Exploring the limits of transfer learning with a unified text-to-text transformer. Journal of Machine Learning Research, 21(140), 1-67.