Fine-Tuning the Multilingual Text-To-Text Transfer Transformer (mT5) for Language Classification

Or: What's the language of a given sentence?

We demonstrate how to fine-tune a pre-trained T5 model and its multimodal extension mT5, evaluating its accuracy, and using it for the language prediction task on GPU with TensorFlow.

Here you find how to to do the following:

Preprosess XNLI dataset
Create new seqIO Task with tf.data.Dataset
Fine-tune mT5 model as well as T5 for the new Task
Log with TensorBoard
Make some predictions with the trained model

Background

T5 was introduced by C. Raffel et al. in the paper Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. T5 is the Text-To-Text Transfer Transformer, which allows converting text-based language problems into a text-to-text format. The authors achieved state-of-the-art performance with Colossal Clean Crawled Corpus (C4) in covering summarization, question answering, text classification, and other English-based NLP problems.

In this notebook, I aim to give a brief overview of T5, explain some of its implications for NLP, and demonstrate how it can be used for language classification task.

The full version of the project is available under the Link

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
xnli_transformer.ipynb		xnli_transformer.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine-Tuning the Multilingual Text-To-Text Transfer Transformer (mT5) for Language Classification

Or: What's the language of a given sentence?

Background

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Fine-Tuning the Multilingual Text-To-Text Transfer Transformer (mT5) for Language Classification

Or: What's the language of a given sentence?

Background

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages