From f6ee198151c261af0df75833ab5366f2733a736f Mon Sep 17 00:00:00 2001 From: Tarun Kumar Date: Wed, 20 Mar 2024 14:00:35 +0530 Subject: [PATCH] Update README.md A small typo --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 11ac49a4..6b577968 100644 --- a/README.md +++ b/README.md @@ -29,7 +29,7 @@ pip install pandas singleton-decorator datasets "transformers<4.33.3" accelerate ## Preprocessing Please refer to the notebook [preprocess.ipynb](https://github.com/yl4579/PL-BERT/blob/main/preprocess.ipynb) for more details. The preprocessing is for English Wikipedia dataset only. I will make a new branch for Japanese if I have extra time to demostrate training on other languages. You may also refer to [#6](https://github.com/yl4579/PL-BERT/issues/6#issuecomment-1797869275) for preprocessing in other languages like Japanese. -## Trianing +## Training Please run each cell in the notebook [train.ipynb](https://github.com/yl4579/PL-BERT/blob/main/train.ipynb). You will need to change the line `config_path = "Configs/config.yml"` in cell 2 if you wish to use a different config file. The training code is in Jupyter notebook primarily because the initial epxeriment was conducted in Jupyter notebook, but you can easily make it a Python script if you want to.