GitHub - Mint-hfut/One2MultiSeq

This repository contains the code for our paper: Training with One2MultiSeq: CopyBART for Social Media Keyphrase Generation.

Dataset

The datasets can be downloaded from here

For more details about the Twitter dataset, please see here or contact us at gaochunyang@mail.hfut.edu.cn.

Preprocessing

To preprocess the source data, run: python One2MultiSeq_dataprocess.py

Training

To train the model, run: python train_One2MultiSeq.py

After training, you can change model_name on line 707 to the path of the trained model (for example, model_name = 'models/temp_model/CMKP/CopyBART_One2MultiSeq_base_epochs-10_learning_rate-5e-05_batch_size-32_seed-100') and set is_train = False in train_One2MultiSeq.py.
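The example path above appears to encode the dataset name and the training hyperparameters. As a minimal sketch, assuming the directory name always follows the pattern of that single example, the helper below rebuilds such a path so it can be pasted into model_name. The function build_model_path and its parameters are hypothetical and are not part of the repository; check the actual folder created under models/ before relying on it.

```python
from pathlib import PurePosixPath


def build_model_path(dataset, epochs, learning_rate, batch_size, seed,
                     root="models/temp_model", variant="base"):
    """Rebuild a checkpoint path shaped like the README example.

    Assumption: the naming pattern is inferred from one example path only;
    the training script may name its output directory differently.
    """
    name = (
        f"CopyBART_One2MultiSeq_{variant}"
        f"_epochs-{epochs}"
        f"_learning_rate-{learning_rate}"
        f"_batch_size-{batch_size}"
        f"_seed-{seed}"
    )
    # PurePosixPath keeps forward slashes regardless of the host OS.
    return str(PurePosixPath(root) / dataset / name)


if __name__ == "__main__":
    # Values taken from the example path in the README.
    print(build_model_path("CMKP", 10, 5e-05, 32, 100))
```

The printed string can then be assigned to model_name before setting is_train = False.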

Note:

Please download and unzip the datasets in the ./data directory first.
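The unzip step can also be scripted. The sketch below uses only the Python standard library; the function name unzip_into_data is hypothetical, and the archive file name depends on what you actually downloaded.

```python
import zipfile
from pathlib import Path


def unzip_into_data(archive_path, data_dir="./data"):
    """Extract a downloaded dataset archive into the data directory.

    Returns the extracted file paths relative to data_dir, so the result
    can be checked before running the preprocessing script.
    """
    target = Path(data_dir)
    # Create ./data if it does not exist yet.
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as zf:
        zf.extractall(target)
    return sorted(
        p.relative_to(target).as_posix()
        for p in target.rglob("*")
        if p.is_file()
    )
```

For example, unzip_into_data("datasets.zip") would extract a file named datasets.zip (a placeholder name) into ./data and return the list of files now present there.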

Folders and files

data
models
On2MultiSeq_dataprocess.py
One2MultiSeq.py
One2Set.py
One2Set_dataprocess.py
README.md
seq2seq_trainer_.py
train_One2MultiSeq.py
train_one2set.py
