CPSC8430DL_HW

Adarsha Neupane

Overview

This project implements a Sequence-to-Sequence (Seq2Seq) model with attention for machine translation.
Key enhancements include:

Scheduled Sampling during training for improved robustness.
Beam Search decoding during inference for better translation quality.

The final model outperformed the baseline BLEU score of 0.60 (provided in the homework instructions), achieving an average BLEU score of 0.6364 using the checkpoint ep199.pt.

Files

submit.sh – Shell script for training the model. Handles hyperparameters, checkpoint saving, and execution.
hw2_seq2seq.sh – Shell script for running inference on the test set.
model_seq2seq.py – Core Python implementation of the Seq2Seq model with attention, scheduled sampling, and beam search.

Training

To train the model:

bash submit.sh

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
hw2		hw2
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CPSC8430DL_HW

Adarsha Neupane

Overview

Files

Training

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CPSC8430DL_HW

Adarsha Neupane

Overview

Files

Training

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages