Reduce re-loading huggingface tokenizers and models by laurejt · Pull Request #35 · Princeton-CDH/muse

laurejt · 2026-02-20T20:50:40Z

Associated Issue(s): None

Changes in this PR

Added a workaround to translate.py so that consecutive translation calls with the same model load that model and its tokenizer only once.
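For readers without the diff at hand, here is a minimal sketch of the "reuse the last loaded model" idea described above. It is not the code from translate.py: the `LastModelCache` class, the `get` method, and the `fake_loader` stand-in are hypothetical names chosen for illustration. In real use the loader would presumably wrap the `transformers` `from_pretrained` calls, but which classes translate.py actually uses is not shown in this PR text.

```python
from typing import Any, Callable, Optional, Tuple

# A loader takes a model name and returns a (tokenizer, model) pair.
Loader = Callable[[str], Tuple[Any, Any]]


class LastModelCache:
    """Remember only the most recently loaded (tokenizer, model) pair.

    Hypothetical helper: consecutive requests for the same model name
    reuse the stored pair; a different name replaces it.
    """

    def __init__(self, loader: Loader) -> None:
        self._loader = loader
        self._name: Optional[str] = None
        self._pair: Optional[Tuple[Any, Any]] = None

    def get(self, model_name: str) -> Tuple[Any, Any]:
        # Reload only when the requested model differs from the last one.
        if self._pair is None or model_name != self._name:
            self._pair = self._loader(model_name)
            self._name = model_name
        return self._pair


# Stand-in loader that records each call instead of contacting HuggingFace.
load_calls = []


def fake_loader(model_name: str) -> Tuple[str, str]:
    load_calls.append(model_name)
    return (f"tokenizer:{model_name}", f"model:{model_name}")


cache = LastModelCache(fake_loader)
for name in ["nllb", "nllb", "madlad", "madlad", "nllb"]:
    cache.get(name)

print(load_calls)  # ['nllb', 'madlad', 'nllb']
```

Because only the last pair is kept, switching back to an earlier model triggers a fresh load. That keeps memory use bounded to one model at a time, at the cost of reloads when calls alternate between models.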

Notes

This workaround is not ideal, but it should be fine in the short term, i.e., for this experimental phase.

Reviewer Checklist

Check that translate_corpus.py runs successfully for HuggingFace models (i.e., hymt, madlad, nllb)
Confirm that translate_corpus.py only sends HTTP requests to HuggingFace for the first translation. Use --verbose to observe this behavior.

tanhaow

🚀

Add reuse last loaded HuggingFace model/tokenizer

ce0e489

laurejt requested a review from tanhaow February 20, 2026 20:50

laurejt self-assigned this Feb 20, 2026

laurejt changed the title ~~Add reuse last loaded HuggingFace model/tokenizer~~ Reduce re-loading huggingface tokenizers and models Feb 20, 2026

tanhaow approved these changes Feb 20, 2026

laurejt merged commit e36059c into develop Feb 20, 2026
1 check passed

laurejt deleted the feature/speed-up-translation branch February 20, 2026 21:08

laurejt added the 👇this sprint Add Issue to ZenHub label Feb 20, 2026

laurejt commented Feb 20, 2026 • edited by tanhaow
