-
Notifications
You must be signed in to change notification settings - Fork 1k
Pull requests: huggingface/tokenizers
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix escape HTML characters in EncodingVisualizer output
#1937
opened Jan 29, 2026 by
OhashiReon
Loading…
Fix deprecated IPython.core.display usage in EncodingVisualizer
#1936
opened Jan 29, 2026 by
OhashiReon
Loading…
Add type hint, update to pyo3 0.27, add automatic type hint generator
#1928
opened Jan 12, 2026 by
ArthurZucker
Loading…
feat: add progress_format option for machine-readable JSON output
#1921
opened Dec 26, 2025 by
podarok
Loading…
6 tasks done
Use
unicode-normalization instead of unicode-normalization-alignments
#1912
opened Dec 14, 2025 by
IvanIsCoding
Loading…
Providing byte level offsets for effective alignment in Cross-Tokenizer On-Policy Distillation
Feature Request
#1880
opened Oct 30, 2025 by
JqzChandler
Loading…
Add a multithreaded tokenizer test and as well as 3.14 and 3.14t CI
#1864
opened Sep 12, 2025 by
ngoldbaum
Loading…
feat: allow BPETrainer to be seeded with a set of initial tokens
#1862
opened Sep 6, 2025 by
henrycharlesworth
Loading…
Fix unsigned integer underflow issue with truncation
#1859
opened Sep 1, 2025 by
maxdebayser
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.