This repository was archived by the owner on Mar 31, 2025. It is now read-only.
Closed
Conversation
86dba46 to
94a1ad5
Compare
94a1ad5 to
e0cb3f4
Compare
e0cb3f4 to
48fbf3c
Compare
48fbf3c to
b98bbdf
Compare
b98bbdf to
c448d9b
Compare
c448d9b to
5092021
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
3.1.0->3.5.0Release Notes
huggingface/datasets (datasets)
v3.5.0Compare Source
Datasets Features
What's Changed
New Contributors
Full Changelog: huggingface/datasets@3.4.1...3.5.0
v3.4.1Compare Source
Bug Fixes
Full Changelog: huggingface/datasets@3.4.0...3.4.1
v3.4.0Compare Source
Dataset Features
Faster folder based builder + parquet support + allow repeated media + use torchvideo by @lhoestq in https://github.com/huggingface/datasets/pull/7424
decordwithtorchvisionto read videos, sincedecordis not maintained anymore and isn't available for recent python versions, see the video dataset loading documentation here for more details. TheVideotype is still marked as experimental is this versionmetadata.parquetin addition tometadata.csvormetadata.jsonlfor the metadata of the image/audio/video filesAdd IterableDataset.decode with multithreading by @lhoestq in https://github.com/huggingface/datasets/pull/7450
Add with_split to DatasetDict.map by @jp1924 in https://github.com/huggingface/datasets/pull/7368
General improvements and bug fixes
string_to_dictto returnNoneif there is no match instead of raisingValueErrorby @ringohoffman in https://github.com/huggingface/datasets/pull/7435ds.set_epoch(new_epoch)by @lhoestq in https://github.com/huggingface/datasets/pull/7451New Contributors
Full Changelog: huggingface/datasets@3.3.2...3.4.0
v3.3.2Compare Source
Bug fixes
Other general improvements
New Contributors
Full Changelog: huggingface/datasets@3.3.1...3.3.2
v3.3.1Compare Source
Bug fixes
Full Changelog: huggingface/datasets@3.3.0...3.3.1
v3.3.0Compare Source
Dataset Features
Support async functions in map() by @lhoestq in https://github.com/huggingface/datasets/pull/7384
Add repeat method to datasets by @alex-hh in https://github.com/huggingface/datasets/pull/7198
Support faster processing using pandas or polars functions in
IterableDataset.map()by @lhoestq in https://github.com/huggingface/datasets/pull/7370Apply formatting after iter_arrow to speed up format -> map, filter for iterable datasets by @alex-hh in https://github.com/huggingface/datasets/pull/7207
What's Changed
New Contributors
Full Changelog: huggingface/datasets@3.2.0...3.3.0
v3.2.0Compare Source
Dataset Features
Other improvements and bug fixes
ClassLabelby @sergiopaniego in https://github.com/huggingface/datasets/pull/7293New Contributors
Full Changelog: huggingface/datasets@3.1.0...3.2.0
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.