Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
huggingface/datasets
Pull Requests
Commits
Open
Closed
Add columns support to JSON loader
#7754 opened 2025-09-04 18:21 by
ArjunJagdale
Refactor: use unpacking in load.py for time and memory improvement
#7750 opened 2025-08-26 22:13 by
brchristian
docs: Streaming best practices
#7748 opened 2025-08-23 00:18 by
Abdul-Omira
Add wikipedia-2023-redirects dataset
#7747 opened 2025-08-22 23:49 by
Abdul-Omira
Reimplemented partial split download support (revival of #6832)
#7706 opened 2025-07-28 19:40 by
ArjunJagdale
fix del tqdm lock error
#7661 opened 2025-07-01 02:04 by
Hypothesis-Z
feat: add subset_name as alias for name in load_dataset
#7657 opened 2025-06-29 10:39 by
ArjunJagdale
fix(iterable): ensure MappedExamplesIterable supports state_dict for resume
#7656 opened 2025-06-29 07:50 by
ArjunJagdale
Added specific use cases in Improve Performace
#7655 opened 2025-06-28 19:00 by
ArjunJagdale
fix(load): strip deprecated use_auth_token from config_kwargs
#7654 opened 2025-06-28 09:20 by
ArjunJagdale
feat(load): fallback to `load_from_disk()` when loading a saved dataset directory
#7653 opened 2025-06-28 08:47 by
ArjunJagdale
fix: Extended metadata file names for folder_based_builder
#7651 opened 2025-06-27 13:12 by
iPieter
Introduces automatic subset-level grouping for folder-based dataset builders #7066
#7646 opened 2025-06-26 07:01 by
ArjunJagdale
`ClassLabel` docs: Correct value for unknown labels
#7645 opened 2025-06-25 20:01 by
l-uuz
Add ignore_decode_errors option to Image feature for robust decoding #7612
#7638 opened 2025-06-24 16:47 by
ArjunJagdale
Fix: Preserve float columns in JSON loader when values are integer-like (e.g. 0.0, 1.0)
#7635 opened 2025-06-24 06:16 by
ArjunJagdale
Pass user-agent from DownloadConfig into fsspec storage_options
#7631 opened 2025-06-21 14:22 by
ArjunJagdale
Add test for `as_iterable_dataset()` method in DatasetBuilder
#7629 opened 2025-06-19 19:23 by
ArjunJagdale
Add `as_iterable_dataset()` method to DatasetBuilder for streaming from cached Arrow files
#7628 opened 2025-06-19 19:15 by
ArjunJagdale
feat: Add h5folder dataset loader for HDF5 support
#7625 opened 2025-06-19 05:39 by
ArjunJagdale
fix: raise error when folder-based datasets are loaded without data_dir or data_files
#7618 opened 2025-06-16 07:43 by
ArjunJagdale
Enhance error handling and input validation across multiple modules
#7602 opened 2025-06-08 23:01 by
mohiuddin-khan-shiam
Change dill version in requirements
#7535 opened 2025-04-24 19:44 by
JGrel
(refactor) remove redundant logic in _check_valid_index_key
#7490 opened 2025-03-30 11:45 by
suzyahyah
fix: loading of datasets from Disk(#7373)
#7489 opened 2025-03-29 16:22 by
sam-hey
Adds EXR format to store depth images in float32
#7463 opened 2025-03-17 17:42 by
ducha-aiki
Use pyupgrade --py39-plus for remaining files
#7437 opened 2025-03-06 02:12 by
cyyever
Improved type annotation
#7429 opened 2025-02-28 10:39 by
saiden89
Make IterableDataset (optionally) resumable
#7385 opened 2025-02-04 15:55 by
yzhangcs
Fix lock permission
#7361 opened 2025-01-07 04:15 by
cih9088
Newer
Older