Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
huggingface/datasets
Pull Requests
Commits
Open
Closed
Fix lock permission
#7361 opened 2025-01-07 04:15 by
cih9088
Fix remove_columns in the formatted case
#7358 opened 2025-01-06 15:44 by
lhoestq
Resolved for empty datafiles
#7314 opened 2024-12-09 15:47 by
sahillihas
[Audio Features - DO NOT MERGE] PoC for adding an offset+sliced reading to audio file.
#7312 opened 2024-12-08 10:27 by
TParcollet
refactor: remove unnecessary else
#7307 opened 2024-12-05 12:11 by
HarikrishnanBalagopal
Allow for variation in metadata file names as per issue #7123
#7283 opened 2024-11-08 00:44 by
egrace479
Feature proposal: Stacking, potentially heterogeneous, datasets
#7279 opened 2024-11-05 15:40 by
TimCares
Let soundfile directly read local audio files
#7278 opened 2024-11-04 17:41 by
fawazahmed0
fast array extraction
#7227 opened 2024-10-14 20:51 by
alex-hh
fallback to default feature casting in case custom features not available during dataset loading
#7224 opened 2024-10-12 16:13 by
alex-hh
Add with_rank to Dataset.from_generator
#7199 opened 2024-10-04 16:51 by
muthissar
fix grammar in fingerprint.py
#7176 opened 2024-09-26 16:13 by
jxmorris12
Do not consume unnecessary memory during sharding
#7136 opened 2024-09-04 19:26 by
janEbert
Fix data file module inference
#7132 opened 2024-08-29 13:48 by
HennerM
Add Arabic Docs to Datasets
#7094 opened 2024-08-07 21:53 by
AhmedAlmaghz
Fix export to JSON when dataset larger than batch size
#7039 opened 2024-07-11 06:52 by
albertvillanova
[`feat`] Move dataset card creation to method for easier overriding
#6988 opened 2024-06-20 10:47 by
tomaarsen
Support folder-based datasets with large metadata.jsonl
#6859 opened 2024-05-02 09:07 by
gbenson
Support downloading specific splits in `load_dataset`
#6832 opened 2024-04-23 12:32 by
mariosasko
Support PathLike input in save_to_disk / load_from_disk
#6828 opened 2024-04-23 09:42 by
lhoestq
Make Image cast storage faster
#6786 opened 2024-04-05 17:00 by
Modexus
Fix issue with case sensitivity when loading dataset from local cache
#6763 opened 2024-03-28 14:52 by
Sumsky21
Test disabling transformers containers in docs CI
#6757 opened 2024-03-25 17:16 by
Wauplin
3x Faster Text Preprocessing
#6711 opened 2024-03-03 19:03 by
ashvardanian
__add__ for Dataset, IterableDataset
#6694 opened 2024-02-26 01:46 by
oh-gnues-iohc
Update loading.mdx to include "jsonl" file loading.
#6647 opened 2024-02-07 16:18 by
mosheber
Run download_and_prepare if missing splits
#6639 opened 2024-02-02 10:36 by
lhoestq
add safety checks when using only part of dataset
#6601 opened 2024-01-18 16:16 by
benseddikismail
Fix for continuation behaviour on broken dataset archives due to starving download connections via HTTP-GET
#6380 opened 2023-11-02 17:28 by
RuntimeRacer
Add repo_id to DatasetInfo
#6268 opened 2023-09-29 10:24 by
lhoestq
Newer
Older