huggingface/transformers
Pull Requests
Fix CI read-only cache failures by patching cached_files in conftest
#47043 opened 2026-07-03 15:26 by
ydshieh
Fix GenerationConfig continuous batching serialization
#47038 opened 2026-07-03 09:46 by
VectorPeak
Torch compile backend defaults to "neuron"
#47035 opened 2026-07-02 23:07 by
michaelbenayoun
Neuron sft exp
#47034 opened 2026-07-02 23:01 by
michaelbenayoun
Fix image-text-to-text stop_sequence handling
#47032 opened 2026-07-02 22:17 by
Sunt-ing
Skip caching_allocator_warmup on Neuron (no reuse pool to warm; currently OOMs)
#47029 opened 2026-07-02 17:59 by
dacorvo
Fix xLSTM train mode crash when return_last_states=False
#47026 opened 2026-07-02 14:13 by
lcheng321
Diffusion gemma: fix failed test cases
#47025 opened 2026-07-02 14:07 by
kaixuanliu
Bump the actions group across 1 directory with 8 updates
dependencies
github_actions
#47023 opened 2026-07-02 13:21 by
dependabot[bot]
fix mask return-type contract regression and add correctness guard for
#47019 opened 2026-07-02 08:36 by
kaixuanliu
Fix save_pretrained with offloading and weight conversions
#47018 opened 2026-07-02 08:14 by
Cyrilvallez
Route byte-level llama tokenizers to TokenizersBackend
#47017 opened 2026-07-02 05:11 by
subin9
Fix RTDetrHungarianMatcher crash on infinite cost matrix
#47016 opened 2026-07-02 04:55 by
AyushDas4890
🚨 Enable SDPA (and other attention backends) for T5 and propagate to the T5 family
#47014 opened 2026-07-02 01:54 by
jiqing-feng
Add max_groups input to nightly Serge triage caller (single-task test mode)
#47012 opened 2026-07-01 19:56 by
tarekziade
Add linear_attn entries to Qwen3.5 base_model_tp_plan
#47009 opened 2026-07-01 18:24 by
ZAID646
[serge] Fix 10 integration tests for model `gemma3` failing with `output_mismatch` (list output differs (8), other (2))
#47006 opened 2026-07-01 15:41 by
sergereview[bot]
OpenVINO HF Exporter
#47003 opened 2026-07-01 14:08 by
IlyasMoutawwakil
Add after-load fusion for static quantized MLPs
#46997 opened 2026-07-01 10:41 by
LiangSu8899
Add MLU support to is_flash_linear_attention_available
#46995 opened 2026-07-01 09:25 by
atri2549
Raise a clear error for empty token lists in bad_words_ids / sequence_bias
#46994 opened 2026-07-01 06:58 by
Sunt-ing
Fix prompt lookup decoding generating past max_length
#46993 opened 2026-07-01 06:42 by
Sunt-ing
FSDP orchestration: apply + loading/saving
#46990 opened 2026-07-01 05:09 by
3outeille
[serge] Fix 10 integration tests for model `glm_ocr` failing with `output_mismatch` (list output differs (10))
#46986 opened 2026-07-01 00:55 by
sergereview[bot]
Fix typo in `MusicgenForCausalLM.generate()`
#46974 opened 2026-06-30 04:55 by
jiqing-feng
[serge] Fix 8 integration tests for model `mamba2` failing with `OOM` (other (8))
#46971 opened 2026-06-30 00:44 by
sergereview[bot]
[serge] Fix 16 integration tests for model `musicgen_melody` failing with `output_mismatch` (tensor values differ (16))
#46970 opened 2026-06-30 00:38 by
sergereview[bot]
[docs] fix autolinks
#46968 opened 2026-06-29 22:55 by
stevhliu
Add Gemma4Unified sequence classification
#46966 opened 2026-06-29 19:04 by
yuvrajsharma9981
[Model] Support `use_cache=False` for DeepSeek V4
#46965 opened 2026-06-29 18:11 by
kylesayrs