Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
younesbelkada/transformers
Pull Requests
Commits
add-cache-api
19445-fix
adapters
add_accelerate_roberta
add_bart_accelerate
add_bert_accelerate
add_better_transformers
add_bloom_flax
add_bloom_for_qa
add_bt_pipeline
add_dpt_hybrid
add_dpt_hybrid_2
add_esm_accelerate
add_flan_t5_doc
add_m2m100_accelerate
add_owl_vit_accelerate
add_roberta_accelerate
add_switch_transformers
add_vit_accelerate
add_whisper_accelerate
add-awq-fused
add-badam
add-blip
add-blip2-accelerate
add-blip2-int8
add-bt-doc
add-cache-api
add-cogvlm
add-dpt-hybrid-support
add-dtype-fe
add-dtype-precision-processor
add-efficient-sam-clean
add-exllama-v2-depre
add-falcon-h1
add-flash-attn-2
add-galore-optimizer
add-gguf-convert-trainer
add-llava-sdpa
add-marian-mt-accelerate
add-mobile-sam
add-mp-tests
add-no-split-modules
add-owl-vit-accelerate-support
add-peft-pipeline-pox
add-pix2struct
add-pix2struct-2
add-pix2struct-pipeline
add-sgalgn
add-super-resolution-pipeline
add-torch-dtype
add-torch-fp18
add-trainer-error-8bit
add-umt5
benchmark
better-tf-examples
bigscience176b
bigscience176b-jz
bigscience176b-test-fused
bigscience-jz-176-ds
bit-remove-unused-func
blip-2-accelerate
blip-fix-tolerance
blip-support-training
blip-text-refactor-test
bloom-change-slow-test
bloom-docstring
bloom-enhance-doc
bloom-fix-attn-mask
bloom-fix-module-order
bloom-fix-sequential-order
bloom-fix-tokenizer
bloom-minor-fix-tests
bnb_add_custom_map
bnb_add_safety_checker
bnb_simplify_test
bnb-add-mem-efficient
bnb-fix-change-arg-name
bnb-fix-cls-bug
bnb-fix-doc
bnb-fix-slow-test
bnb-fix-small-details
bnb-warn-device
ci_tweaks
circular-reference-schedulers
cleaner-pt-tf-conversion
cli
cohere-diff
ctrl-tokenizer-standardize
cvt-fix-init
denim-capybara-22
distil-update
distill-bloom
documentation
dpt-flax-younes
enable-input-require-grad
experiments-static-cache
extract-cached-archives
fast-tokenizers
finish-cogvlm
fipt-doctest
fix_codegen_causal_mask
fix_head_masking
fix_pt_tf_loading
fix_redirected_downloads
fix_redirected_link
fix_redirections
fix_t5_fp16
fix-4bit-blip2
fix-2042
fix-awq-tests
fix-beit-nit
fix-bert-pipeline
fix-bit-fp16
fix-bli-doctest
fix-blip2-accelerate
fix-blip-doctest
fix-blip-text-docstring
fix-bloom-deepspeed
fix-bloom-pipeline-test
fix-bnb-error-message
fix-bnb-itemize
fix-bnb-peft
fix-bnb-serialization-2
fix-bnb-slow-test
fix-bnb-trainer-test
fix-correct-dtype-casting
fix-documentation-warnings
fix-esm-accelerate
fix-fa-2-bf16
fix-fix-copies-2
fix-fm-rms
fix-fp16-generate
fix-fsmt
fix-fsmt-compatibility
fix-fuyu-nit
fix-gc-opt
fix-gc-recent
fix-gc-use-reentrant
fix-geb-fp16
fix-generation-doc
fix-gpt2-finetuning-memory
fix-gpt-neo-multi-gpu
fix-gpt-neo-x
fix-hubert-doctest
fix-idefics-config
fix-import-2
fix-improt
fix-int8-conversion
fix-int8-docstrin
fix-int8-seqtoseq
fix-integrations
fix-japanese-readme-template
fix-last-test-vith
fix-mbart
fix-mistral-regex
fix-mistral-training-bug
fix-mobilenet-v2-auto
fix-module-fp32
fix-nllb-device
fix-opt-bias
fix-opt-nit
fix-peft-device
fix-perceiver-test
fix-slow-path
fix-slow-tests
fix-spaces-token-issue
fix-speech-doctest
fix-switch-slow
fix-t5-dtype
fix-tf-xlm
fix-tf-xlnet
fix-tfroberta
fix-tok-pipe
fix-training-quanti
fix-ved-doctest
fix-vision-config
fix-vit-hybrid-test
fix-xlm-roberta
flash-att-2-temp
flava-docs
flax-tp-inference
fp4-bnb
generation_sampler
hf-quantizer-work
ignored-index-coherence
image-fp16
improved-generation
improved-testing
int8-config
int8_with_accelerate
integration-8bit
integration-8bit-codegen-hotfix
main
main-temp
master
mbart-fix-jit-issue
mini-fix-hqq
mistral-static-kv-cache
modify_distill_bert_module_name
opt_branch/opt-350-m
opt-350
opt-fix-device
opt-fix-mask
opt-fix-softmax-debug
opt-flax-tf
patch-bit
peft-fix-multi-adapter
peft-integration
pipelines-issues
pix2struct-cross-attn
pix2struct-refactored-ip
plugin-v1
potential-fix-ewkv
question-answering
quickstart-model2model
rbert
refactor-workflows
roberta-no-token-types
roc_bert_fix
run_language_modeling
scripts-device-flag
serialize-8bit
serving_improvements
shape_list
skip-multimodal-pipeline
small_fix_realm
switch_backup
t5-parallelism
temp-2
tentative-pipeline-t5
test_bnb_training
test-static-kv-cache
test-younes-workflow
tf
tf2-extended
tf2-glue
tokenizers
tokenizers-v2
torchhub
tpu-experiments
trainer-model-parallel
trocr-fix-use-cache
uncompressed_storage
update-deps
upgrade-run-generation
vilt-accelerate-suuport
vit-fix-init-nit
vqgan
xnli
younes/patch-bit-3
Add cuda_custom_kernel in DETA (#28989)
SangbumChoi
committed
2 years ago
Verified
83e96dc0
add v1
younesbelkada
committed
2 years ago
1430ca63
Fix static generation when compiling! (#28937)
ArthurZucker
committed
2 years ago
Verified
f3788b09
[`CLeanup`] Revert SDPA attention changes that got in the static kv cache PR (#29027)
ArthurZucker
committed
2 years ago
Verified
609a1767
FIX [`Trainer` / tags]: Fix trainer + tags when users do not pass `"tags"` to `trainer.push_to_hub()` (#29009)
younesbelkada
committed
2 years ago
Verified
7a0fccc6
[TPU] Support PyTorch/XLA FSDP via SPMD (#28949)
alanwaketan
committed
2 years ago
Verified
5f06053d
Backbone kwargs in config (#28784)
amyeroberts
committed
2 years ago
Verified
0199a484
Add tie_weights() to LM heads and set bias in set_output_embeddings() (#28948)
hackyon
committed
2 years ago
Verified
725f4ad1
Mask Generation Task Guide (#28897)
merveenoyan
committed
2 years ago
Verified
3f4e79d2
Fix flaky test vision encoder-decoder generate (#28923)
zucchini-nlp
committed
2 years ago
Verified
354775bc
Introduce AcceleratorConfig dataclass (#28664)
muellerzr
committed
2 years ago
Verified
0507e69d
Set the dataset format used by `test_trainer` to float32 (#28920)
ji-huazhong
committed
2 years ago
Verified
69ca640d
[`Doc`] Fix docbuilder - make `BackboneMixin` and `BackboneConfigMixin` importable from `utils`. (#29002)
amyeroberts
committed
2 years ago
Verified
7252e8d9
AQLM quantizer support (#28928)
Andrei Panferov
committed
2 years ago
Verified
1ecf5f7c
Add SiglipForImageClassification and CLIPForImageClassification (#28952)
NielsRogge
committed
2 years ago
Verified
63ffd56d
Add `StableLM` (#28810)
jon-tow
committed
2 years ago
Verified
de6029a0
ENH [`AutoQuantizer`]: enhance trainer + not supported quant methods (#28991)
younesbelkada
committed
2 years ago
Verified
164bdef8
ENH: Do not pass warning message in case `quantization_config` is in config but not passed as an arg (#28988)
younesbelkada
committed
2 years ago
Verified
1d12b8bc
[`DETR`] Update the processing to adapt masks & bboxes to reflect padding (#28363)
amyeroberts
committed
2 years ago
Verified
bd4b83e1
Update configuration_llama.py: fixed broken link (#28946)
AdityaKane2001
committed
2 years ago
Verified
3de6a6b4
Static Cache: load models with MQA or GQA (#28975)
gante
committed
2 years ago
Verified
3e70a207
Add sudachi_projection option to BertJapaneseTokenizer (#28503)
hiroshi-matsuda-rit
committed
2 years ago
Verified
da20209d
[`NllbTokenizer`] refactor with added tokens decoder (#27717)
ArthurZucker
committed
2 years ago
Verified
b4456753
[i18n-de] Translate CONTRIBUTING.md to German (#28954)
khipp
committed
2 years ago
Verified
d90acc16
[Docs] Add video section (#28958)
NielsRogge
committed
2 years ago
Verified
78ba9f46
[Docs] Add language identifiers to fenced code blocks (#28955)
khipp
committed
2 years ago
Verified
fe3df9d5
Clean up staging tmp checkpoint directory (#28848)
woshiyyya
committed
2 years ago
Verified
c617f988
Always initialize tied output_embeddings if it has a bias term (#28947)
hackyon
committed
2 years ago
Verified
136cd893
Updated requirements for image-classification samples: datasets>=2.14.0 (#28974)
alekseyfa
committed
2 years ago
Verified
792819f6
Tests: tag `test_save_load_fast_init_from_base` as flaky (#28930)
gante
committed
2 years ago
Verified
e30bbb26
Newer
Older