Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
younesbelkada/transformers
Pull Requests
Commits
serialize-8bit
19445-fix
adapters
add_accelerate_roberta
add_bart_accelerate
add_bert_accelerate
add_better_transformers
add_bloom_flax
add_bloom_for_qa
add_bt_pipeline
add_dpt_hybrid
add_dpt_hybrid_2
add_esm_accelerate
add_flan_t5_doc
add_m2m100_accelerate
add_owl_vit_accelerate
add_roberta_accelerate
add_switch_transformers
add_vit_accelerate
add_whisper_accelerate
add-awq-fused
add-badam
add-blip
add-blip2-accelerate
add-blip2-int8
add-bt-doc
add-cache-api
add-cogvlm
add-dpt-hybrid-support
add-dtype-fe
add-dtype-precision-processor
add-efficient-sam-clean
add-exllama-v2-depre
add-falcon-h1
add-flash-attn-2
add-galore-optimizer
add-gguf-convert-trainer
add-llava-sdpa
add-marian-mt-accelerate
add-mobile-sam
add-mp-tests
add-no-split-modules
add-owl-vit-accelerate-support
add-peft-pipeline-pox
add-pix2struct
add-pix2struct-2
add-pix2struct-pipeline
add-sgalgn
add-super-resolution-pipeline
add-torch-dtype
add-torch-fp18
add-trainer-error-8bit
add-umt5
benchmark
better-tf-examples
bigscience176b
bigscience176b-jz
bigscience176b-test-fused
bigscience-jz-176-ds
bit-remove-unused-func
blip-2-accelerate
blip-fix-tolerance
blip-support-training
blip-text-refactor-test
bloom-change-slow-test
bloom-docstring
bloom-enhance-doc
bloom-fix-attn-mask
bloom-fix-module-order
bloom-fix-sequential-order
bloom-fix-tokenizer
bloom-minor-fix-tests
bnb_add_custom_map
bnb_add_safety_checker
bnb_simplify_test
bnb-add-mem-efficient
bnb-fix-change-arg-name
bnb-fix-cls-bug
bnb-fix-doc
bnb-fix-slow-test
bnb-fix-small-details
bnb-warn-device
ci_tweaks
circular-reference-schedulers
cleaner-pt-tf-conversion
cli
cohere-diff
ctrl-tokenizer-standardize
cvt-fix-init
denim-capybara-22
distil-update
distill-bloom
documentation
dpt-flax-younes
enable-input-require-grad
experiments-static-cache
extract-cached-archives
fast-tokenizers
finish-cogvlm
fipt-doctest
fix_codegen_causal_mask
fix_head_masking
fix_pt_tf_loading
fix_redirected_downloads
fix_redirected_link
fix_redirections
fix_t5_fp16
fix-4bit-blip2
fix-2042
fix-awq-tests
fix-beit-nit
fix-bert-pipeline
fix-bit-fp16
fix-bli-doctest
fix-blip2-accelerate
fix-blip-doctest
fix-blip-text-docstring
fix-bloom-deepspeed
fix-bloom-pipeline-test
fix-bnb-error-message
fix-bnb-itemize
fix-bnb-peft
fix-bnb-serialization-2
fix-bnb-slow-test
fix-bnb-trainer-test
fix-correct-dtype-casting
fix-documentation-warnings
fix-esm-accelerate
fix-fa-2-bf16
fix-fix-copies-2
fix-fm-rms
fix-fp16-generate
fix-fsmt
fix-fsmt-compatibility
fix-fuyu-nit
fix-gc-opt
fix-gc-recent
fix-gc-use-reentrant
fix-geb-fp16
fix-generation-doc
fix-gpt2-finetuning-memory
fix-gpt-neo-multi-gpu
fix-gpt-neo-x
fix-hubert-doctest
fix-idefics-config
fix-import-2
fix-improt
fix-int8-conversion
fix-int8-docstrin
fix-int8-seqtoseq
fix-integrations
fix-japanese-readme-template
fix-last-test-vith
fix-mbart
fix-mistral-regex
fix-mistral-training-bug
fix-mobilenet-v2-auto
fix-module-fp32
fix-nllb-device
fix-opt-bias
fix-opt-nit
fix-peft-device
fix-perceiver-test
fix-slow-path
fix-slow-tests
fix-spaces-token-issue
fix-speech-doctest
fix-switch-slow
fix-t5-dtype
fix-tf-xlm
fix-tf-xlnet
fix-tfroberta
fix-tok-pipe
fix-training-quanti
fix-ved-doctest
fix-vision-config
fix-vit-hybrid-test
fix-xlm-roberta
flash-att-2-temp
flava-docs
flax-tp-inference
fp4-bnb
generation_sampler
hf-quantizer-work
ignored-index-coherence
image-fp16
improved-generation
improved-testing
int8-config
int8_with_accelerate
integration-8bit
integration-8bit-codegen-hotfix
main
main-temp
master
mbart-fix-jit-issue
mini-fix-hqq
mistral-static-kv-cache
modify_distill_bert_module_name
opt_branch/opt-350-m
opt-350
opt-fix-device
opt-fix-mask
opt-fix-softmax-debug
opt-flax-tf
patch-bit
peft-fix-multi-adapter
peft-integration
pipelines-issues
pix2struct-cross-attn
pix2struct-refactored-ip
plugin-v1
potential-fix-ewkv
question-answering
quickstart-model2model
rbert
refactor-workflows
roberta-no-token-types
roc_bert_fix
run_language_modeling
scripts-device-flag
serialize-8bit
serving_improvements
shape_list
skip-multimodal-pipeline
small_fix_realm
switch_backup
t5-parallelism
temp-2
tentative-pipeline-t5
test_bnb_training
test-static-kv-cache
test-younes-workflow
tf
tf2-extended
tf2-glue
tokenizers
tokenizers-v2
torchhub
tpu-experiments
trainer-model-parallel
trocr-fix-use-cache
uncompressed_storage
update-deps
upgrade-run-generation
vilt-accelerate-suuport
vit-fix-init-nit
vqgan
xnli
younes/patch-bit-3
Apply suggestions from code review
younesbelkada
committed
3 years ago
Verified
d293c254
Update src/transformers/modeling_utils.py
younesbelkada
committed
3 years ago
Verified
48b7f8f4
protect import
younesbelkada
committed
3 years ago
fc1411eb
clarify doc
younesbelkada
committed
3 years ago
a59b6381
last warning
younesbelkada
committed
3 years ago
f03d36e8
address last comments
younesbelkada
committed
3 years ago
9042365e
Update src/transformers/utils/quantization_config.py
younesbelkada
committed
3 years ago
Verified
eda6d401
remove unused function
younesbelkada
committed
3 years ago
9fadcf78
few fixes
younesbelkada
committed
3 years ago
42167957
Merge remote-tracking branch 'upstream/main' into serialize-8bit
younesbelkada
committed
3 years ago
379b7f39
adapt from suggestions
younesbelkada
committed
3 years ago
897cde9f
Relax `eos_token_id < 0` checks in `generate()` from `ValueError` to warning (#22472)
lewtun
committed
3 years ago
Verified
da68fd69
(Re-)Enable Nightly + Past CI (#22393)
ydshieh
committed
3 years ago
Verified
0fe6c6bd
Docs fix: Multinomial sampling decoding needs "num_beams=1", since by default it is usually not 1. (#22473)
manueldeprada
committed
3 years ago
Verified
d5de578c
Llama: support for `max_position_embeddings` (#22471)
gante
committed
3 years ago
Verified
165dd6dc
[NLLB-MoE] `model_type` update for auto mapping (#22470)
ArthurZucker
committed
3 years ago
Verified
349e1242
Guard imports of PreTrainedTokenizerFast on is_tokenizers_available (#22285)
Roy Hvaara
committed
3 years ago
Verified
11426641
🚨🚨🚨 Fix ordering of height, width for BLIP image processor (#22466)
amyeroberts
committed
3 years ago
Verified
4d7a5b5b
Generate: basic token streaming (#22449)
gante
committed
3 years ago
Verified
228792a9
Skip flaky NLLB Moe test for now (#22463)
amyeroberts
committed
3 years ago
Verified
f0aeb1be
Rescale image back if it was scaled during PIL conversion (#22458)
amyeroberts
committed
3 years ago
Verified
154c6bb7
Move common properties to BackboneMixin (#21855)
amyeroberts
committed
3 years ago
Verified
c15f9375
Update: ignore padding support for TransfoXL training when n_clusters==0 (#22457)
StefanHeng
committed
3 years ago
Verified
cd73b9a8
Pin ruff (#22455)
sgugger
committed
3 years ago
Verified
2194943a
Update release instructions (#22454)
sgugger
committed
3 years ago
Verified
4c295a26
Avoid using personal HF token in CI (#22453)
ydshieh
committed
3 years ago
Verified
97440e9c
Update Neptune docs (#22452)
Sabine
committed
3 years ago
Verified
173193cc
Revert "Fix --bf16 option support for Neuron after PR #22300" (#22451)
jeffhataws
committed
3 years ago
Verified
5e89a435
[`Pix2Struct`] Fix slow test (#22448)
younesbelkada
committed
3 years ago
Verified
b844f8a9
Revert "Error (also in original) model, scaling only q matrix not qk.T dot product (qk.T/sqrt(dim_per_head))" (#22444)
sgugger
committed
3 years ago
Verified
55dae94c
Older