Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
younesbelkada/transformers
Pull Requests
Commits
temp-2
19445-fix
adapters
add_accelerate_roberta
add_bart_accelerate
add_bert_accelerate
add_better_transformers
add_bloom_flax
add_bloom_for_qa
add_bt_pipeline
add_dpt_hybrid
add_dpt_hybrid_2
add_esm_accelerate
add_flan_t5_doc
add_m2m100_accelerate
add_owl_vit_accelerate
add_roberta_accelerate
add_switch_transformers
add_vit_accelerate
add_whisper_accelerate
add-awq-fused
add-badam
add-blip
add-blip2-accelerate
add-blip2-int8
add-bt-doc
add-cache-api
add-cogvlm
add-dpt-hybrid-support
add-dtype-fe
add-dtype-precision-processor
add-efficient-sam-clean
add-exllama-v2-depre
add-falcon-h1
add-flash-attn-2
add-galore-optimizer
add-gguf-convert-trainer
add-llava-sdpa
add-marian-mt-accelerate
add-mobile-sam
add-mp-tests
add-no-split-modules
add-owl-vit-accelerate-support
add-peft-pipeline-pox
add-pix2struct
add-pix2struct-2
add-pix2struct-pipeline
add-sgalgn
add-super-resolution-pipeline
add-torch-dtype
add-torch-fp18
add-trainer-error-8bit
add-umt5
benchmark
better-tf-examples
bigscience176b
bigscience176b-jz
bigscience176b-test-fused
bigscience-jz-176-ds
bit-remove-unused-func
blip-2-accelerate
blip-fix-tolerance
blip-support-training
blip-text-refactor-test
bloom-change-slow-test
bloom-docstring
bloom-enhance-doc
bloom-fix-attn-mask
bloom-fix-module-order
bloom-fix-sequential-order
bloom-fix-tokenizer
bloom-minor-fix-tests
bnb_add_custom_map
bnb_add_safety_checker
bnb_simplify_test
bnb-add-mem-efficient
bnb-fix-change-arg-name
bnb-fix-cls-bug
bnb-fix-doc
bnb-fix-slow-test
bnb-fix-small-details
bnb-warn-device
ci_tweaks
circular-reference-schedulers
cleaner-pt-tf-conversion
cli
cohere-diff
ctrl-tokenizer-standardize
cvt-fix-init
denim-capybara-22
distil-update
distill-bloom
documentation
dpt-flax-younes
enable-input-require-grad
experiments-static-cache
extract-cached-archives
fast-tokenizers
finish-cogvlm
fipt-doctest
fix_codegen_causal_mask
fix_head_masking
fix_pt_tf_loading
fix_redirected_downloads
fix_redirected_link
fix_redirections
fix_t5_fp16
fix-4bit-blip2
fix-2042
fix-awq-tests
fix-beit-nit
fix-bert-pipeline
fix-bit-fp16
fix-bli-doctest
fix-blip2-accelerate
fix-blip-doctest
fix-blip-text-docstring
fix-bloom-deepspeed
fix-bloom-pipeline-test
fix-bnb-error-message
fix-bnb-itemize
fix-bnb-peft
fix-bnb-serialization-2
fix-bnb-slow-test
fix-bnb-trainer-test
fix-correct-dtype-casting
fix-documentation-warnings
fix-esm-accelerate
fix-fa-2-bf16
fix-fix-copies-2
fix-fm-rms
fix-fp16-generate
fix-fsmt
fix-fsmt-compatibility
fix-fuyu-nit
fix-gc-opt
fix-gc-recent
fix-gc-use-reentrant
fix-geb-fp16
fix-generation-doc
fix-gpt2-finetuning-memory
fix-gpt-neo-multi-gpu
fix-gpt-neo-x
fix-hubert-doctest
fix-idefics-config
fix-import-2
fix-improt
fix-int8-conversion
fix-int8-docstrin
fix-int8-seqtoseq
fix-integrations
fix-japanese-readme-template
fix-last-test-vith
fix-mbart
fix-mistral-regex
fix-mistral-training-bug
fix-mobilenet-v2-auto
fix-module-fp32
fix-nllb-device
fix-opt-bias
fix-opt-nit
fix-peft-device
fix-perceiver-test
fix-slow-path
fix-slow-tests
fix-spaces-token-issue
fix-speech-doctest
fix-switch-slow
fix-t5-dtype
fix-tf-xlm
fix-tf-xlnet
fix-tfroberta
fix-tok-pipe
fix-training-quanti
fix-ved-doctest
fix-vision-config
fix-vit-hybrid-test
fix-xlm-roberta
flash-att-2-temp
flava-docs
flax-tp-inference
fp4-bnb
generation_sampler
hf-quantizer-work
ignored-index-coherence
image-fp16
improved-generation
improved-testing
int8-config
int8_with_accelerate
integration-8bit
integration-8bit-codegen-hotfix
main
main-temp
master
mbart-fix-jit-issue
mini-fix-hqq
mistral-static-kv-cache
modify_distill_bert_module_name
opt_branch/opt-350-m
opt-350
opt-fix-device
opt-fix-mask
opt-fix-softmax-debug
opt-flax-tf
patch-bit
peft-fix-multi-adapter
peft-integration
pipelines-issues
pix2struct-cross-attn
pix2struct-refactored-ip
plugin-v1
potential-fix-ewkv
question-answering
quickstart-model2model
rbert
refactor-workflows
roberta-no-token-types
roc_bert_fix
run_language_modeling
scripts-device-flag
serialize-8bit
serving_improvements
shape_list
skip-multimodal-pipeline
small_fix_realm
switch_backup
t5-parallelism
temp-2
tentative-pipeline-t5
test_bnb_training
test-static-kv-cache
test-younes-workflow
tf
tf2-extended
tf2-glue
tokenizers
tokenizers-v2
torchhub
tpu-experiments
trainer-model-parallel
trocr-fix-use-cache
uncompressed_storage
update-deps
upgrade-run-generation
vilt-accelerate-suuport
vit-fix-init-nit
vqgan
xnli
younes/patch-bit-3
[gptj] support older pytorch version (#22325)
stas00
committed
3 years ago
Verified
61f79b29
Really fix quality due to ruff release
sgugger
committed
3 years ago
Verified
80e3b363
Fix quality due to ruff release
sgugger
committed
3 years ago
ef28df05
[deepspeed zero3] need `generate(synced_gpus=True, ...)` (#22242)
stas00
committed
3 years ago
Verified
73fdc8c5
Fix PipelineTests skip conditions (#22320)
ydshieh
committed
3 years ago
Verified
8b05ace0
Chunkable token classification pipeline (#21771)
luccailliau
committed
3 years ago
Verified
d62e7d88
docs: Resolve incorrect type typo in trainer methods (#22316)
tomaarsen
committed
3 years ago
Verified
f48d3314
Add Pix2Struct (#21400)
younesbelkada
committed
3 years ago
Verified
0f68a7f4
Beef up Llama tests (#22314)
gante
committed
3 years ago
Verified
fd3eb3e3
Generate: Export TF generate with a TF tokenizer (#22310)
gante
committed
3 years ago
Verified
12febc20
Enforce `max_memory` for device_map strategies (#22311)
sgugger
committed
3 years ago
Verified
5fd4e3c8
Fixed bug to calculate correct xpath_sub_list in MarkupLMTokenizer (#22302)
silentghoul-spec
committed
3 years ago
Verified
48bef3a7
Fix position embeddings for GPT-J and CodeGen (#22069)
njhill
committed
3 years ago
Verified
4e94c6c0
fix: Allow only test_file in pytorch and flax summarization (#22293)
Connor Henderson
committed
3 years ago
Verified
8e6c34b3
add low_cpu_mem_usage option in run_clm.py example which will benefit… (#22288)
sywangyi
committed
3 years ago
Verified
4ccaf268
Enable traced model for text-generation task (#22265)
jiqing-feng
committed
3 years ago
Verified
8472a224
Add MaskedImageModelingOutput (#22212)
alaradirik
committed
3 years ago
Verified
0558914d
Final update of doctest (#22299)
ydshieh
committed
3 years ago
Verified
0dcb46e7
[deepspeed] offload + non-cpuadam optimizer exception doc (#22044)
stas00
committed
3 years ago
Verified
89a0a9ea
Correct NATTEN function signatures and force new version (#22298)
alihassanijr
committed
3 years ago
Verified
5990743f
Restore fp16 support on xla gpu device (#22300)
ymwangg
committed
3 years ago
Verified
d35f7296
Time to Say Goodbye, torch 1.7 and 1.8 (#22291)
ydshieh
committed
3 years ago
Verified
67c2dbdb
Add translation perf_infer_gpu_one for it (#22296)
davidegazze
committed
3 years ago
Verified
86c7931a
fix more doctests (#22292)
ydshieh
committed
3 years ago
Verified
d0b942d1
More doctests (#22268)
ydshieh
committed
3 years ago
Verified
48327c57
Fix error in mixed precision training of `TFCvtModel` (#22267)
gcuder
committed
3 years ago
Verified
5a2b77a6
replace_8bit_linear modules_to_not_convert default value fix (#22238)
Andrei Panferov
committed
3 years ago
Verified
330d8b99
Update vision docstring bool masked pos (#22237)
amyeroberts
committed
3 years ago
Verified
c07a02a4
Example of pad_to_multiple_of for padding and truncation guide & docstring update (#22278)
MKhalusova
committed
3 years ago
Verified
7bd86505
Move torch.compile() wrapping after DDP/FSDP wrapping to ensure correct graph breaks during training (#22279)
ani300
committed
3 years ago
Verified
fb0a38b4
Older