transformers
cd9274d0
- [FlaxBert] Add ForCausalLM (#16995)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
3 years ago
[FlaxBert] Add ForCausalLM (#16995) * [FlaxBert] Add ForCausalLM * make style * fix output attentions * Add RobertaForCausalLM * remove comment * fix fx-to-pt model loading * remove comment * add modeling tests * add enc-dec model tests * add big_bird * add electra * make style * make repo-consitency * add to docs * remove roberta test * quality * amend cookiecutter * fix attention_mask bug in flax bert model tester * tighten pt-fx thresholds to 1e-5 * add 'copied from' statements * amend 'copied from' statements * amend 'copied from' statements * quality
References
#27720 - Add common processor tests
#32831 - [Docs] Update resources
#29969 - [SigLIP] Add fast tokenizer
#38622 - [AutoModelForMaskGeneration] Remove duplicate code
#33111 - [Backbone] Remove out_features everywhere
#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs
#19449 - [WIP] Fix weights initialization of several vision models
#16995 - [FlaxBert] Add ForCausalLM
Author
sanchit-gandhi
Parents
31616b8d
Files
24
docs/source/en/model_doc
bert.mdx
big_bird.mdx
electra.mdx
roberta.mdx
src/transformers
__init__.py
modeling_flax_outputs.py
models
auto
modeling_flax_auto.py
bert
__init__.py
modeling_flax_bert.py
big_bird
__init__.py
modeling_flax_big_bird.py
electra
__init__.py
modeling_flax_electra.py
roberta
__init__.py
modeling_flax_roberta.py
utils
dummy_flax_objects.py
templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}
modeling_flax_{{cookiecutter.lowercase_modelname}}.py
tests
bert
test_modeling_flax_bert.py
big_bird
test_modeling_flax_big_bird.py
electra
test_modeling_flax_electra.py
encoder_decoder
test_modeling_flax_encoder_decoder.py
roberta
test_modeling_flax_roberta.py
speech_encoder_decoder
test_modeling_flax_speech_encoder_decoder.py
utils
check_repo.py
Loading