transformers
cd9274d0 - [FlaxBert] Add ForCausalLM (#16995)

Commit

3 years ago

[FlaxBert] Add ForCausalLM (#16995)

* [FlaxBert] Add ForCausalLM
* make style
* fix output attentions
* Add RobertaForCausalLM
* remove comment
* fix fx-to-pt model loading
* remove comment
* add modeling tests
* add enc-dec model tests
* add big_bird
* add electra
* make style
* make repo-consistency
* add to docs
* remove roberta test
* quality
* amend cookiecutter
* fix attention_mask bug in flax bert model tester
* tighten pt-fx thresholds to 1e-5
* add 'copied from' statements
* amend 'copied from' statements
* amend 'copied from' statements
* quality
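The "tighten pt-fx thresholds to 1e-5" item refers to how closely the PyTorch and Flax outputs must agree in the equivalence tests. The following is a minimal sketch of that kind of check, not the test code from this commit: the helper names `max_abs_diff` and `outputs_match` are hypothetical, and the arrays are random stand-ins rather than real model outputs.

```python
import numpy as np


def max_abs_diff(a, b):
    """Return the largest element-wise absolute difference between two arrays."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    if a.shape != b.shape:
        raise ValueError(f"shape mismatch: {a.shape} vs {b.shape}")
    return float(np.max(np.abs(a - b)))


def outputs_match(a, b, tol=1e-5):
    """True when the two outputs agree to within `tol` (1e-5, as in the commit message)."""
    return max_abs_diff(a, b) <= tol


# Stand-in data (assumption): random arrays shaped like (batch, sequence, vocab) logits.
rng = np.random.default_rng(0)
pt_logits = rng.standard_normal((2, 4, 8))
fx_logits = pt_logits + 1e-6   # small numerical drift, within tolerance
bad_logits = pt_logits + 1e-3  # drift well above the 1e-5 threshold

print(outputs_match(pt_logits, fx_logits))
print(outputs_match(pt_logits, bad_logits))
```

Checking the maximum absolute difference, rather than the mean, means a single badly mismatched element is enough to fail the comparison, which is the stricter behaviour a tightened threshold is meant to enforce.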

References

#27720 - Add common processor tests

#32831 - [Docs] Update resources

#29969 - [SigLIP] Add fast tokenizer

#38622 - [AutoModelForMaskGeneration] Remove duplicate code

#33111 - [Backbone] Remove out_features everywhere

#33174 - [Zero-shot image classification pipeline] Remove tokenizer_kwargs

#19449 - [WIP] Fix weights initialization of several vision models

#16995 - [FlaxBert] Add ForCausalLM

Author

sanchit-gandhi

Parents

31616b8d

Files (24)

docs/source/en/model_doc
- bert.mdx
- big_bird.mdx
- electra.mdx
- roberta.mdx
src/transformers
- __init__.py
- modeling_flax_outputs.py
- models
  - auto
    - modeling_flax_auto.py
  - bert
    - __init__.py
    - modeling_flax_bert.py
  - big_bird
    - __init__.py
    - modeling_flax_big_bird.py
  - electra
    - __init__.py
    - modeling_flax_electra.py
  - roberta
    - __init__.py
    - modeling_flax_roberta.py
- utils
  - dummy_flax_objects.py
templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}
- modeling_flax_{{cookiecutter.lowercase_modelname}}.py
tests
- bert
  - test_modeling_flax_bert.py
- big_bird
  - test_modeling_flax_big_bird.py
- electra
  - test_modeling_flax_electra.py
- encoder_decoder
  - test_modeling_flax_encoder_decoder.py
- roberta
  - test_modeling_flax_roberta.py
- speech_encoder_decoder
  - test_modeling_flax_speech_encoder_decoder.py
utils
- check_repo.py
