transformers
add shared experts for upcoming Granite 4.0 language models
#35894
Merged

  • docs/source/en
    • _toctree.yml
    • index.md
    • model_doc
      • granitemoeshared.md
    • perf_infer_gpu_one.md
  • src/transformers
    • __init__.py
    • models
      • __init__.py
      • auto
        • configuration_auto.py
        • modeling_auto.py
      • granitemoeshared
        • __init__.py
        • configuration_granitemoeshared.py
        • modeling_granitemoeshared.py
        • modular_granitemoeshared.py
    • utils
      • dummy_pt_objects.py
  • tests/models/granitemoeshared
    • __init__.py
    • test_modeling_granitemoeshared.py
