text-generation-inference
Flash Transformers modeling backend support
#2913
Merged

Flash Transformers modeling backend support #2913

Narsil merged 33 commits into main from transformers-backend
Cyrilvallez
Cyrilvallez add transformers_flash
ade0f44a
Cyrilvallez inits
da222900
Cyrilvallez switch version to make it work
b3b07474
Cyrilvallez Update Makefile-flash-att-v2
738f0b0e
Cyrilvallez Update Makefile-flash-att-v2
a84ecf26
Cyrilvallez Update Makefile-flash-att-v2
372799a4
Cyrilvallez Update Makefile-flash-att-v2
a0035e66
Cyrilvallez Update Makefile-flash-att-v2
e69a384d
Cyrilvallez Update Makefile-flash-att-v2
3a636ed1
runnable version
649cb1f5
working
490ca0ef
Cyrilvallez push change
f843b62a
Cyrilvallez fix high dim
715b2d19
Cyrilvallez init
e93ab925
Cyrilvallez default
f4c60ca5
Cyrilvallez latest transformers changes
2e2631e0
Cyrilvallez revert
44b36793
Cyrilvallez simplify check
266377b3
Cyrilvallez remove flag
32488c1a
Cyrilvallez improve type hints + required args
ac62bd15
Cyrilvallez Update based on transformers PR
b03d7ae9
Cyrilvallez small fix
b40c8893
Cyrilvallez Remove Warpers for Processor
42ae6dea
Cyrilvallez fix compatibility version issue
f01014de
Narsil
Narsil commented on 2025-01-20
Narsil
Narsil commented on 2025-01-20
Narsil
Narsil commented on 2025-01-20
Cyrilvallez raise error if needed
2659b599
Cyrilvallez Simplify with monkey patch
a2fe8427
Cyrilvallez revert + style + minor improvements
6e0f37c0
Cyrilvallez update comment
52afdcc2
Cyrilvallez Cyrilvallez changed the title Transformers backend Flash Transformers modeling backend support 335 days ago
Cyrilvallez device check
9af3ea4b
Cyrilvallez move the import to avoid device issue
6d9c011f
Narsil
Narsil dismissed these changes on 2025-01-20
Cyrilvallez Update __init__.py
2ef3002c
Cyrilvallez Cyrilvallez dismissed their stale review via 2ef3002c 335 days ago
Cyrilvallez check for non-native models
70ada578
Cyrilvallez oupsi
0d9ec75f
Narsil
Narsil approved these changes on 2025-01-21
Narsil Narsil merged b980848a into main 334 days ago
Narsil Narsil deleted the transformers-backend branch 334 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone