transformers
[Phi] Extend implementation to use GQA/MQA.
#28163
Merged

[Phi] Extend implementation to use GQA/MQA. #28163

ArthurZucker merged 15 commits into huggingface:main from phi-integration-update
gugarosa
gugarosa chore(phi): Updates configuration_phi with missing keys.
e3d73a8e
gugarosa gugarosa marked this pull request as draft 2 years ago
gugarosa
ArthurZucker
ArthurZucker commented on 2023-12-20
susnato
gugarosa
ArthurZucker
ArthurZucker
susnato
ArthurZucker
ArthurZucker commented on 2024-01-02
susnato
gugarosa
gugarosa
ArthurZucker
ArthurZucker
ArthurZucker commented on 2024-01-04
gugarosa chore(phi): Adds first draft of combined modeling_phi.
3d3e9c30
gugarosa gugarosa marked this pull request as ready for review 2 years ago
gugarosa
gugarosa fix(phi): Fixes according to latest review.
8ed81465
gugarosa fix(phi): Removes pad_vocab_size_multiple to prevent inconsistencies.
61bccf65
ArthurZucker
gugarosa fix(phi): Fixes unit and integration tests.
89ccc4ff
gugarosa fix(phi): Ensures that everything works with microsoft/phi-1 for firs…
f06c600d
gugarosa fix(phi): Fixes output of docstring generation.
4322dd42
gugarosa
ArthurZucker
ArthurZucker approved these changes on 2024-01-08
ArthurZucker
gugarosa fix(phi): Fixes according to latest review.
c8137ca3
gugarosa fix(phi): Fixes according to latest review.
6b3055d1
gugarosa Merge branch 'main' into phi-integration-update
11398a01
gugarosa fix(tests): Re-enables Phi-1.5 test.
c46e0504
gugarosa
gugarosa
susnato
gugarosa
gugarosa fix(phi): Fixes attention overflow on PhiAttention (for Phi-2).
f04fc40c
ArthurZucker
gugarosa
ArthurZucker
ArthurZucker commented on 2024-01-09
younesbelkada
younesbelkada commented on 2024-01-10
gugarosa fix(phi): Improves how queries and keys are upcast.
14fe45d7
gugarosa Merge branch 'main' of github.com:gugarosa/transformers into phi-inte…
38d59b81
ArthurZucker
ArthurZucker commented on 2024-01-10
gugarosa fix(phi): Small updates on latest changes.
ce2ebb40
gugarosa
gugarosa
ArthurZucker ArthurZucker merged 55090585 into main 2 years ago
ArthurZucker
vince62s
susnato
vince62s
vince62s
susnato
ArthurZucker
vince62s
ArthurZucker
ArthurZucker
susnato
ArthurZucker
ArthurZucker
susnato
ArthurZucker
ArthurZucker
amyeroberts

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone