Phi-3 #30423

ArthurZucker merged 18 commits into huggingface:main from main
gugarosa
gugarosa chore(root): Initial commit of Phi-3 files.
c1e38b0b
gugarosa fix(root): Fixes Phi-3 missing on readme.
416eaa41
gugarosa fix(root): Ensures files are consistent.
e0b68151
gugarosa fix(phi3): Fixes unit tests.
912edf15
gugarosa fix(tests): Fixes style of phi-3 test file.
b62e6f3e
gugarosa
CoderCowMoo
gugarosa chore(tests): Adds integration tests for Phi-3.
508ec8ef
gugarosa marked this pull request as ready for review 1 year ago
ArthurZucker
gugarosa fix(phi3): Removes additional flash-attention usage, e.g., swiglu and…
56e6464f
gugarosa fix(phi3): Fixes incorrect docstrings.
9bc1f1f1
gugarosa fix(phi3): Fixes docstring typos.
92d83790
gugarosa
ArthurZucker
gugarosa
gugarosa fix(phi3): Adds support for Su and Yarn embeddings.
c442d064
gugarosa
ArthurZucker
ArthurZucker commented on 2024-04-23
gugarosa fix(phi3): Improves according to first batch of reviews.
d5aed89b
gugarosa fix(phi3): Uses up_states instead of y in Phi3MLP.
3a24a1d4
fakerybakery
gugarosa fix(phi3): Uses gemma rotary embedding to support torch.compile.
4cfa767d
gugarosa fix(phi3): Improves how rotary embedding classes are defined.
817fec7b
gugarosa
gugarosa fix(phi3): Fixes inv_freq not being re-computed for extended RoPE.
9427419d
ArthurZucker added single-model-run-slow
ArthurZucker
ArthurZucker approved these changes on 2024-04-24
LZHgrla
LZHgrla commented on 2024-04-24
ArthurZucker
HuggingFaceDocBuilderDev
gugarosa Merge remote-tracking branch 'upstream/main' into main
06cd06d2
gugarosa fix(phi3): Adds last suggestions to modeling file.
2abcd4de
gugarosa fix(phi3): Splits inv_freq calculation in two lines.
aeb6ae7e
gugarosa
ArthurZucker
ArthurZucker approved these changes on 2024-04-24
ydshieh
ydshieh
ydshieh
ydshieh
ArthurZucker
gugarosa
ArthurZucker merged c9693db2 into main 1 year ago
ArthurZucker
ydshieh
ydshieh
ydshieh
