transformers
Add afmoe model
#42168
Merged
Commits
19
Add AFMoE model support
alyosha-swamy committed 219 days ago
Merge remote-tracking branch 'upstream/main' into add_afmoe_model
alyosha-swamy committed 219 days ago
Address review feedback for AFMoE implementation
alyosha-swamy committed 217 days ago
Add flex attention support to AFMoE model
alyosha-swamy committed 213 days ago
Merge branch 'main' into add_afmoe_model
ArthurZucker committed 212 days ago
Fix expert_bias routing in AFMoE
alyosha-swamy committed 212 days ago
Remove test-results directory
alyosha-swamy committed 210 days ago
Merge branch 'main' into add_afmoe_model
ArthurZucker committed 203 days ago
Address PR review feedback for AFMoE model
alyosha-swamy committed 203 days ago
fix(afmoe): ensure RMSNorm output dtype matches input dtype)
alyosha-swamy committed 203 days ago
properly return attn weights
ArthurZucker committed 202 days ago
fix most tests
ArthurZucker committed 202 days ago
cleanup
ArthurZucker committed 202 days ago
fix input embeds api
ArthurZucker committed 202 days ago
update rope API, smaller test and should be good to go
ArthurZucker committed 202 days ago
oups wront place to skip unittest
ArthurZucker committed 202 days ago
quality
ArthurZucker committed 202 days ago
update
ArthurZucker committed 202 days ago
rope parameter docstring fill
ArthurZucker committed 202 days ago