Mixtral FastGen Support #4828
Kernel changes, cleanup still necessary
72212feb
Add explicit theta to rotary embeddings
62fdf964
Mixtral model implementation
20823e5b
Commit the unsaved files
0e7d4e0d
Minor fixes
6b1761a0
Missing engine factor, rope theta fixes
28ab6441
MoE type mismatches
d28c327b
Misnamed mapping
c1e90a37
Clear output
880417e8
mrwyattii
approved these changes
on 2023-12-19
Fix unit test
77b871d5
Fix unit tests
a383b67e
Clean up top_k support in the C++ code
0255d6bd
Merge branch 'master' into cholmes/mixtral-fastgen-support
e67e15af
mrwyattii
merged
c00388a2
into master 2 years ago
mrwyattii
deleted the cholmes/mixtral-fastgen-support branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub