DeepSpeed
c00388a2 - Mixtral FastGen Support (#4828)

Commit
2 years ago
Mixtral FastGen Support (#4828)

Adds support for Mixtral with FastGen. Key features implemented:

1. Top-2 MoE support
2. Better support for RoPE thetas
3. The mistral model implementation

---------

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>
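
For readers unfamiliar with the first two features, here is a minimal pure-Python sketch of what top-2 MoE routing and a configurable RoPE theta generally mean. It is not the FastGen implementation, which this commit message does not describe. The function names `top2_route` and `rope_inv_freq` are hypothetical, and the sketch assumes the common formulation in which the two selected gate scores are renormalized with a softmax and the RoPE inverse frequencies are `theta ** (-2i / head_dim)`.

```python
import math


def top2_route(logits):
    """Pick the two highest-scoring experts for one token.

    Hypothetical helper: returns [(expert_index, weight), (expert_index, weight)],
    where the weights are a softmax over only the two selected logits.
    """
    if len(logits) < 2:
        raise ValueError("top-2 routing needs at least two experts")
    # Indices of the two largest gate logits, largest first.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:2]
    # Subtract the max before exponentiating for numerical stability.
    peak = max(logits[i] for i in top)
    exps = [math.exp(logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def rope_inv_freq(head_dim, theta=10000.0):
    """Inverse rotary frequencies for an even head dimension.

    Hypothetical helper: theta is the RoPE base; a larger base gives
    slower-rotating (lower-frequency) components.
    """
    if head_dim <= 0 or head_dim % 2 != 0:
        raise ValueError("head_dim must be a positive even number")
    return [theta ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]


if __name__ == "__main__":
    # Four experts; experts 2 and 3 have the highest gate logits.
    print(top2_route([0.0, 1.0, 3.0, 2.0]))
    # Same head dimension, two different bases (values chosen for illustration).
    print(rope_inv_freq(4, theta=10000.0))
    print(rope_inv_freq(4, theta=1000000.0))
```

In a full MoE layer, the token's output would be the weighted sum of the two selected experts' outputs using these weights. Exposing `theta` as a parameter rather than hard-coding a single base is one way to support models that ship with different RoPE thetas.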