DeepSpeed
e97b4536 - Add llama3.2 vision autotp (#6577)

Commit
1 year ago
Add llama3.2 vision autotp (#6577)

Llama3.2-11b and Llama3.2-90b each include a vision model and a text model. The vision model and the text model have different num_kv_heads, so num_kv_heads needs to be set dynamically.

Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
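To illustrate what setting num_kv_heads dynamically could look like, here is a minimal sketch. It is not the code from this commit. The helper names (`resolve_num_kv_heads`, `kv_heads_per_rank`), the attribute names it probes, and the head counts are all assumptions chosen for illustration.

```python
from types import SimpleNamespace

# Hypothetical attribute names to probe; real model configs may differ.
KV_HEAD_KEYS = ("num_key_value_heads", "num_kv_heads", "num_attention_heads")


def resolve_num_kv_heads(sub_config):
    """Return the first KV-head count found on a sub-model config, or None."""
    for key in KV_HEAD_KEYS:
        value = getattr(sub_config, key, None)
        if value is not None:
            return value
    return None


def kv_heads_per_rank(sub_config, tp_size):
    """Split a sub-model's KV heads evenly across tensor-parallel ranks."""
    num_kv_heads = resolve_num_kv_heads(sub_config)
    if num_kv_heads is None:
        raise ValueError("no KV-head count found on this sub-model config")
    if num_kv_heads % tp_size != 0:
        raise ValueError(
            f"{num_kv_heads} KV heads cannot be split evenly across {tp_size} ranks"
        )
    return num_kv_heads // tp_size


# Illustrative values only, not taken from the actual Llama 3.2 configs.
config = SimpleNamespace(
    text_config=SimpleNamespace(num_key_value_heads=8, num_attention_heads=32),
    vision_config=SimpleNamespace(num_attention_heads=16),
)

# Resolve the count per sub-model instead of using one global value.
text_heads = kv_heads_per_rank(config.text_config, tp_size=2)
vision_heads = kv_heads_per_rank(config.vision_config, tp_size=2)
print(text_heads, vision_heads)
```

The point of the sketch is that the head count is looked up on whichever sub-model is being sharded, so the text and vision parts can each get their own split.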