DeepSpeed
Fix the sequence-parallelism for the dense model architecture
#4530
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
8
Changes
View On
GitHub
Loading