vllm
7c734ee0
- [Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models. (#21364)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
271 days ago
[Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models. (#21364) Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
References
#21364 - [Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models.
Author
sighingnow
Parents
f59ec35b
Loading