vllm
7c734ee0 - [Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models. (#21364)

Commit
271 days ago
[Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models. (#21364) Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
Author
Parents
Loading