vllm
be0dcc29
- [XPU] remove q/k/v force contiguous for flash_attn (#40356)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
8 days ago
[XPU] remove q/k/v force contiguous for flash_attn (#40356) Signed-off-by: Yan Ma <yan.ma@intel.com> Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
References
#40356 - [XPU] remove q/k/v force contiguous for flash_attn
Author
yma11
Parents
e3b65a5b
Loading