vllm
4a6b72c2
- [BugFix] Fix triton compile error in `kernel_unified_attention_2/3d` caused by attention sinks (#22368)
Commit
126 days ago
[BugFix] Fix triton compile error in `kernel_unified_attention_2/3d` caused by attention sinks (#22368)

Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
References
#22368 - [BugFix] Fix triton compile error in `kernel_unified_attention_2/3d` caused by attention sinks
Author
LucasWilkinson
Parents
b4b9813b