vllm
[BUG] fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8]
#7509
Merged
comaniac
merged 3 commits into
vllm-project:main
from
learninmou:main
learninmou
changed the title from
fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8]
to
[BUG] fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8]
1 year ago
comaniac
commented on 2024-08-14
learninmou
force pushed
1 year ago
comaniac
approved these changes on 2024-08-15
github-actions
added
ready
comaniac
enabled auto-merge (squash)
1 year ago
add utests for flashinfer fix
9d55ef1c
rebase latest main branch
d2e83e4f
disabled auto-merge
1 year ago
Head branch was pushed to by a user without write access
learninmou
force pushed
to
d2e83e4f
1 year ago
fix syntax error
db3b91a7
comaniac
merged
53328d75
into main
1 year ago
Reviewers
comaniac
Assignees
No one assigned
Labels
ready
Milestone
No milestone