vllm
[BUG] fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8]
#7509
Merged

[BUG] fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8] #7509

comaniac merged 3 commits into vllm-project:main from learninmou:main
learninmou
github-actions
learninmou learninmou changed the title fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8] [BUG] fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8] 1 year ago
comaniac
comaniac commented on 2024-08-14
learninmou learninmou force pushed 1 year ago
learninmou
comaniac
comaniac approved these changes on 2024-08-15
learninmou
github-actions github-actions added ready
comaniac comaniac enabled auto-merge (squash) 1 year ago
JaheimLee
comaniac
lxgsbqylbk add utests for flashinfer fix
9d55ef1c
lxgsbqylbk rebase latest main branch
d2e83e4f
disabled auto-merge 1 year ago
Head branch was pushed to by a user without write access
learninmou learninmou force pushed to d2e83e4f 1 year ago
lxgsbqylbk fix syntax error
db3b91a7
learninmou
comaniac comaniac merged 53328d75 into main 1 year ago
elfiegg
comaniac
elfiegg
comaniac
yzh119
yzh119

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone