vllm
[Gemma4] Allow per-layer attention backend selection for heterogeneou…
#38891
Open

CunXin1
github-actions
gemini-code-assist commented on 2026-04-03
CunXin1 [Gemma4] Allow per-layer attention backend for heterogeneous head dims
7cece22c
CunXin1 force pushed from 44e38ee1 to 7cece22c 19 hours ago
CunXin1
HelloWorldU commented on 2026-04-03
janreges
LucasWilkinson


Assignees
No one assigned