onnxruntime
[webgpu] Restore FP16 math in flash attention generation
#24994
Merged

[webgpu] Restore FP16 math in flash attention generation #24994

guschmue merged 1 commit into main from gqa_f16
qjia7
qjia7 [webgpu] Restore FP16 math in flash attention generation
ea6f7c98
qjia7 qjia7 requested a review from sushraja-msft sushraja-msft 1 year ago
qjia7 qjia7 requested a review from guschmue guschmue 1 year ago
qjia7 qjia7 requested a review from fs-eire fs-eire 1 year ago
guschmue
guschmue approved these changes on 2025-06-09
guschmue guschmue added ep:WebGPU
guschmue
guschmue
guschmue guschmue merged c5b48ae3 into main 1 year ago
guschmue guschmue deleted the gqa_f16 branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone