Fix unused variable for CUDA EP builds with USE_FLASH_ATTENTION off #14404
Fix unused variable in bert/attention_impl.cu for builds with USE_FLA…
ffef05f5
adrianlizarraga
changed the title Fix unused variable for CUDA EP builds with USE_FLASH_ATTENTION set to OFF Fix unused variable for CUDA EP builds with USE_FLASH_ATTENTION off 3 years ago
tianleiwu
approved these changes
on 2023-01-23
adrianlizarraga
deleted the adrianl/fix-attention-unused-var branch 3 years ago
faxu
removed release:1.14
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub