transformers
3c289e21 - [performance_optim] reduce frequency of declaring attention_mask in Ascend NPU flash attention (#38278)

Commit
304 days ago
[performance_optim] reduce frequency of declaring attention_mask in Ascend NPU flash attention (#38278) [performance_optim] reduce frequency of declaring attention_mask in ASCEND NPU flash attention
Author
Parents
Loading