Fixed bug with hybrid engine generation when inference_tp_size > 1 #4493
gather attention mask
1a597655
hxdtest
changed the title gather attention mask Fixed bug with hybrid engine generation when inference_tp_size > 1 1 year ago
Merge branch 'master' into fix_attention_mask
3c7dc490
Merge branch 'master' into fix_attention_mask
e9eb917a
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub