onnxruntime
56ad6812 - Add support to use sequence as input ids in decoder inputs to Beam Search CUDA Op (#15232)

Commit
2 years ago
Add support to use sequence as input ids in decoder inputs to Beam Search CUDA Op (#15232) Add support to use sequence as input ids in decoder inputs to Beam Search CUDA Op ### Description Currently Beam search Op is only supported for CPU EP, added support for CUDA EP. ### Motivation and Context - For Turing models inference was throwing segmentation fault due to copy failing in cuda memory, also beam search support was not present in cuda.
Author
Parents
Loading