text-generation-inference
70217ac3 - [Gaudi] Fix the OOM issue of Llama-4-Scout-17B-16E-Instruct (#3245)

Commit
204 days ago
[Gaudi] Fix the OOM issue of Llama-4-Scout-17B-16E-Instruct (#3245)

Signed-off-by: yuanwu <yuan.wu@intel.com>