vllm
016b8d1b - Enabled BnB NF4 inference on Gaudi (#20172)

Commit
148 days ago
Enabled BnB NF4 inference on Gaudi (#20172)

Signed-off-by: Ruheena Suhani Shaik <rsshaik@habana.ai>