text-generation-inference
bf94df3c - fix(server): use mem_get_info to get kv cache size (#664)

2 years ago
fix(server): use mem_get_info to get kv cache size (#664)

Close https://github.com/huggingface/text-generation-inference/issues/649
Close https://github.com/huggingface/text-generation-inference/issues/651
Close https://github.com/huggingface/text-generation-inference/issues/653
Close #636
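
The commit title describes sizing the KV cache from the memory the device reports as free. The sketch below is not the code from this commit; it only illustrates the general idea under stated assumptions. `KVCacheShape`, `bytes_per_block`, `num_cache_blocks`, and `reserve_fraction` are hypothetical names chosen for illustration. In PyTorch, the free and total byte counts would typically come from `torch.cuda.mem_get_info()`, which returns a `(free, total)` tuple in bytes; here they are passed in as plain integers so the example runs without a GPU.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class KVCacheShape:
    """Hypothetical description of a model's KV cache layout."""
    num_layers: int
    num_kv_heads: int
    head_size: int
    block_size: int   # tokens stored per cache block
    dtype_bytes: int  # bytes per element, e.g. 2 for float16


def bytes_per_block(shape: KVCacheShape) -> int:
    # Each block holds one key tensor and one value tensor (factor 2)
    # for every layer.
    return (
        2
        * shape.num_layers
        * shape.num_kv_heads
        * shape.head_size
        * shape.block_size
        * shape.dtype_bytes
    )


def num_cache_blocks(
    free_bytes: int,
    total_bytes: int,
    shape: KVCacheShape,
    reserve_fraction: float = 0.05,
) -> int:
    # Assumption: keep a fraction of total device memory as headroom
    # and spend the rest of the currently free memory on cache blocks.
    # With PyTorch, the inputs could be obtained as:
    #     free_bytes, total_bytes = torch.cuda.mem_get_info()
    usable = free_bytes - int(total_bytes * reserve_fraction)
    return max(usable // bytes_per_block(shape), 0)
```

Measuring free memory after the model weights are loaded, rather than estimating it from the total, accounts for whatever else is already resident on the device.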