text-generation-inference
Add basic FP8 KV cache support
#2603
Merged

Loading