text-generation-inference
c20025db - Add fp8 kv cache for ROCm (#2856)

Commit
336 days ago
Add fp8 kv cache for ROCm (#2856)

* add fp8 kv cache for rocm
* improvements
* update log statement
* remove bookkeeping field