text-generation-inference
880ab9c2 - Add Flash decoding kernel ROCm (#2855)

Commit
339 days ago
Add Flash decoding kernel ROCm (#2855)

* (vllm) updated vllm rocm kernels
* revert silu
* update partition size
* remove grouped_topk
* (nit) remove log
* add flash decoding