vllm
c7ea0b56 - [AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331)

Commit
227 days ago
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331)

Signed-off-by: Randall Smith <Randall.Smith@amd.com>