vllm
c7ea0b56
- [AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
227 days ago
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331) Signed-off-by: Randall Smith <Randall.Smith@amd.com>
References
#17331 - [AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger
Author
rasmith
Parents
29fa5cac
Loading