text-generation-inference
293b8125 - ROCm: make CK FA2 default instead of Triton (#1924)

Commit
1 year ago
As per title. Triton autotune overhead is prohibitive, as autotuning needs to be redone for each different prompt length.
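To illustrate why per-prompt-length autotuning adds up, here is a minimal sketch of an autotuner whose cache is keyed on prompt length. It is not Triton's or text-generation-inference's actual code: `KeyedAutotuner`, `fake_benchmark`, and the `block_*` config names are hypothetical, and the cost model is made up purely for illustration.

```python
from typing import Callable, Dict, List


class KeyedAutotuner:
    """Hypothetical autotuner: benchmarks every candidate config on a cache miss."""

    def __init__(self, configs: List[str], benchmark: Callable[[str, int], float]):
        self.configs = configs
        self.benchmark = benchmark
        self.cache: Dict[int, str] = {}  # prompt length -> best config
        self.benchmark_runs = 0          # total benchmark launches so far

    def best_config(self, prompt_length: int) -> str:
        # A prompt length that has not been seen before triggers a full sweep.
        if prompt_length not in self.cache:
            timings = {}
            for cfg in self.configs:
                timings[cfg] = self.benchmark(cfg, prompt_length)
                self.benchmark_runs += 1
            self.cache[prompt_length] = min(timings, key=timings.get)
        return self.cache[prompt_length]


def fake_benchmark(config: str, prompt_length: int) -> float:
    # Stand-in cost model (assumption): cost grows with the distance
    # between the block size and the prompt length.
    block = int(config.split("_")[1])
    return abs(prompt_length - block) + 1.0


tuner = KeyedAutotuner(["block_64", "block_128", "block_256"], fake_benchmark)
for length in [17, 17, 300, 512, 300]:
    tuner.best_config(length)

# Each distinct prompt length costs one sweep over all configs.
print(tuner.benchmark_runs)
```

In this toy model, repeated lengths hit the cache, but every new length pays for a sweep over all candidate configs. With real prompts, where lengths vary widely, that cost recurs often.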