vllm
916836bb - [FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models. (#14664)

Commit
275 days ago
[FEAT] [ROCm] [Embedding] Add encoder-only model support into ROCm Flash Attention to enable embedding models. (#14664)

Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>