vllm
6b6e9877 - [NVIDIA] flashinfer TRTLLM attention prefill token limit (#25998)

Commit
89 days ago
[NVIDIA] flashinfer TRTLLM attention prefill token limit (#25998)

Signed-off-by: jasonlizhengjian <jason.li@centml.ai>
Signed-off-by: jasonlizhengjian <jasonlizhengjian@gmail.com>