johnnynunez
changed the title Update FlashInfer to version 0.6.7.post1 in Dockerfiles and related f… [NVIDIA] Update FlashInfer to version 0.6.7.post13 days ago
johnnynunez
changed the title [NVIDIA] Update FlashInfer to version 0.6.7.post1 [NVIDIA] Update FlashInfer to version 0.6.7.post1. Hot fix for DGX Spark3 days ago
johnnynunez
changed the title [NVIDIA] Update FlashInfer to version 0.6.7.post1. Hot fix for DGX Spark [NVIDIA] Update FlashInfer to version 0.6.7.post1. Avoid re-downloading BMM export headers when flashinfer-cubin is installed3 days ago
Update FlashInfer to version 0.6.7.post1 in Dockerfiles and related f…
26bbbaa1
Remove pre-download step for FlashInfer TRTLLM BMM headers in Dockerfile
0e7b5ed8
johnnynunezforce pushedfrom1acabfaato0e7b5ed83 days ago
Merge branch 'main' into main
0a459b3f
Merge branch 'vllm-project:main' into main
88f7c9be
0.6.7.post2
e6a85912
Merge branch 'vllm-project:main' into main
12dcd479
johnnynunez
changed the title [NVIDIA] Update FlashInfer to version 0.6.7.post1. Avoid re-downloading BMM export headers when flashinfer-cubin is installed [NVIDIA] Update FlashInfer to version 0.6.7.post2. Avoid re-downloading BMM export headers when flashinfer-cubin is installed2 days ago
Add startup_max_wait_seconds parameter to Llama-4-Scout-BF16-fi-cutla…
johnnynunez
changed the title [NVIDIA] Update FlashInfer to version 0.6.7.post2. Avoid re-downloading BMM export headers when flashinfer-cubin is installed [NVIDIA] Update FlashInfer to version 0.6.7.post3. Avoid re-downloading BMM export headers when flashinfer-cubin is installed11 hours ago
Merge branch 'vllm-project:main' into main
656b6cac
Update FlashInfer to version 0.6.7.post3 in Dockerfiles and related f…
Login to write a write a comment.
Login via GitHub