Go
Home
Pricing
FAQ
Install
Home
Pricing
FAQ
Install
Login
via GitHub
vllm-project/vllm
Pull Requests
Commits
rob-fixes
7snzwi-codex/change-default-logging-behavior
acc-rate
add-nixl-transfer-time-logging
add-sgl-config
add-symm-mem-to-compile-cache
add-utils
amd_dev
amd_mori
amd-ci
andy-neuma-testing
avoid-double-free
batched_triton_fallback
bench-latency
benchmark_serving_test
benchmark
benchmark-output
bind_kv_caches
build-flashinfer-aot-wheel
codex/add-auto-max-model-length-setting
codex/add-pandas-and-datasets-to-requirements
codex/change-default-logging-behavior
codex/remove-raydistributedexecutor-from-v0-engine
codex/remove-vllm-v0-engine-references-from-docs
codex/update-arch-overview-md-with-vllm-v1-details
compile-eplb
copilot/disable-batched-triton-kernel
copilot/fix-31e676e9-a4af-4ed2-b74d-19d27f0a57b2
copilot/fix-584be906-f283-4e17-8776-c14111357ee7
copilot/fix-56244f30-e76a-41ed-beaf-3bc9de22a2c9
copilot/fix-870996da-9146-438e-9a52-cdc6c1743086
copilot/fix-c6914add-1b66-46d0-9948-c2e7b6f2259f
copilot/fix-cudagraph-flag-combination
correct-docs-cuda-version
dbo-cudagraph-size-cherry
debug
debug-logging
debug-logs
deep_full_cudagraph_fix
deepep_tweaks
deepseek_optimizations_alex_rob
dependabot/github_actions/actions/checkout-5.0.0
dependabot/github_actions/actions/checkout-6.0.1
dependabot/github_actions/actions/stale-10.1.1
disable-sd
dockerfile-nvcc-compress
dynamo-patch
fix_ds_eagle
fix_hang
fix_use_ep
fix-doc-build
fix-hashing-partial-blocks
fix-precommit
fix-v1-test
fp8_ep_dp
full_cudagraph
fused-moe-tuning-ep
gemma3n-mm
gpu_ids2
gpu-ids
il_tool
jax-tpu
kevin_h100
khluu/clean_apt
khluu/nccl
khluu/refactor_ci
khluu/test_fixed_premerge
khluu/test_latest_feat
khluu/test_pull_through_cache
khluu/test_us_east_1
khluu/test
khluu/try_moc
khluu/use_ccache_premerge
khluu/0.11.1
low_latency_opt
lwilkinson/cg-support
lwilkinson/dbo-full-cudagraphs
lwilkinson/eagle-piecewise
lwilkinson/potential-cutlass-mla-fix
lwilkinson/refactor-cmake
main
mamba_tests
marlin_gptoss_swiglu
maybe_fix_hang_2
memory-leak-branch
mergify/houseroad/config-update
minus_x
mla_cuda_graphs
mla_decode_any_head
mla-support-awq-marlin
model-bash-tools
moondream2
nixl-debug-oh-fixed
nixl-upstreaming
optimize-prefix-caching-scheduling
pd_scheduling
pil_image
qwen25vl
rebased_fi_moe
reduce_scatter_comm
releases/v0.9.0
releases/v0.9.1
releases/v0.9.2
releases/v0.10.0
releases/v0.10.1
releases/v0.10.2
releases/v0.11.0
releases/v0.11.1
releases/v0.11.2
releases/v0.12.0
remove_mamba_ssm
remove-async-engine-tests
remove-metrics-and-tracing-test
remove-regression-test
revert-21550-chengji/fix-ci
revert-22299-main
revert-26740-wentao-optimize-startup-log-2
revert-27600-torch-utils-import
revert-29385-eplb_nightly_ci
rob-fixes
running-deque
sampler-env-variable
seemethere/cuda_arm64
simon-mo-patch-1
skip-lmfe-tests
skip-transformers-nightly
split_kv_cache_init
support_global_dp_logging
test-debug-lb
test-docker-cache
tms/distributed_timeout
topk_id_hack
torch_dynamo
torch-2.8
tpu_v1_optimized
tpu_v1
triton-configs
update_from_kv_xfer_finished_race_fix
use-uv-python-for-docker
v0.7.2-staging-branch
v0.8.0
v0.8.1
v0.8.2
v0.8.3
v0.8.4
v0.8.5
v1-sched-interface-2
v1_fix_profiler
verbose-prime-rl-ci
wentao-TritonMLA-support-without-prefix-caching
wentao-fix-python-install-ci-error
wentao-fix-torch-warning
wentao-optimize-group-topk
wentao-optimize-startup-logs-4
whisper-translate
wide_ep_working_branch
wide_ep_working_branch_2
woosuk/fa3-swa-cudagraph
woosuk/flashinfer-swa
woosuk/remove-req-idx-mapping
woosuk/rm-add-init-env
woosuk/router-nixl
woosuk/sampled-token-ids
woosuk/test-router
woosuk/v2-logit-bias
woosuk/v2-nan
woosuk/v2-penalties
woosuk-jf
wye-refactor-w8a8-quant
zhuohan/moe-kernel-experiment
zhuohan/remove-redundant-argument
zhuohan/remove-virtual-engine
zhuohan/revert-26709
updated
robertgshaw2-redhat
committed
259 days ago
220d6940
updated
robertgshaw2-redhat
committed
259 days ago
70e06dd5
updated
robertgshaw2-redhat
committed
259 days ago
7954461d
updated
robertgshaw2-redhat
committed
259 days ago
a10da866
added __init__.py
robertgshaw2-redhat
committed
259 days ago
284d5df4
added __init__.py
robertgshaw2-redhat
committed
259 days ago
d5b0db44
updated
robertgshaw2-redhat
committed
259 days ago
66349c33
updated
robertgshaw2-redhat
committed
259 days ago
28d0396f
added files
robertgshaw2-redhat
committed
259 days ago
2f29ae38
updated
robertgshaw2-redhat
committed
259 days ago
cf64b0e6
pre-commit
robertgshaw2-redhat
committed
260 days ago
f51f182d
fix pre-commit
Robert Shaw
committed
260 days ago
79e465f5
updated
Robert Shaw
committed
260 days ago
2ba687d3
updated
Robert Shaw
committed
260 days ago
5d57896e
cleanup
Robert Shaw
committed
260 days ago
f6f008ca
updated
Robert Shaw
committed
260 days ago
24cbbe47
working?
Robert Shaw
committed
260 days ago
2fec6e0b
updated
Robert Shaw
committed
260 days ago
47a3f26b
Merge branch 'main' into rob-fixes
Robert Shaw
committed
261 days ago
144162fc
Stash
Robert Shaw
committed
261 days ago
522279eb
Remove openvino support in favor of external plugin (#15339)
russellb
committed
261 days ago
Verified
b877031d
updated
Robert Shaw
committed
261 days ago
85687b43
updated
Robert Shaw
committed
261 days ago
120bbdfd
updated
Robert Shaw
committed
261 days ago
2ceb7bc5
updated
Robert Shaw
committed
261 days ago
9f7fb5ec
updated
Robert Shaw
committed
261 days ago
a8a621e4
[BugFix][Typing] Fix Imprecise Type Annotations (#15208)
WrRan
committed
261 days ago
Verified
dd861b99
[V1] Add `disable-any-whitespace` option support for xgrammar (#15316)
russellb
committed
261 days ago
Verified
eb63ea1e
[Model] Support Tele-FLM Model (#15023)
atone
committed
261 days ago
Verified
2f4bd358
[Bugfix] LoRA V0 - Fix case where `max_num_seqs` is between cudagraph capture sizes (#15308)
varun-sundar-rabindranath
committed
261 days ago
Verified
8a8b30ea
Older