vllm-project/vllm
Commits
debug-logging
stash
Robert Shaw
committed
138 days ago
f0945e31
updated
Robert Shaw
committed
138 days ago
4ec76caa
updated
Robert Shaw
committed
138 days ago
1588294a
updated
Robert Shaw
committed
138 days ago
e82e9afe
Merge branch 'fix-connector-agg' into debug-logging
Robert Shaw
committed
138 days ago
10abfaf3
[BugFix] Fix KVConnector TP worker aggregation
njhill
committed
138 days ago
9ff1a2b5
updated
Robert Shaw
committed
139 days ago
0abe10e4
[Tests] Add tests for headless internal DP LB (#21450)
njhill
committed
139 days ago
Verified
316b1bf7
[Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models. (#21364)
sighingnow
committed
139 days ago
Verified
7c734ee0
[V1] Check all pooling tasks during profiling (#21299)
DarkLight1337
committed
139 days ago
Verified
f59ec35b
[Model] add Hunyuan V1 Dense Model support. (#21368)
Asher
committed
139 days ago
Verified
2671334d
[Docs] Clean up v1/metrics.md (#21449)
windsonsea
committed
139 days ago
Verified
2cc5016a
[Misc] fixed nvfp4_moe test failures due to invalid kwargs (#21246)
Yang Chen
committed
139 days ago
Verified
6929f8b4
Mamba V2 Test not Asserting Failures. (#21379)
fabianlim
committed
139 days ago
Verified
32ec9e2f
[Sampler] Introduce logprobs mode for logging (#21398)
houseroad
committed
139 days ago
Verified
accac829
[Docs] Fix bullets and grammars in tool_calling.md (#21440)
windsonsea
committed
139 days ago
Verified
23637dcd
Fixed typo in profiling logs (#21441)
sergiopaniego
committed
139 days ago
Verified
6364af92
[Bugfix] ensure tool_choice is popped when `tool_choice:null` is passed in json payload (#19679)
gcalmettes
committed
139 days ago
Verified
7aaa2bd5
add clear messages for deprecated models (#21424)
youkaichao
committed
139 days ago
Verified
2f5c14de
[Cleanup] Only log MoE DP setup warning if DP is enabled (#21315)
mgoin
committed
139 days ago
Verified
f002e9a8
[Core] Add basic unit test for maybe_evict_cached_block (#21400)
Jialin
committed
139 days ago
Verified
a1f3610f
[Bugfix] Fix nightly transformers CI failure (#21427)
Isotr0py
committed
139 days ago
Verified
4ecedd18
Changing "amdproduction" allocation. (#21409)
Alexei-V-Ivanov-AMD
committed
139 days ago
Verified
107111a8
[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported (#21420)
elvischenv
committed
139 days ago
Verified
2dec7c1a
[BUGFIX] deepseek-v2-lite failed due to fused_qkv_a_proj name update (#21414)
xuechendi
committed
139 days ago
Verified
08d2bd78
[BugFix] Update python to python3 calls for image; fix prefix & input calculations. (#21391)
ericehanley
committed
139 days ago
Verified
4f76a05f
Simplify weight loading in Transformers backend (#21382)
hmellor
committed
139 days ago
Verified
f154bb9f
[Bugfix][ROCm][Build] Fix build regression on ROCm (#21393)
gshtras
committed
139 days ago
Verified
3ec7170f
[CI/Build] Fix model executor tests (#21387)
DarkLight1337
committed
139 days ago
Verified
c401c64b
[BugFix] Fix ray import error mem cleanup bug (#21381)
joerunde
committed
139 days ago
Verified
b77c7d32