vllm-project/vllm


Robert Shaw committed 339 days ago

d0d68a4c

relax hybrid dp asserts

tlrmchlsmth committed 339 days ago

35f3782d

Merge remote-tracking branch 'origin/main' into one-pod-per-node-lb

njhill committed 340 days ago

5fb68091

[Tests] Add tests for headless internal DP LB (#21450)

njhill committed 340 days ago

Verified 316b1bf7

njhill committed 340 days ago

fb0cf7e2

fix internal_dp_lb tests

njhill committed 340 days ago

1c300fcf

[Bugfix][Qwen][DCA] fixes bug in dual-chunk-flash-attn backend for qwen 1m models. (#21364)

sighingnow committed 340 days ago

Verified 7c734ee0

[V1] Check all pooling tasks during profiling (#21299)

DarkLight1337 committed 340 days ago

Verified f59ec35b

CI tests for hybrid DPLB mode

njhill committed 340 days ago

6328c808

[Tests] Add tests for headless internal DP LB

njhill committed 340 days ago

d95aedd5

njhill committed 340 days ago

8601a22d

[Model] add Hunyuan V1 Dense Model support. (#21368)

Asher committed 340 days ago

Verified 2671334d

[Docs] Clean up v1/metrics.md (#21449)

windsonsea committed 340 days ago

Verified 2cc5016a

Merge remote-tracking branch 'refs/remotes/origin/main' into one-pod-per-node-lb

njhill committed 340 days ago

1bd5f2f1

[Misc] fixed nvfp4_moe test failures due to invalid kwargs (#21246)

Yang Chen committed 340 days ago

Verified 6929f8b4

Mamba V2 Test not Asserting Failures. (#21379)

fabianlim committed 340 days ago

Verified 32ec9e2f

[Sampler] Introduce logprobs mode for logging (#21398)

houseroad committed 340 days ago

Verified accac829

[Docs] Fix bullets and grammars in tool_calling.md (#21440)

windsonsea committed 340 days ago

Verified 23637dcd

Fixed typo in profiling logs (#21441)

sergiopaniego committed 340 days ago

Verified 6364af92

[Bugfix] ensure tool_choice is popped when `tool_choice:null` is passed in json payload (#19679)

gcalmettes committed 340 days ago

Verified 7aaa2bd5

fix handshake mock test

njhill committed 340 days ago

f63cc192

add clear messages for deprecated models (#21424)

youkaichao committed 340 days ago

Verified 2f5c14de

[Cleanup] Only log MoE DP setup warning if DP is enabled (#21315)

mgoin committed 340 days ago

Verified f002e9a8

[Core] Add basic unit test for maybe_evict_cached_block (#21400)

Jialin committed 340 days ago

Verified a1f3610f

[Bugfix] Fix nightly transformers CI failure (#21427)

Isotr0py committed 340 days ago

Verified 4ecedd18

Changing "amdproduction" allocation. (#21409)

Alexei-V-Ivanov-AMD committed 340 days ago

Verified 107111a8

[Bugfix][CUDA] fixes CUDA FP8 kv cache dtype supported (#21420)

elvischenv committed 340 days ago

Verified 2dec7c1a

[BUGFIX] deepseek-v2-lite failed due to fused_qkv_a_proj name update (#21414)

xuechendi committed 340 days ago

Verified 08d2bd78

[BugFix] Update python to python3 calls for image; fix prefix & input calculations. (#21391)

ericehanley committed 340 days ago

Verified 4f76a05f

Simplify weight loading in Transformers backend (#21382)

hmellor committed 340 days ago

Verified f154bb9f
