vllm
[Metrics] Model FLOPs Utilization estimation
#30738
Merged

[Metrics] Model FLOPs Utilization estimation #30738

zhuohan123 merged 9 commits into vllm-project:main from SungMinCho:main
SungMinCho
SungMinCho SungMinCho requested a review from markmc markmc 5 days ago
SungMinCho SungMinCho requested a review from WoosukKwon WoosukKwon 5 days ago
SungMinCho SungMinCho requested a review from robertgshaw2-redhat robertgshaw2-redhat 5 days ago
SungMinCho SungMinCho requested a review from njhill njhill 5 days ago
SungMinCho SungMinCho requested a review from ywang96 ywang96 5 days ago
SungMinCho SungMinCho requested a review from alexm-redhat alexm-redhat 5 days ago
SungMinCho SungMinCho requested a review from heheda12345 heheda12345 5 days ago
SungMinCho SungMinCho requested a review from ApostaC ApostaC 5 days ago
chatgpt-codex-connector
mergify mergify added v1
gemini-code-assist
gemini-code-assist commented on 2025-12-16
SungMinCho
zhuohan123
zhuohan123 approved these changes on 2025-12-16
zhuohan123 zhuohan123 added ready
zhuohan123 zhuohan123 enabled auto-merge (squash) 5 days ago
mergify
SungMinCho
disabled auto-merge 5 days ago
Head branch was pushed to by a user without write access
SungMinCho SungMinCho force pushed from 156e6482 to 13ef03e8 5 days ago
mergify
SungMinCho SungMinCho force pushed from 13ef03e8 to dec07b90 5 days ago
mergify
SungMinCho SungMinCho force pushed from dec07b90 to af378f98 5 days ago
SungMinCho SungMinCho force pushed from af378f98 to 0b0adc5a 5 days ago
markmc markmc changed the title Add mfu stats logging [Metrics] Model FLOPs Utilization estimation 5 days ago
markmc
markmc
robertgshaw2-redhat
robertgshaw2-redhat commented on 2025-12-16
SungMinCho
SungMinCho SungMinCho force pushed from 3ec003f4 to cf5536fd 4 days ago
markmc
markmc requested changes on 2025-12-17
markmc
markmc
SungMinCho Add mfu stats logging
89bb95ba
SungMinCho Use ObservabilityConfig instead of ENVVAR to specify MFU logging level
6fb35c29
markmc [MFU] Re-instate test suite
b8fd1021
markmc [MFU] Move request counts into ExecutionContext
f5598f44
markmc [MFU] Delete unused get_perf_stats_per_gpu()
3a7fab1c
markmc [MFU] Remove unused PerfStats.__add__()
587d22ed
markmc [MFU] Refactor debug logging
661a7666
markmc [MFU] Change CLI arg to --enable-mfu-metrics
fe6564e9
markmc markmc force pushed from 0d0730b5 to fe6564e9 4 days ago
markmc
markmc
markmc approved these changes on 2025-12-17
markmc markmc enabled auto-merge (squash) 3 days ago
SungMinCho
SungMinCho Add Engine visibility to MFU logs & Fix PP duration
5302f3e3
disabled auto-merge 3 days ago
Head branch was pushed to by a user without write access
SungMinCho
SungMinCho
zhuohan123 zhuohan123 enabled auto-merge (squash) 3 days ago
SungMinCho
zhuohan123 zhuohan123 merged a0b782f9 into main 3 days ago
SungMinCho

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone