Commit bbeb280

fix(vllm): warn that stream interval is not respected for now (#4650)
Signed-off-by: alec-flowers <[email protected]>
Signed-off-by: Alec <[email protected]>
1 parent f26dbd0 commit bbeb280

1 file changed: +6 -0 lines changed

components/src/dynamo/vllm/args.py

Lines changed: 6 additions & 0 deletions
@@ -225,6 +225,12 @@ def parse_args() -> Config:
     args.enable_local_indexer = str(args.enable_local_indexer).lower() == "true"
     engine_args = AsyncEngineArgs.from_cli_args(args)

+    if hasattr(engine_args, "stream_interval") and engine_args.stream_interval != 1:
+        logger.warning(
+            "--stream-interval is currently not respected in Dynamo. "
+            "Dynamo uses its own post-processing implementation on the frontend, "
+            "bypassing vLLM's OutputProcessor buffering. "
+        )
     # Workaround for vLLM GIL contention bug with NIXL connector when using UniProcExecutor.
     # With TP=1, vLLM defaults to UniProcExecutor which runs scheduler and worker in the same
     # process. This causes a hot loop in _process_engine_step that doesn't release the GIL,
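
The added check can be exercised in isolation. The following is a minimal sketch, not the code in args.py: the helper name `stream_interval_ignored` and the logger name are hypothetical, and `types.SimpleNamespace` stands in for `AsyncEngineArgs` so the snippet runs without vLLM installed. It assumes, as the diff does, that a value of 1 means no buffering was requested.

```python
import logging
from types import SimpleNamespace

# Hypothetical logger name, used only for this sketch.
logger = logging.getLogger("stream_interval_sketch")


def stream_interval_ignored(engine_args) -> bool:
    """Log a warning and return True when a non-default stream interval is set.

    Mirrors the condition in the diff: the hasattr guard tolerates
    engine-args objects that do not define the field at all.
    """
    if hasattr(engine_args, "stream_interval") and engine_args.stream_interval != 1:
        logger.warning(
            "--stream-interval is currently not respected in Dynamo. "
            "Dynamo uses its own post-processing implementation on the frontend, "
            "bypassing vLLM's OutputProcessor buffering."
        )
        return True
    return False


# SimpleNamespace objects are stand-ins for AsyncEngineArgs (an assumption of this sketch).
print(stream_interval_ignored(SimpleNamespace(stream_interval=4)))
print(stream_interval_ignored(SimpleNamespace(stream_interval=1)))
print(stream_interval_ignored(SimpleNamespace()))
```

Returning a boolean is a choice made here for testability; the committed code only logs and continues parsing.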
