components/src/dynamo/vllm
1 file changed, +6 -0 lines changed
@@ -225,6 +225,12 @@ def parse_args() -> Config:
     args.enable_local_indexer = str(args.enable_local_indexer).lower() == "true"
     engine_args = AsyncEngineArgs.from_cli_args(args)
 
+    if hasattr(engine_args, "stream_interval") and engine_args.stream_interval != 1:
+        logger.warning(
+            "--stream-interval is currently not respected in Dynamo. "
+            "Dynamo uses its own post-processing implementation on the frontend, "
+            "bypassing vLLM's OutputProcessor buffering. "
+        )
     # Workaround for vLLM GIL contention bug with NIXL connector when using UniProcExecutor.
     # With TP=1, vLLM defaults to UniProcExecutor which runs scheduler and worker in the same
     # process. This causes a hot loop in _process_engine_step that doesn't release the GIL,
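
For reviewers who want to exercise the new guard outside of parse_args(), here is a minimal sketch of the same check in isolation. It is not part of this change. The helper name warn_if_stream_interval_ignored and the logger name are hypothetical, and the SimpleNamespace objects are stand-ins for AsyncEngineArgs so the snippet runs without vLLM installed.

```python
import logging
from types import SimpleNamespace

# Hypothetical logger name; the real module uses its own `logger`.
logger = logging.getLogger("stream_interval_sketch")


def warn_if_stream_interval_ignored(engine_args) -> bool:
    """Log a warning and return True when a non-default stream_interval is set."""
    # hasattr() keeps the check safe for engine-args objects without this field.
    if hasattr(engine_args, "stream_interval") and engine_args.stream_interval != 1:
        logger.warning(
            "--stream-interval is currently not respected in Dynamo. "
            "Dynamo uses its own post-processing implementation on the frontend, "
            "bypassing vLLM's OutputProcessor buffering."
        )
        return True
    return False


# Stand-ins for AsyncEngineArgs (assumption: only the attribute lookup matters here).
default_args = SimpleNamespace(stream_interval=1)   # default value, no warning
custom_args = SimpleNamespace(stream_interval=4)    # non-default value, warning
no_field_args = SimpleNamespace()                   # attribute absent, no warning

results = [
    warn_if_stream_interval_ignored(a)
    for a in (default_args, custom_args, no_field_args)
]
```

Returning a bool is a choice made only for this sketch so the behaviour can be asserted on; the change itself just logs and continues.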