feat(extension): add raw OpenAI-compatible stream path for thinking-mode providers #149

Merged

lcomplete merged 1 commit into main from dev on Apr 27, 2026

Conversation

@lcomplete
Owner

Summary

  • Adds requiresRawOpenAICompatibleStream flag to ProviderMeta; set for qwen, zhipu, and minimax providers
  • Introduces a fetch-based streamOpenAICompatibleChatCompletion helper that can pass arbitrary request-body extras (e.g. enable_thinking) not exposed by the Vercel AI SDK
  • Routes providers with the flag through the new raw path in background.ts; all others keep the existing Vercel AI SDK path
  • Adds getThinkingModeOptions helper in thinkingMode.ts to build provider-specific thinking flags (see the sketch after this list)
  • Fixes the loading indicator in AssistantMessage to remain visible while reasoning content is streaming (not just before any parts arrive)
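As an illustration of how these pieces might fit together, here is a minimal sketch. The names ProviderMeta, requiresRawOpenAICompatibleStream, usesRawOpenAICompatibleStream, and getThinkingModeOptions come from this PR; the exact shapes and fields below are assumptions, not the PR's actual code.

// Sketch only: shapes are assumed, not copied from the source.
interface ProviderMeta {
  id: string;
  // When set, background.ts bypasses the Vercel AI SDK and uses the raw fetch path.
  requiresRawOpenAICompatibleStream?: boolean;
}

const qwenMeta: ProviderMeta = {
  id: "qwen",
  requiresRawOpenAICompatibleStream: true,
};

// Hypothetical routing predicate used when choosing a stream path.
function usesRawOpenAICompatibleStream(meta: ProviderMeta): boolean {
  return meta.requiresRawOpenAICompatibleStream === true;
}

// Hypothetical thinking-mode helper: always returns an explicit enable_thinking
// boolean to merge into the /chat/completions request body as an extra field.
function getThinkingModeOptions(thinkingEnabled: boolean): Record<string, unknown> {
  return { enable_thinking: thinkingEnabled };
}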

Test plan

  • yarn test passes in app/extension
  • Manual smoke test: qwen / zhipu / minimax providers stream responses with thinking mode enabled
  • Other providers (OpenAI, Anthropic, Ollama) unaffected — still use Vercel AI SDK path
  • Loading dots visible during reasoning phase, hidden once final text is rendered

🤖 Generated with Claude Code

feat(extension): add raw OpenAI-compatible stream path for thinking-mode providers

Introduce a direct fetch-based streaming pipeline for providers (qwen,
zhipu, minimax) that need explicit request-body flags such as
`enable_thinking`. Also improve the loading indicator so it stays visible
while reasoning output is in flight.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@augmentcode

augmentcode Bot commented Apr 27, 2026

🤖 Augment PR Summary

Summary: Adds a raw fetch-based OpenAI-compatible streaming path for providers that require explicit thinking-mode flags.

Changes:

  • Added requiresRawOpenAICompatibleStream to ProviderMeta and enabled it for qwen/zhipu/minimax.
  • Introduced usesRawOpenAICompatibleStream to decide when to bypass the Vercel AI SDK.
  • Implemented streamOpenAICompatibleChatCompletion to POST to /chat/completions and parse SSE deltas (content + reasoning).
  • Added getThinkingModeOptions to always send an explicit enable_thinking boolean.
  • Updated background.ts to route flagged providers through the raw stream and keep others on streamText.
  • Adjusted AssistantMessage so the loading indicator stays visible during reasoning/tool/step phases, and added Jest coverage.
Technical notes: The raw path caps output tokens at 8000 and extracts deltas from SSE `data:` frames (including reasoning_content/reasoning fields).
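As a rough illustration of the SSE handling described above: the field names content, reasoning_content, and reasoning follow the note, while the function below is a sketch rather than the PR's actual extractOpenAICompatibleStreamDelta.

interface OpenAICompatibleStreamDelta {
  text?: string;      // incremental answer text
  reasoning?: string; // incremental thinking output
  done: boolean;      // true once the stream sends [DONE]
}

// Sketch: parse the JSON payload of a single SSE "data:" frame.
function extractDeltaSketch(frame: string): OpenAICompatibleStreamDelta {
  if (frame.trim() === "[DONE]") {
    return { done: true };
  }
  const parsed = JSON.parse(frame);
  const delta = parsed.choices?.[0]?.delta ?? {};
  return {
    text: typeof delta.content === "string" ? delta.content : undefined,
    // Some providers stream thinking text as reasoning_content, others as reasoning.
    reasoning: delta.reasoning_content ?? delta.reasoning ?? undefined,
    done: false,
  };
}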


@augmentcode augmentcode Bot left a comment

Review completed. 3 suggestions posted.


Comment `augment review` to trigger a new review at any time.

const text = useMemo(() => getMessageText(message.parts), [message.parts]);
const lastResponseTextIndex = useMemo(
  () => findLastResponseTextIndex(message),
  [message]

@augmentcode augmentcode Bot Apr 27, 2026


app/extension/src/sidepanel/components/AssistantMessage.tsx:56 lastResponseTextIndex is memoized with [message]; if message.parts changes without replacing the message object, the memo can go stale and showPreparingResponse may not reflect the latest streamed parts.
Consider depending on message.parts (or deriving directly) so the loading indicator stays in sync with streaming updates.
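A minimal sketch of the suggested change, assuming the hook shape shown in the excerpt above:

// Sketch: depend on message.parts so the memo recomputes when parts are
// appended in place during streaming, keeping showPreparingResponse in sync.
const lastResponseTextIndex = useMemo(
  () => findLastResponseTextIndex(message),
  [message.parts]
);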

Severity: medium


const nextStreamState = applyStreamingPreviewChunk(streamState, chunk, {
  includeReasoning: includeReasoningPreview,
streamState = nextStreamState;
sendStreamingPreviewUpdate(

@augmentcode augmentcode Bot Apr 27, 2026


app/extension/src/background.ts:504 In the raw OpenAI-compatible path, sendStreamingPreviewUpdate isn’t wrapped in a try/catch like the Vercel AI path below, so a messaging failure could throw and abort the entire stream/task.
It may be safer to handle this error similarly so the stream stops gracefully instead of failing the run.
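A sketch of the suggested handling, mirroring the Vercel AI path; the call arguments are placeholders, not the PR's actual signature:

// Sketch: swallow messaging failures so a broken preview channel does not
// abort the raw stream or fail the whole task.
try {
  sendStreamingPreviewUpdate(/* same arguments as the current call */);
} catch (error) {
  console.warn("Streaming preview update failed; continuing stream", error);
}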

Severity: medium


dataLines = [];

const delta = extractOpenAICompatibleStreamDelta(eventData);
if (delta.done) {

@augmentcode augmentcode Bot Apr 27, 2026


app/extension/src/ai/openAICompatibleStream.ts:140 OpenAICompatibleStreamDelta includes a done flag, but onDelta is never invoked with done: true (the function returns early when it sees [DONE]).
This can surprise callers that expect a terminal callback to finalize UI/state.
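One way to address this, sketched under the assumption that onDelta accepts the same OpenAICompatibleStreamDelta shape:

// Sketch: emit an explicit terminal delta before returning so callers can
// finalize UI/state on a done: true callback.
if (eventData.trim() === "[DONE]") {
  onDelta({ done: true });
  return;
}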

Severity: low


@lcomplete lcomplete merged commit 2217099 into main Apr 27, 2026
1 check passed