fix(stt): persist interrupted batch transcripts #5246
Merged
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Reviewed by Cursor Bugbot for commit 393fc08.
Infer participant speaker counts for live and batch STT requests, pass them through Hyprnote proxy URLs, and map them to supported provider diarization parameters.
Persist streamed batch transcript snapshots and flush them so interrupted batch jobs can recover transcript progress.
Skip the progressive idle timeout for direct cloud batch transcription so long uploaded audio can wait for the final provider response.

Persist streamed batch transcript snapshots and flush them so interrupted batch jobs can recover transcript progress.
This is part 2 of 2 in a stack made with GitButler.
Note
Medium Risk
Changes how progressive batch transcripts are persisted (replace-mode snapshots + explicit store flush), which could affect transcript integrity/duplication and write frequency. Also alters timeout behavior by disabling idle timeouts for non-progressive/cloud batch runs, which may change failure modes for those providers.
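To make the duplication risk concrete, here is a minimal sketch of why full snapshots need replace-mode writes. It is not the PR's code: `TranscriptStore`, `Word`, `persistSnapshots`, and the sample data are hypothetical names invented for illustration, and `save()` is only a stand-in for an explicit store flush.

```typescript
// Hypothetical types for illustration only; not the project's real definitions.
type Word = { text: string; startMs: number; endMs: number };
type PersistMode = "append" | "replace";

class TranscriptStore {
  private words: Word[] = [];
  flushCount = 0;

  persist(incoming: Word[], options: { mode?: PersistMode } = {}): void {
    const mode = options.mode ?? "append";
    // "replace" overwrites stored words with the snapshot;
    // "append" extends whatever is already stored.
    this.words =
      mode === "replace" ? [...incoming] : [...this.words, ...incoming];
  }

  // Stand-in for an explicit flush to durable storage.
  save(): void {
    this.flushCount += 1;
  }

  text(): string {
    return this.words.map((w) => w.text).join(" ");
  }
}

// Assumption: each streamed event carries the full transcript so far.
const snapshots: Word[][] = [
  [{ text: "hello", startMs: 0, endMs: 400 }],
  [
    { text: "hello", startMs: 0, endMs: 400 },
    { text: "world", startMs: 450, endMs: 900 },
  ],
];

function persistSnapshots(
  store: TranscriptStore,
  events: Word[][],
  mode: PersistMode,
): string {
  for (const snapshot of events) {
    store.persist(snapshot, { mode });
    store.save(); // flush after every streamed write
  }
  return store.text();
}

const replacedText = persistSnapshots(new TranscriptStore(), snapshots, "replace");
const appendedText = persistSnapshots(new TranscriptStore(), snapshots, "append");
```

Under these assumptions, `replacedText` is `"hello world"`, while `appendedText` repeats the first word. That repetition is the duplication the risk note warns about. The per-event `save()` call is also where the extra write frequency would come from.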
Overview
Progressive batch runs now persist recoverable transcript snapshots. Streamed `BatchResponseStreamed` segment events are flattened across channels and persisted as full replace-mode snapshots, and final `handleBatchResponse` also writes via `replace`.

Batch persistence is made safer/more durable. `BatchPersistCallback` gains an optional `{ mode: "append" | "replace" }`; the default batch persist in `useRunBatch` honors `replace` (avoiding re-reading existing words/hints), wraps updates in a single transaction, and triggers a `save()` flush after streamed writes.

Idle-timeout monitoring is gated to progressive batches. A new core helper `expects_progressive_batch` (exported from `listener2-core`) is used by the plugin to only start the 60s idle timeout monitor when a provider/model should emit progressive batch events; tests cover the new behavior.

Reviewed by Cursor Bugbot for commit bf94153.
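The idle-timeout gating described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the plugin's implementation: the real `expects_progressive_batch` helper lives in `listener2-core` and its decision rule is not shown in this summary, so `expectsProgressiveBatch`, `BatchRun`, `IdleMonitor`, `startIdleMonitor`, and `hasTimedOut` are hypothetical stand-ins. Only the 60s timeout value comes from the description.

```typescript
// Hypothetical run descriptor; the real helper inspects provider/model.
type BatchRun = { provider: string; progressive: boolean };

// Stand-in for the core helper; the actual rule is not shown in the summary.
function expectsProgressiveBatch(run: BatchRun): boolean {
  return run.progressive;
}

const IDLE_TIMEOUT_MS = 60_000; // 60s, as stated in the description

class IdleMonitor {
  private lastEventAt: number;
  private readonly timeoutMs: number;

  constructor(timeoutMs: number, startedAt: number) {
    this.timeoutMs = timeoutMs;
    this.lastEventAt = startedAt;
  }

  // Call whenever a progressive batch event arrives.
  touch(now: number): void {
    this.lastEventAt = now;
  }

  isExpired(now: number): boolean {
    return now - this.lastEventAt > this.timeoutMs;
  }
}

// Only progressive runs get a monitor; other runs wait for the final response.
function startIdleMonitor(run: BatchRun, now: number): IdleMonitor | null {
  return expectsProgressiveBatch(run)
    ? new IdleMonitor(IDLE_TIMEOUT_MS, now)
    : null;
}

function hasTimedOut(monitor: IdleMonitor | null, now: number): boolean {
  return monitor !== null && monitor.isExpired(now);
}

const progressiveMonitor = startIdleMonitor(
  { provider: "local-streaming", progressive: true },
  0,
);
const cloudMonitor = startIdleMonitor(
  { provider: "direct-cloud", progressive: false },
  0,
);
```

In this sketch a run that is not expected to emit progressive events never gets a monitor, so a long upload cannot trip the idle timeout while it waits for the final provider response. Timestamps are passed in explicitly so the behavior can be checked without real timers.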