Daily Perf Improver - Research and Plan #213

2026-02-20T13:13:33Z

github-actions[bot]
bot Feb 20, 2026

Performance Landscape — FSharp.Control.AsyncSeq

FSharp.Control.AsyncSeq is a core F# library for asynchronous sequences. It exposes AsyncSeq<'T>, the asyncSeq computation expression, and a rich API of combinators (mapAsync, chooseAsync, append, collect, bufferByTime, etc.). Its performance directly affects every F# async pipeline built on top of it.

Current Performance Testing Infrastructure

Tools and Harnesses

- BenchmarkDotNet suite: tests/FSharp.Control.AsyncSeq.Benchmarks/AsyncSeqBenchmarks.fs
  - AsyncSeqCoreBenchmarks: unfoldAsync, replicate, mapAsync, chooseAsync at 1k/10k elements
  - AsyncSeqAppendBenchmarks: chained append at 10/50/100 chain depth
  - AsyncSeqBuilderBenchmarks: recursive asyncSeq { ... } builder at 50/100/200 recursion depth
- Ad-hoc F# script: tests/FSharp.Control.AsyncSeq.Tests/AsyncSeqPerf.fsx, with manual timing for N=1,000,000 scenarios including unfoldIter, replicate, collect, bufferByTime

CI

- PR builds run dotnet test -c Release; there is no dedicated benchmark CI job
- There is no automated performance regression detection in CI

Identified Performance Issues and Bottlenecks

1. Allocation-heavy async state machines

Every MoveNext() on an AsyncSeq goes through F#'s Async<'T>, which allocates continuations. The inline comments in AsyncSeqPerf.fsx record historical GC pressure, for example GC gen0: 1114 (a count of gen0 collections, not of individual allocations) for N=1M unfoldChooseIter. Each additional combinator in a pipeline adds its own per-element allocations.
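One rough way to put a number on this outside BenchmarkDotNet is to compare the process-wide allocation counter before and after draining a pipeline. The sketch below is only an illustration: the helper `bytesPerElement` and the pipeline shape are hypothetical and not taken from AsyncSeqPerf.fsx. It assumes the FSharp.Control.AsyncSeq package is referenced and a runtime that provides GC.GetTotalAllocatedBytes (.NET Core 3.0 or later).

```fsharp
// In an .fsx script, reference the package first, e.g.:
//   #r "nuget: FSharp.Control.AsyncSeq"
open System
open FSharp.Control

/// Hypothetical helper: runs `consume` once and returns the approximate
/// number of bytes allocated (process-wide) per element.
let bytesPerElement (n: int) (consume: Async<unit>) : float =
    let before = GC.GetTotalAllocatedBytes(true)
    Async.RunSynchronously consume
    let after = GC.GetTotalAllocatedBytes(true)
    float (after - before) / float n

let n = 100_000

// Illustrative pipeline: one source, one async combinator, one async consumer.
let pipeline : Async<unit> =
    AsyncSeq.replicate n 1
    |> AsyncSeq.mapAsync (fun x -> async { return x + 1 })
    |> AsyncSeq.iterAsync (fun _ -> async { return () })

printfn "approx. bytes/element: %.1f" (bytesPerElement n pipeline)
```

The counter is process-wide, so the figure includes any background allocation and is best read as a trend across pipeline shapes rather than an exact cost.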

2. `append` / `collect` chains — O(n²) risk

The AsyncGenerator.GenerateCont.Bind logic tries to keep binds right-associated to avoid O(n²) continuation chains. The existing benchmark covers chains of only 10–100 appends, so deeply nested asyncSeq { yield! ... } patterns that build up GenerateCont wrappers remain a concern.
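To make the nesting concern concrete, the sketch below builds the same logical sequence in two shapes, left-nested and right-nested, and times draining each. It is an assumption-laden illustration: the helper names are hypothetical, the depths are arbitrary, and whether the two shapes actually differ in cost depends on how the installed library version implements append.

```fsharp
open System.Diagnostics
open FSharp.Control

/// A small fixed chunk that is appended repeatedly.
let chunk : AsyncSeq<int> = AsyncSeq.replicate 10 0

/// ((chunk ++ chunk) ++ chunk) ++ ... : each append wraps the previous result.
let leftNested (depth: int) : AsyncSeq<int> =
    List.replicate depth chunk |> List.fold AsyncSeq.append AsyncSeq.empty

/// chunk ++ (chunk ++ (chunk ++ ...)) : the right-associated shape.
let rightNested (depth: int) : AsyncSeq<int> =
    List.foldBack AsyncSeq.append (List.replicate depth chunk) AsyncSeq.empty

/// Drains the sequence and returns (element count, elapsed milliseconds).
let drain (s: AsyncSeq<int>) : int * int64 =
    let sw = Stopwatch.StartNew()
    let count =
        s |> AsyncSeq.fold (fun acc _ -> acc + 1) 0 |> Async.RunSynchronously
    count, sw.ElapsedMilliseconds

for depth in [ 100; 200; 400 ] do
    let nl, tl = drain (leftNested depth)
    let nr, tr = drain (rightNested depth)
    printfn "depth %d: left %d items in %d ms, right %d items in %d ms" depth nl tl nr tr
```

If doubling the depth roughly doubles both timings, the chain is behaving linearly; a left-nested time that grows about fourfold per doubling would point at quadratic behaviour.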

3. `asyncSeq` computation builder recursive patterns

The RecursiveAsyncSeq benchmark directly tests whether recursive yield! degrades super-linearly. Historical data in the fsx script shows 69 seconds for N=10,000 recursive binds before the AsyncGenerator fix, vs 97ms after.
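A minimal sketch of the kind of recursive yield! pattern in question, timed at doubling depths, is shown below. The function names and depths are hypothetical, and the shape is only assumed to resemble what the RecursiveAsyncSeq benchmark exercises.

```fsharp
open System.Diagnostics
open FSharp.Control

/// Yields i, i+1, ..., limit-1 using a recursive yield! in tail position.
let rec countUp (i: int) (limit: int) : AsyncSeq<int> =
    asyncSeq {
        if i < limit then
            yield i
            yield! countUp (i + 1) limit
    }

/// Drains the recursive sequence and returns elapsed milliseconds.
let timeDepth (depth: int) : int64 =
    let sw = Stopwatch.StartNew()
    countUp 0 depth |> AsyncSeq.iter ignore |> Async.RunSynchronously
    sw.ElapsedMilliseconds

// Doubling the depth should roughly double the time if scaling is linear.
for depth in [ 1_000; 2_000; 4_000; 8_000 ] do
    printfn "depth %d: %d ms" depth (timeDepth depth)
```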

4. `mapAsyncUnorderedParallel` / `iterAsyncParallel`

Two loose scripts at the repo root (mapAsyncUnorderedParallel_test.fsx, iterAsyncParallel_cancellation_test.fsx) suggest active development on parallel combinators. They use System.Threading.Channels for concurrency, so changes there affect both correctness (the second script targets cancellation) and throughput.

5. No `ValueTask`/`IAsyncEnumerable<T>` bridging optimisation

The library predates .NET's IAsyncEnumerable<T>. There is an IAsyncEnumerator<'T> interface, but its MoveNext returns Async<'T option> (a heap-allocated F# async), not ValueTask<bool>. Bridging or native support could reduce allocations significantly on hot paths.

6. `bufferByTime` / time-based operations

Use of Async.Sleep and Task.WhenAny inside timing combinators can create thread-pool pressure in high-throughput scenarios.

Optimization Target Priorities

| Priority | Area | Rationale |
| --- | --- | --- |
| High | Reduce per-element allocation in core combinators (`mapAsync`, `chooseAsync`, `iterAsync`) | Largest user-facing impact; measurable with existing benchmarks |
| High | Validate/improve `asyncSeq` builder O(n²) safety at larger depths | Risk of pathological regressions; benchmark only goes to 200 |
| Medium | `mapAsyncUnorderedParallel` throughput and back-pressure | Active development; correctness + performance intertwined |
| Medium | BenchmarkDotNet CI integration (nightly or PR-triggered) | Enables regression detection without manual effort |
| Low | `ValueTask` / `IAsyncEnumerable<T>` adapter for .NET 5+ | Large scope but high long-term value |
| Low | Fable compilation size / tree-shaking | JS bundle size matters for Fable consumers |

Performance Engineering Gaps

- No automated perf regression baseline is stored in CI artefacts
- BenchmarkDotNet parameters are small (at most 10k elements and depth 200); real workloads reach millions of elements
- No memory-profile test exists for long-running sequences (leaks)
- No documentation explains how to run the benchmarks locally
- AsyncSeqPerf.fsx has many commented-out tests and is not integrated into any harness

Proposed Plan

- Extend the BenchmarkDotNet suite to cover N=100k–1M and memory allocation per element
- Audit the asyncSeq builder at recursion depths 500–5000 to check for regressions beyond the current 200
- Profile mapAsync/chooseAsync allocations; consider struct-based optimisations or an inlined Async.map path
- Parallel combinator performance: add a throughput benchmark for mapAsyncUnorderedParallel
- Add a CI benchmark step that comments the perf delta on each PR (using a dotnet-benchmark action or similar)
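As a sketch of the first plan item, a BenchmarkDotNet class with larger N values and the memory diagnoser enabled might look like the following. The class name, method name, and parameter values are hypothetical and do not come from AsyncSeqBenchmarks.fs; it assumes the BenchmarkDotNet and FSharp.Control.AsyncSeq packages are referenced.

```fsharp
open BenchmarkDotNet.Attributes
open FSharp.Control

/// Hypothetical benchmark class; the name and N values are illustrative.
/// MemoryDiagnoser adds allocation and GC columns to the results table.
[<MemoryDiagnoser>]
type AsyncSeqLargeNBenchmarks() =

    /// Element count; BenchmarkDotNet runs each benchmark once per value.
    [<Params(100_000, 1_000_000)>]
    member val N = 0 with get, set

    /// Maps and drains N elements, returning the count so the result is used.
    [<Benchmark>]
    member this.MapAsyncThenCount() : int =
        AsyncSeq.replicate this.N 1
        |> AsyncSeq.mapAsync (fun x -> async { return x + 1 })
        |> AsyncSeq.fold (fun acc _ -> acc + 1) 0
        |> Async.RunSynchronously
```

Such a class would be run from a Release build, for example through BenchmarkRunner.Run<AsyncSeqLargeNBenchmarks>() from BenchmarkDotNet.Running. Dividing the reported allocated bytes per operation by N gives an approximate per-element figure.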

How to Control this Workflow

Add comments to this discussion to provide feedback or adjustments to the plan.

You can also control the workflow with these commands:

gh aw disable daily-perf-improver --repo fsprojects/FSharp.Control.AsyncSeq
gh aw enable daily-perf-improver --repo fsprojects/FSharp.Control.AsyncSeq
gh aw run daily-perf-improver --repo fsprojects/FSharp.Control.AsyncSeq --repeat <number-of-repeats>
gh aw logs daily-perf-improver --repo fsprojects/FSharp.Control.AsyncSeq

What Happens Next

- The next time this workflow runs, Phase 2 will be performed: it will analyse the codebase to create a build-steps/action.yml configuration and performance engineering guides under .github/copilot/instructions/.
- After Phase 2 completes (and its PR is merged), Phase 3 will begin on subsequent runs to implement actual performance improvements from the plan above.
- If running in repeat mode, the workflow will automatically run again to proceed to the next phase.
- Humans can review this research and add comments before the workflow continues, using the comment thread below.

AI generated by Daily Perf Improver

expires on Feb 27, 2026, 1:13 PM UTC
