[otap-df-quiver] Add Write-Ahead-Log (WAL) implementation to Quiver #1537
base: main
Conversation
… be based on logical cursor.
Codecov Report
Additional details and impacted files:
@@            Coverage Diff             @@
##             main    #1537      +/-   ##
==========================================
+ Coverage   83.48%   83.74%   +0.26%
==========================================
  Files         428      433       +5
  Lines      118652   121877    +3225
==========================================
+ Hits        99054   102068    +3014
- Misses      19064    19275     +211
  Partials      534      534
"MIT-0",
"Apache-2.0",
"Unicode-3.0",
"BSD-2-Clause",
Required due to the array-ref transitive dependency (from blake3) using this license. BSD-2-Clause is even more permissive than BSD-3-Clause (which is already allowed here), so I don't believe this should be a concern.
    test_crashed: false,
};
writer.next_sequence = writer.coordinator.detect_next_sequence()?;
Ok(writer)
Is this an issue if the collector crashes mid-write? The WAL file could have a partial/corrupt record at the end. After restart, WalWriter::open() will seek to physical EOF, and subsequent writes appended after the corrupted data could result in:
Before crash: [Batch 1] [Batch 2] [Batch 3] [partial...]
After restart: [Batch 1] [Batch 2] [Batch 3] [garbage] [Batch 4] [Batch 5]
Maybe this is more theoretical. Either way, it would be good to have a test that simulates a partial write and verifies the recovery behavior (whether that's truncation, detection, or something else), along the lines of the sketch below.
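Rough sketch of such a test; WalReader, its open/read_next signatures, and the record layout (4-byte little-endian length prefix + payload, taken from the reader code further down) are assumptions on my part, not the crate's actual API:

```rust
#[test]
fn recovers_after_partial_tail_record() -> std::io::Result<()> {
    use std::io::Write;

    let dir = tempfile::tempdir()?;
    let path = dir.path().join("quiver.wal");

    // One complete length-prefixed record: 4-byte little-endian length + payload.
    let payload = b"batch-1";
    let mut file = std::fs::File::create(&path)?;
    file.write_all(&(payload.len() as u32).to_le_bytes())?;
    file.write_all(payload)?;

    // Simulate a crash mid-write: a length prefix that promises more bytes
    // than were actually flushed before the process died.
    file.write_all(&1024u32.to_le_bytes())?;
    file.write_all(b"partial")?;
    drop(file);

    // On restart the WAL should either truncate the torn record or stop at the
    // last complete entry, so later appends never land after garbage bytes.
    let mut reader = WalReader::open(&path)?; // hypothetical reader API
    assert_eq!(reader.read_next()?.as_deref(), Some(&payload[..]));
    assert!(reader.read_next()?.is_none());
    Ok(())
}
```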
}

let entry_len = u32::from_le_bytes(len_buf) as usize;
self.buffer.resize(entry_len, 0);
Here the reader trusts the length read from the file and allocates based on it. If the WAL file is corrupted (or maliciously crafted) and the length is large enough (say 0xFFFF…), the reader will try to allocate that much. The 4-byte prefix limits a single allocation to 4 GB, but every df_instance doing this allocation could still result in an OOM crash. Should we have some kind of max-length check (say, assuming the WAL won't be more than 64 MB)? Something like the sketch below, perhaps.
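Illustrative only; the constant, error type, and helper name are suggestions, not existing code in this PR:

```rust
/// Suggested upper bound on a single decoded WAL entry (64 MiB).
const MAX_WAL_ENTRY_LEN: usize = 64 * 1024 * 1024;

/// Hypothetical error carrying the length the on-disk prefix claimed.
#[derive(Debug)]
struct OversizedEntry {
    declared_len: usize,
}

fn validated_entry_len(len_buf: [u8; 4]) -> Result<usize, OversizedEntry> {
    let entry_len = u32::from_le_bytes(len_buf) as usize;
    if entry_len > MAX_WAL_ENTRY_LEN {
        // Reject the record instead of trusting the on-disk length blindly,
        // so a corrupted or malicious prefix cannot force a multi-GB allocation.
        return Err(OversizedEntry { declared_len: entry_len });
    }
    Ok(entry_len)
}
```

The reader would then resize the buffer only after the check, e.g. `self.buffer.resize(validated_entry_len(len_buf)?, 0)`, surfacing the error through whatever WAL error type already exists.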
}

fn wal_path(config: &QuiverConfig) -> PathBuf {
    config.data_dir.join("wal").join("quiver.wal")
Just confirming - each df_engine instance has a separate data_dir, right? Otherwise multiple instances would conflict on this path.
flush_policy,
max_wal_size: u64::MAX,
max_rotated_files: 8,
rotation_target_bytes: 64 * 1024 * 1024,
Both of the magic numbers above seem to be important operational defaults that affect capacity behavior. It would be good to define them as named constants, with doc comments explaining the rationale behind the default values, so they're more discoverable; see the sketch below.
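Roughly like this, maybe (constant names, field types, and doc wording are just suggestions; the values are the ones already used in the diff):

```rust
/// Default number of rotated WAL files retained before the oldest is dropped.
pub const DEFAULT_MAX_ROTATED_FILES: usize = 8;

/// Target size of a single WAL file before rotation kicks in (64 MiB). Together
/// with DEFAULT_MAX_ROTATED_FILES this bounds the default on-disk footprint.
pub const DEFAULT_ROTATION_TARGET_BYTES: u64 = 64 * 1024 * 1024;
```

The struct literal would then read `max_rotated_files: DEFAULT_MAX_ROTATED_FILES, rotation_target_bytes: DEFAULT_ROTATION_TARGET_BYTES`.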
path,
segment_cfg_hash,
flush_policy,
max_wal_size: u64::MAX,
More of a question: what happens if we've already hit the limit of 8 rotated WAL files of 64 MB each while this max_wal_size limit still hasn't been reached?
        }
    }
    Ok(highest.map_or(0, |seq| seq.wrapping_add(1)))
}
detect_next_sequence() scans all entries in all WAL files on startup. For the default capacity (64 MB * 8 = 512 MB) this is fine, but would it make sense to persist the last sequence in the checkpoint sidecar for faster recovery? A possible shape is sketched below.
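All names in this sketch are hypothetical; only the fallback scan (detect_next_sequence) exists in the PR, and it assumes the sidecar is written no earlier than the WAL entries it covers:

```rust
/// Hypothetical checkpoint sidecar carrying the last known sequence.
struct Checkpoint {
    /// Highest sequence number known when the checkpoint was written.
    last_sequence: Option<u64>,
}

fn next_sequence(
    checkpoint: Option<&Checkpoint>,
    scan_all_wal_files: impl FnOnce() -> std::io::Result<u64>,
) -> std::io::Result<u64> {
    if let Some(seq) = checkpoint.and_then(|cp| cp.last_sequence) {
        // Fast path: reuse the persisted sequence instead of re-reading every
        // entry of every rotated WAL file on startup.
        return Ok(seq.wrapping_add(1));
    }
    // Fallback: the existing full scan over all WAL files.
    scan_all_wal_files()
}
```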
This pull request introduces an implementation of the Quiver write-ahead log (WAL). The most significant change from the initial spec is a rewrite and clarification of the WAL file rotation and checkpointing mechanism. Documentation has been updated to reflect the new design.