[draft] Integrate a fuzzer to the CI #126

let-def · 2025-11-18T14:04:33Z

This PR integrates a fuzzer (ocamlgrammarfuzzer) into the CI. The idea is to derive a deterministic set of sentences from the grammar that guarantees exhaustive coverage of all syntactic constructions, and then to classify the behavior of ocamlformat on these sentences.

HACKING.jst.md has been updated to describe the fuzzer integration.

Catching regressions in the CI

The main job of the CI is to catch regressions, not to guarantee the absence of errors.
Here are sample outputs of the CI catching changes in coverage:

Helping developers improve coverage

For developers, another workflow is provided. Running make fuzz produces reports that give a detailed classification of all failures of ocamlformat, as well as regressions compared to a previous baseline set with make fuzz-update-state.

Otherwise, dune would execute the fuzzer for default builds because it has file targets.

let-def · 2025-11-18T14:16:03Z

I am not sure whether the CI logs can be consulted, so here is the relevant lines for the regression case:

Regression: (* C0 *) exception (* C1 *) false (* C2 *) let (* C3 *) x (* C4 *)
Regression: (* C0 *) val (* C1 *) x (* C2 *) : (* C3 *) ( (* C4 *) {%ext|s|} (* C5 *) ) (* C6 *)
Regression: (* C0 *) kind_abbrev_ (* C1 *) x (* C2 *) = (* C3 *) x (* C4 *) with (* C5 *) {%ext|s|} (* C6 *) -> (* C7 *) {%ext|s|} (* C8 *)
Tested 490799 sentences:
- 251417 successfully formated (51.23%)
- 59851 failed with syntax errors (12.19%)
- 715 had comments dropped (0.15%) (732 comments were dropped in total)
- 178818 caused internal errors (36.43%)
❌ valid sentences: -3 (REGRESSION)
❌ syntax errors: +1 (REGRESSION)
❌ comment errors: +1 (REGRESSION)
❌ comments dropped: +1 (REGRESSION)
❌ internal errors: +1 (REGRESSION)

And in the improvement case:

Tested 490799 sentences:
- 251417 successfully formated (51.23%)
- 59851 failed with syntax errors (12.19%)
- 715 had comments dropped (0.15%) (732 comments were dropped in total)
- 178818 caused internal errors (36.43%)
✅ valid sentences: +3
✅ syntax errors: -1
✅ comment errors: -2
✅ comments dropped: -1
✅ internal errors: -1

Previously, we would just check for the absence of regressions. Ensuring the fuzzer state is updated prevents a developer from forgetting to update the baseline when there are no regressions.

let-def added 8 commits November 18, 2025 21:14

test/fuzzer: dune rules for fuzzing

b12c913

Add Makefile targets to run the fuzzer

9fc2e30

Run the fuzzer in CI workflow

c0e9f0d

Eol_compat: fix crash triggered by the fuzzer

5cec24d

Save fuzzer state

fbfddf7

Update HACKING.jst.md

c93a393

Gate fuzzer rules with WITH_FUZZER envvar

76f9740

Otherwise, dune would execute the fuzzer for default builds because it has file targets.

Update fuzzer documentation

201a0af

let-def added 3 commits November 19, 2025 10:04

CI: check that fuzzer state is uptodate

c1ef9ff

Previously, we would just check for the absence of regressions. Ensuring the fuzzer state is updated prevents a developer from forgetting to update the baseline when there are no regressions.

Streamline update logic in test/fuzzer/run.sh

0f40a3a

Automatically update the state file.

609edbf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[draft] Integrate a fuzzer to the CI #126

[draft] Integrate a fuzzer to the CI #126

Uh oh!

let-def commented Nov 18, 2025

Uh oh!

let-def commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[draft] Integrate a fuzzer to the CI #126

Are you sure you want to change the base?

[draft] Integrate a fuzzer to the CI #126

Uh oh!

Conversation

let-def commented Nov 18, 2025

Catching regressions in the CI

Helping developers improve coverage

Uh oh!

let-def commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant