feat(ai): outputFormat strategy + riteway ai init [8/8]#423

Open
ianwhitedeveloper wants to merge 14 commits into ai-testing-framework-implementation-consolidation from pr/agent-output-format

Conversation


@ianwhitedeveloper ianwhitedeveloper commented Mar 4, 2026

Summary

  • Replace parseOutput: fn with declarative outputFormat: 'json' | 'ndjson' | 'text' string in agent configs, making configs fully serializable for custom JSON config files
  • Add riteway ai init [--force] subcommand that writes all built-in agent configs to riteway.agent-config.json, enabling teams to add custom agents or modify existing entries
  • Add three-level agent config resolution chain: --agent-config > project registry > built-in defaults
  • Add riteway ai section to README
  • Fix: unknown CLI flags (e.g. --agentConfig) were silently ignored by minimist — now throws a ValidationError naming the unrecognized flag(s)
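To make the serializability point concrete, here is a minimal sketch (the config shapes are hypothetical; only the `outputFormat` field name comes from this PR). `JSON.stringify` silently drops function-valued fields, so a `parseOutput` function could never round-trip through a custom JSON config file, while a declarative string survives intact:

```javascript
// Before: a function-valued field cannot survive JSON serialization.
const before = {
  command: 'claude',
  parseOutput: (stdout) => JSON.parse(stdout),
};

// After: a declarative string keeps the config fully serializable.
const after = {
  command: 'claude',
  outputFormat: 'json',
};

const lossy = JSON.parse(JSON.stringify(before)); // parseOutput is gone
const roundTripped = JSON.parse(JSON.stringify(after)); // fully preserved
```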

Status: Code review remediated (2026-03-04) · Rebased onto merged PR #421 (2026-03-09) · 212 tests · 0 lint errors


Reviewer Verification

A human test script is available for manual verification of the riteway ai CLI:

plan/riteway-ai-command-human-test.md

It covers 25 scenarios across the full flag surface (--runs, --threshold, --timeout, --agent, --agent-config, --color, --concurrency), the three-level config resolution chain, and validation error cases, including the --agentConfig camelCase regression fixed in this PR. Reviewers can walk through the CLI with it and confirm behavior matches expectations. I've run through it myself, along with some init-functionality cases it doesn't include.

@ianwhitedeveloper ianwhitedeveloper changed the base branch from ai-testing-framework-implementation-consolidation to pr/ai-e2e-fixtures March 4, 2026 19:42
@ianwhitedeveloper ianwhitedeveloper changed the base branch from pr/ai-e2e-fixtures to ai-testing-framework-implementation-consolidation March 4, 2026 19:43
@ianwhitedeveloper
Collaborator Author

@cursoragent please /review


- Add no-prompt-under-test.sudo, missing-user-prompt.sudo,
  no-assertions.sudo fixtures to trigger extraction validation
  errors (MISSING_PROMPT_UNDER_TEST, MISSING_USER_PROMPT,
  NO_ASSERTIONS_FOUND) through the full E2E pipeline
- Add sudolang-prompt-test.sudo fixture to verify the framework
  handles SudoLang syntax in the userPrompt field
- Add describe.skipIf blocks for each validation error case and
  for the SudoLang happy-path scenario
- Replace try/catch error capture with Try helper, consistent
  with ai-runner.test.js and test-extractor.test.js conventions
- Update fixtures/README.md with new fixture descriptions

Made-with: Cursor
- Replace { name, code } partial assertion shapes with full
  error?.cause comparisons for all three validation error tests
  (MISSING_PROMPT_UNDER_TEST, MISSING_USER_PROMPT, NO_ASSERTIONS_FOUND)
- Expected objects now include name, message, code, and testFile —
  the complete deterministic cause shape from test-extractor.js
- Follows Jan's review principle from PR #409: deterministic
  functions should assert the complete expected value, not
  individual properties

Made-with: Cursor
- A1: validate userPrompt/assertions before resolveImportPaths
  so structural errors surface before any filesystem IO
- A2: guard CONTEXT section in buildJudgePrompt matching
  existing buildResultPrompt pattern; add test for empty case
- A3: use JSON.stringify for TAP YAML in mock helper,
  eliminating backtick-in-template fragility

Made-with: Cursor
- Rewrite buildExtractionPrompt with explicit EXTRACTION RULES
  block: agent must return "" / [] for missing fields rather than
  inferring userPrompt or assertions from import/context
- Update MISSING_USER_PROMPT and NO_ASSERTIONS_FOUND messages to
  correctly attribute failures to the agent, not the test file
- Add explicit "return []" fallback to importPaths rule for
  consistency with rules 1 and 3
- Remove !assertions dead code (parseExtractionResult guarantees
  an array; only .length === 0 check is needed)
- Update e2e expected messages and test-extractor snapshot test
  to match revised prompt and error messages
- buildJudgePrompt now conditionally omits CONTEXT section when
  promptUnderTest is empty, matching buildResultPrompt pattern
- Reorder extractTests validation: structural checks before IO

Made-with: Cursor
- Replace parseOutput fn with declarative outputFormat string on all built-in agent configs (claude, opencode, cursor)

- Add outputFormat lookup map in execute-agent.js (json/ndjson/text)

- Add outputFormat to agentConfigFileSchema (z.enum, default 'json') — configs loaded from file are now fully serializable

- Update tests: remove parseOutput mock, add real NDJSON integration test and schema round-trip test

- Add source/fixtures/ndjson-agent-config.json fixture

Made-with: Cursor
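For reviewers skimming the diff, the lookup map described above might look roughly like this (parser names and the NDJSON splitting details are assumptions, not the exact implementation):

```javascript
// Sketch of an outputFormat -> parser lookup map.
const outputFormatParsers = {
  json: (stdout) => JSON.parse(stdout),
  // NDJSON: one JSON object per non-empty line.
  ndjson: (stdout) =>
    stdout
      .split('\n')
      .filter((line) => line.trim() !== '')
      .map((line) => JSON.parse(line)),
  text: (stdout) => stdout, // identity; removed later in this PR as YAGNI
};

const events = outputFormatParsers.ndjson('{"a":1}\n{"b":2}\n');
// events is an array of two objects
```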
- Add `riteway ai init [--force]` subcommand that writes all
  built-in agent configs to riteway.agent-config.json
- Add loadAgentRegistry / resolveAgentConfig: three-level
  priority chain (--agent-config > registry > built-ins)
- Relax aiArgsSchema agent field to z.string().min(1) so custom
  registry agents pass validation before resolution
- Add registry + bad-schema-registry + invalid-registry fixtures
- fix: guard passRate against division by zero (NaN -> 0%)
- refactor: import defaults/runsSchema/thresholdSchema/
  concurrencySchema from constants.js; remove local duplicates
- refactor: move agentConfigs to module scope in agent-config.js
- chore: add "node": true to .eslintrc.json; switch
  agent-config.test.js to fileURLToPath(new URL(...))
- docs: add riteway ai section to README with .sudo example,
  options table, and riteway ai init walkthrough

Made-with: Cursor
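A minimal sketch of the priority chain this commit describes (the builtInConfigs shape and error construction are assumed; only the --agent-config > registry > built-ins ordering and the AGENT_NOT_IN_REGISTRY code come from this PR):

```javascript
// Hypothetical built-in defaults; the real set covers claude/opencode/cursor.
const builtInConfigs = {
  claude: { command: 'claude', outputFormat: 'json' },
};

// Resolve in priority order: explicit --agent-config file,
// then the project registry, then built-in defaults.
const resolveAgentConfig = ({
  agent,
  explicitConfig = undefined, // from --agent-config
  registry = {},              // from riteway.agent-config.json
} = {}) => {
  if (explicitConfig) return explicitConfig;
  if (registry[agent]) return registry[agent];
  if (builtInConfigs[agent]) return builtInConfigs[agent];
  const error = new Error(`Agent "${agent}" not found in registry`);
  error.code = 'AGENT_NOT_IN_REGISTRY';
  throw error;
};
```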
@ianwhitedeveloper
Collaborator Author

Re: automated review — post-rebase status (2026-03-09)

The branch was rebased onto the merged PR #421 today. The PR 8 code is unchanged — only the base shifted. The review above still accurately reflects the current diff.

One stale finding to note:

HIGH #1 — "Verify this fix wasn't mistakenly attributed to PR 8 when it belongs to PR 7"

The passRate division-by-zero guard (totalAssertions > 0 ? ... : 0) is intentionally in PR 8. It was written TDD (failing test first) as part of the code review remediation pass on 2026-03-04, not carried over from PR 7. The commit message and test coverage confirm this is deliberate PR 8 work. No action needed.
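For context, the guard in question is just a ternary on the denominator; a sketch with assumed function and field names:

```javascript
// 0/0 would otherwise yield NaN and render as "NaN%".
const passRate = ({ passedAssertions = 0, totalAssertions = 0 } = {}) =>
  totalAssertions > 0
    ? Math.round((passedAssertions / totalAssertions) * 100)
    : 0;
```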

Everything else in the review remains open and accurate.

…ISTRY

- Use destructuring default `outputFormat = 'json'` in execute-agent.js
  processAgentOutput instead of `|| 'json'` (project style: avoid `||`
  for defaults; use parameter defaults instead)
- Add AGENT_NOT_IN_REGISTRY code assertion to resolveAgentConfig test,
  matching the pattern used for INVALID_AI_ARGS in ai-command.test.js
- Expand AGENT_NOT_IN_REGISTRY error message to include "remove the file
  to use built-in defaults" as a third recovery path

Made-with: Cursor
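The style rule in miniature (processAgentOutput's real signature is assumed; the contrast between the two forms is the point):

```javascript
// Avoid: || clobbers intentional falsy values such as ''.
const withOr = (config) => config.outputFormat || 'json';

// Prefer: the parameter default applies only when the field is undefined.
const processAgentOutput = ({ outputFormat = 'json' } = {}) => outputFormat;
```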
…ptionsSchema

- Remove 'text' from outputFormatParsers and agentConfigFileSchema enum:
  identical to 'json' (identity fn), never used by any config or fixture,
  rawOutput:true already handles the plain-string use case (YAGNI)
- Remove agentSchema: only used by aiTestOptionsSchema, which is itself dead
- Remove aiTestOptionsSchema: never called in production; runAITests takes
  plain JS parameter defaults. Its agent field still used the old enum of 3,
  contradicting PR 8's relaxed z.string().min(1) for custom registry agents
- Delete constants.test.js: tested only the dead schema, not application logic
- Remove constraints.supportedAgents: last consumer (agentSchema) is gone

14 test files, 209 tests, 0 lint errors.

Made-with: Cursor
- Add --timeout MS CLI flag: parsed and validated via timeoutSchema
  (1000–3600000ms), flows through parseAIArgs → runAICommand →
  runAITests. timeoutSchema was exported but unconnected; now enforced.
- Remove export from constraints in constants.js: only used internally
  to build the individual schemas; no external consumers.
- Remove re-export of defaults from ai-command.js: bin/riteway.js
  already imports defaults directly from constants.js.
- Keep authGuidance export in validation.js: legitimately imported by
  validation.test.js to assert the full error message contract.

211 tests, 0 lint errors.

Made-with: Cursor
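The real timeoutSchema is a zod schema; here is a plain-JS sketch of just its 1000–3600000 ms range check, for illustration (the error message and code are assumptions):

```javascript
// Enforce the documented --timeout bounds: 1000 ms to 3600000 ms, integer.
const validateTimeout = (ms) => {
  if (!Number.isInteger(ms) || ms < 1000 || ms > 3600000) {
    const error = new Error(
      `--timeout must be an integer between 1000 and 3600000 ms, got ${ms}`
    );
    error.code = 'INVALID_AI_ARGS';
    throw error;
  }
  return ms;
};
```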
The --trust flag causes the cursor agent CLI to emit non-JSON content to stdout, breaking output parsing when spawned as a subprocess.

Made-with: Cursor
- Unrecognized flags (e.g. --agentConfig) were silently ignored by minimist, causing misconfigured runs with no error. Now throws ValidationError naming the unknown flag(s).

- Add test: unknown camelCase flag throws ValidationError with INVALID_AI_ARGS code and names the flag in the message.

- Add human and agent user test scripts to plan/.

Made-with: Cursor
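A dependency-free sketch of the guard's logic (the allow-list is taken from the flags named in this PR; the real code sits on top of minimist, whose documented `unknown` callback is one way to hook in such a check):

```javascript
// Flags this CLI recognizes; anything else --prefixed is an error.
const knownFlags = new Set([
  'runs', 'threshold', 'timeout', 'agent',
  'agent-config', 'color', 'concurrency',
]);

// Throw a ValidationError-style error naming every unrecognized flag.
const assertNoUnknownFlags = (argv) => {
  const unknown = argv
    .filter((arg) => arg.startsWith('--'))
    .map((arg) => arg.slice(2).split('=')[0])
    .filter((name) => !knownFlags.has(name));
  if (unknown.length > 0) {
    const error = new Error(
      `Unknown flag(s): ${unknown.map((name) => `--${name}`).join(', ')}`
    );
    error.code = 'INVALID_AI_ARGS';
    throw error;
  }
};
```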
@ianwhitedeveloper ianwhitedeveloper marked this pull request as ready for review March 9, 2026 21:39
- Steps 11-15 cover: writing the registry, using and customizing it, --force overwrite, and the exists-without-force error.

- Renumber old validation error steps 11-25 to 16-30 to maintain a clean sequential index.

Made-with: Cursor