Automatic Evaluation Generation

The system now automatically creates evaluation test cases from successful migrations. This behavior is enabled by default and can be disabled using the --dont-create-evals flag:

ai-migrate migrate --dont-create-evals <file_paths>

Each successful migration will generate a new evaluation in the project's evals directory with:

The original source files
The manifest used for the migration
The verification command that was used

This ensures that all successful migrations contribute to the test suite, improving coverage and helping to catch regressions.

Managing Evaluations

You can also manage evaluations using the CLI:

ai-migrate migrate --manage evals

This provides options to:

List existing evaluations: View all evaluations in the project with details like file count and creation date
Generate evaluations from a PR: Create evaluations based on a GitHub Pull Request
Generate evaluations from recent migrations: Create evaluations from recent successful migrations (coming soon)

How Automatic Evaluation Works

When a migration succeeds (passes verification), the system:

Captures the original source files before migration
Captures the transformed files after migration
Creates a new directory in the evals directory with a timestamp-based name
Saves the original files in the source subdirectory
Creates a manifest file with the verification command and file information

These evaluations can then be used to:

Test future versions of the migration system
Ensure that regressions don't occur
Benchmark performance and accuracy

Disabling Automatic Evaluation

If you don't want to automatically create evaluations (for example, during development or testing), use the --dont-create-evals flag:

ai-migrate migrate --dont-create-evals <file_paths>

This will skip the evaluation creation step while still performing the migration as usual.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatic Evaluation Generation

Managing Evaluations

How Automatic Evaluation Works

Disabling Automatic Evaluation

FilesExpand file tree

eval_improvement.md

Latest commit

History

eval_improvement.md

File metadata and controls

Automatic Evaluation Generation

Managing Evaluations

How Automatic Evaluation Works

Disabling Automatic Evaluation