Skip to content

Conversation

@christso
Copy link
Collaborator

@christso christso commented Jan 2, 2026

No description provided.

… structure

- Add comprehensive OpenSpec proposal for vision evaluation
- Create self-contained examples/showcase/vision/ directory structure
- Add 14 eval cases (7 basic, 7 advanced) for vision testing
- Add 10 evaluators (6 LLM judges, 4 code validators)
- Support local files, URLs, and multiple image formats
- Rename evals/ to datasets/ following AgentV conventions
- Include comprehensive documentation and configuration
- Research findings from 4 leading frameworks (ADK-Python, Mastra, Azure SDK, LangWatch)
@christso christso marked this pull request as draft January 2, 2026 04:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants