Skip to content

Conversation

@mathuryash5
Copy link
Member

Issue

  • Detect OSWorld failures for GTA1 and UITARS-1.5. Trajectories available here.
  • Take the intersection of these failures.
  • Sample the required count from that intersection.

@Locke0 Locke0 linked an issue Sep 24, 2025 that may be closed by this pull request
4 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Existing dataset repurposing

2 participants