This is a handy feature in ApprovalTests.Java. Bring it in to Python. Then teach AI agents to use it to run the build with a command-line diff reporter.