Generate New Prompt
Re-run Full Test Suite
Reruns all 40 prompts, re-judges, and rebuilds the contact sheet. Takes ~10 min.
draw_a_cat
error
[1/3] Generating 'draw_a_cat' …
[ERROR] [Errno 2] No such file or directory: '/app/.venv/bin/python3'