* Scale to 10k+ images
* Train baseline detector on synthetic data only
* Evaluate on MultiNet v1 benchmark (or other real data) to validate the synthetic data

Expected outcome: mAP within 5% of the real-data baseline
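
The expected outcome can be turned into an explicit pass/fail check. The sketch below is one possible reading, not a fixed definition. The plan does not say whether "within 5%" is relative to the baseline mAP or measured in absolute mAP points, so the hypothetical helper `meets_map_target` supports both and defaults to the relative reading. The function name, its parameters, and the mAP values in the usage lines are assumptions for illustration only, not measured results.

```python
def meets_map_target(
    synthetic_map: float,
    real_map: float,
    tolerance: float = 0.05,
    relative: bool = True,
) -> bool:
    """Check whether a synthetic-only model is close enough to the real-data baseline.

    synthetic_map: mAP of the detector trained on synthetic data only, in [0, 1].
    real_map:      mAP of the real-data baseline on the same benchmark, in [0, 1].
    tolerance:     allowed gap (0.05 = 5%).
    relative:      ASSUMPTION. If True, the gap is measured as a fraction of
                   real_map; if False, as absolute mAP points.
    """
    if not (0.0 <= synthetic_map <= 1.0 and 0.0 <= real_map <= 1.0):
        raise ValueError("mAP values must be in [0, 1]")

    # Positive gap means the synthetic-only model is worse than the baseline.
    gap = real_map - synthetic_map
    if gap <= 0:
        return True  # matching or beating the baseline always passes

    allowed = tolerance * real_map if relative else tolerance
    return gap <= allowed


# Hypothetical numbers, for illustration only.
print(meets_map_target(0.58, 0.60))                  # relative reading
print(meets_map_target(0.56, 0.60, relative=False))  # absolute reading
```

The two readings can disagree: with a baseline of 0.60, a synthetic-only mAP of 0.56 fails the relative check (allowed gap 0.03) but passes the absolute one (allowed gap 0.05), so it is worth fixing the interpretation before evaluation.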