Build out dataset

* Scale to 10k+ images
* Train baseline detector on synthetic data only
* Evaluate on MultiNet v1 benchmark (or other real data) for synthetic data validation

Expected outcome: Meet mAP target within 5% of real data baseline