-Need to learn against adversary -Need to learn with games until the end (in order to give Victory/Defeat reward) -Benchmarking the augmentation of the patch_size for learning and choosing action
-Need to learn against adversary
-Need to learn with games until the end (in order to give Victory/Defeat reward)
-Benchmarking the augmentation of the patch_size for learning and choosing action