One curious after reading the paper, how did you handle Paths and commands. As one command like C might have more representation than other or M can have more representation. So, how did you handle class imbalance.
Also, could you also publish the CE loss for VP-VAE objective.
Thanks.