Thanks to the great work on VAR! π
Building upon it, we introduce MVAR, which incorporates scale and spatial Markovian assumptions into visual autoregressive modeling.
MVAR achieves a 1.7Γ speedup and 3Γ reduction in GPU memory usage, enabling efficient training on eight RTX 4090 GPUs.
π Paper: https://arxiv.org/abs/2505.1274
π GitHub: https://github.com/LabShuHangGU/MVAR
Thanks to the great work on VAR! π
Building upon it, we introduce MVAR, which incorporates scale and spatial Markovian assumptions into visual autoregressive modeling.
MVAR achieves a 1.7Γ speedup and 3Γ reduction in GPU memory usage, enabling efficient training on eight RTX 4090 GPUs.
π Paper: https://arxiv.org/abs/2505.1274
π GitHub: https://github.com/LabShuHangGU/MVAR