MoGlow: Probabilistic and controllable motion synthesis using normalising flows logo
This video introduces our SIGGRAPH Asia 2020 paper on animating motion using so-called normalising flows. In particular, we describe a new, deep machine-learning architecture, called MoGlow, for data-driven animation, that: 1) is general and does not make any task-specific assumptions (such as the motion being cyclic) 2) can be controlled interactively using high-level, "weak" input signals, and 3) is probabilistic and can describe (and sample) many different behaviours. The resulting animation looks highly natural regardless of the task (e.g., animating a human or a dog). Abstract: Data-driven modelling and synthesis of motion is an active research area with applications that include animation, games, and social robotics. This paper introduces a new class of probabilistic, generative, and controllable motion-data models based on normalising flows. Models of this kind can describe highly complex distributions, yet can be trained efficiently using exact maximum likelihood, unlike GANs or VAEs. Our proposed model is autoregressive and uses LSTMs to enable arbitrarily long time-dependencies. Importantly, is is also causal, meaning that each pose in the output sequence is generated without access to poses or control inputs from future time steps; this absence of algorithmic latency is important for interactive applications with real-time motion control. The approach can in principle be applied to any type of motion since it does not make restrictive, task-specific assumptions regarding the motion or the character morphology. We evaluate the models on motion-capture datasets of human and quadruped locomotion. Objective and subjective results show that randomly-sampled motion from the proposed method outperforms task-agnostic baselines and attains a motion quality close to recorded motion capture.