643 - S-VVAD: Visual Voice Activity Detection by Motion Segmentation