MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

ACL 2020