Video Modeling Via Implicit Motion Representations