Grounding Language Learning In Vision For Artificial Intelligence And Brain Research