Mapping Highly Nonconvex Energy Landscapes In Clustering, Grammatical And Curriculum Learning