Hierarchical Heterogeneous Cluster Systems For Scalable Distributed Deep Learning