The loss surfaces of deep neural networks have been the subject of several studies, theoretical and experimental, over the last few years. One strand of work considers the complexity, in the sense of local optima, of high-dimensional random functions, with the aim of informing how optimisation methods may perform in such complicated settings. Prior work by Choromanska et al. (2015) established a direct link between training multi-layer p...