The loss landscape of deep linear neural networks: A second-order analysis

El Mehdi Achour (IMT Toulouse)

Live Stream

Abstract

We study the optimization landscape of deep linear neural networks with the square loss. It is known that, under weak assumptions, there are no spurious local minima and no local maxima. However, the existence and diversity of non-strict saddle points, which can play a role in first-order algorithms' dynamics, have only been lightly studied. We go a step further with a full analysis of the optimization landscape at order 2. We characterize, among all critical points, which are global minimizers, strict saddle points, and non-strict saddle points. We enumerate all the associated critical values. The characterization involves conditions on the ranks of partial matrix products, and sheds some light on global convergence or implicit regularization that have been proved or observed when optimizing linear neural networks. In passing, we provide an explicit parameterization of the set of all global minimizers and exhibit large sets of strict and non-strict saddle points.

Links

seminar

07.08.25 09.10.25

Math Machine Learning seminar MPI MIS + UCLA Math Machine Learning seminar MPI MIS + UCLA

MPI for Mathematics in the Sciences Live Stream

Details anzeigen

Upcoming Events of this Seminar

07.08.25 Efficient compression of neural networks and datasets with Lukas Barth
Donnerstag, 14.08.25 Topological Aspects of Symmetry-Preserving Neural Networks with Jonathan Siegel
Donnerstag, 21.08.25 Empirical Bayes Langevin dynamics in the linear model with Zhou Fan
Donnerstag, 28.08.25 Curvature Tuning: Provable Model Steering From a Single Parameter with Randall Balestriero
Donnerstag, 02.10.25 to be announced with Marcello Carioni
Donnerstag, 09.10.25 to be announced with Baharan Mirzasoleiman