Talk
The geometry of the loss function of deep neural networks
- Yaim Cooper (Institute for Advanced Study, Princeton)
Abstract
The mathematical heart of deep learning is gradient descent on a loss function L. If gradient descent converges, its limit is a critical point of L, i.e. a point where the gradient of L vanishes. Thus the geometry of the locus of critical points is of great interest. We will discuss what is known about the critical points of L, including dimension estimates and connectedness results.
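The claim that gradient descent, when it converges, stops only at a critical point can be illustrated numerically. The sketch below is not from the talk; it runs gradient descent on a hypothetical toy loss L(w) = (w₀² − 1)² + w₁², whose critical points are (±1, 0) and (0, 0), and checks that the gradient norm at the limit is essentially zero.

```python
import numpy as np

def loss(w):
    # Toy loss with multiple critical points: minima at (1, 0) and (-1, 0),
    # and a saddle at (0, 0).
    return (w[0]**2 - 1)**2 + w[1]**2

def grad(w):
    # Gradient of the toy loss, computed by hand.
    return np.array([4 * w[0] * (w[0]**2 - 1), 2 * w[1]])

w = np.array([0.5, 0.8])  # arbitrary starting point
lr = 0.05                 # step size (learning rate)
for _ in range(2000):
    w = w - lr * grad(w)  # plain gradient descent update

# If the iterates converge, the update w - lr*grad(w) = w forces grad(w) = 0,
# so the limit is a critical point; here descent lands near the minimum (1, 0).
print(w, np.linalg.norm(grad(w)))
```

From this start the iterates flow to the minimum at (1, 0); starting exactly on the axis w₀ = 0 they would instead stall at the saddle (0, 0), which is why the full critical locus, not just the minima, matters.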