Statistical efficiency and optimization of deep learning from the viewpoint of non-convexity

Taiji Suzuki (The University of Tokyo, and Center for Advanced Intelligence Project, RIKEN, Tokyo)

Live Stream

Abstract

In this talk, I discuss how deep learning can statistically outperform shallow methods such as kernel methods utilizing the notion of sparsity of a target function space, and present a non-convex optimization framework with a generalization and excess risk bounds. In the first half, I will summarize our recent work on the excess risk bounds of deep learning in the Besov space and its variants. It will be shown that the superiority of deep learning stems from sparsity of the target function space, and more essentially non-convex geometry of the space characterizes this property. In such a situation, deep learning can achieve the so-called adaptive estimation which gives a better excess risk than shallow methods. In the latter half, I present a deep learning optimization framework based on a noisy gradient descent in infinite dimensional Hilbert space (gradient Langevin dynamics), and show generalization error and excess risk bounds for the solution obtained by the optimization procedure. The proposed framework can deal with finite and infinite width networks simultaneously unlike existing one such as neural tangent kernel and mean field analysis.

Links

seminar

05.06.25 02.10.25

Math Machine Learning seminar MPI MIS + UCLA Math Machine Learning seminar MPI MIS + UCLA

MPI for Mathematics in the Sciences Live Stream

Details anzeigen

Upcoming Events of this Seminar

Donnerstag, 05.06.25 Complexity of Deciding Injectivity and Surjectivity of ReLU Neural Networks with Moritz Grillo
Donnerstag, 12.06.25 Where Does Mini-Batch SGD Converge? with Pierfrancesco Beneventano
Donnerstag, 19.06.25 to be announced with Jingfeng Wu
Donnerstag, 03.07.25 to be announced with Xingyu Zhu
Donnerstag, 10.07.25 to be announced with Avrajit Ghosh a.o.
Donnerstag, 14.08.25 to be announced with Jonathan Siegel
Donnerstag, 02.10.25 to be announced with Marcello Carioni