Towards Lower Bounds on the Depth of ReLU Neural Networks

Christoph Hertrich (TU Berlin)

Live Stream

Abstract

We contribute to a better understanding of the class of functions that is represented by a neural network with ReLU activations and a given architecture. Using techniques from mixed-integer optimization, polyhedral theory, and tropical geometry, we provide a mathematical counterbalance to the universal approximation theorems which suggest that a single hidden layer is sufficient for learning tasks. In particular, we investigate whether the class of exactly representable functions strictly increases by adding more layers (with no restrictions on size). We also present upper bounds on the sizes of neural networks required for exact function representation. This is joint work with Amitabh Basu, Marco Di Summa, and Martin Skutella.

Links

seminar

07.08.25 09.10.25

Math Machine Learning seminar MPI MIS + UCLA Math Machine Learning seminar MPI MIS + UCLA

MPI for Mathematics in the Sciences Live Stream

Details anzeigen

Upcoming Events of this Seminar

07.08.25 Efficient compression of neural networks and datasets with Lukas Barth
Donnerstag, 14.08.25 Topological Aspects of Symmetry-Preserving Neural Networks with Jonathan Siegel
Donnerstag, 21.08.25 Empirical Bayes Langevin dynamics in the linear model with Zhou Fan
Donnerstag, 28.08.25 Curvature Tuning: Provable Model Steering From a Single Parameter with Randall Balestriero
Donnerstag, 02.10.25 to be announced with Marcello Carioni
Donnerstag, 09.10.25 to be announced with Baharan Mirzasoleiman