Learning ReLU networks to high uniform accuracy is intractable

Julius Berner (Caltech)

Live Stream

Abstract

Statistical learning theory provides bounds on the necessary number of training samples needed to reach a prescribed accuracy in a learning problem formulated over a given target class. This accuracy is typically measured in terms of a generalization error, that is, an expected value of a given loss function. However, for several applications -- for example in a security-critical context or for problems in the computational sciences -- accuracy in this sense is not sufficient. In such cases, one would like to have guarantees for high accuracy on every input value, that is, with respect to the uniform norm. In this paper we precisely quantify the number of training samples needed for any conceivable training algorithm to guarantee a given uniform accuracy on any learning problem formulated over target classes containing (or consisting of) ReLU neural networks of a prescribed architecture. We prove that, under very general assumptions, the minimal number of training samples for this task scales exponentially both in the depth and the input dimension of the network architecture. As a corollary we conclude that the training of ReLU neural networks to high uniform accuracy is intractable. In a security-critical context this points to the fact that deep learning based systems are prone to being fooled by a possible adversary. We corroborate our theoretical findings by numerical results.

ArXiv / ICLR'23: arxiv.org/abs/2205.13531

openreview.net/forum

Github: github.com/juliusberner/theory2practice

jberner.info

Links

seminar

07.08.25 09.10.25

Math Machine Learning seminar MPI MIS + UCLA Math Machine Learning seminar MPI MIS + UCLA

MPI for Mathematics in the Sciences Live Stream

Details anzeigen

Upcoming Events of this Seminar

07.08.25 Efficient compression of neural networks and datasets with Lukas Barth
Donnerstag, 14.08.25 Topological Aspects of Symmetry-Preserving Neural Networks with Jonathan Siegel
Donnerstag, 21.08.25 Empirical Bayes Langevin dynamics in the linear model with Zhou Fan
Donnerstag, 28.08.25 Curvature Tuning: Provable Model Steering From a Single Parameter with Randall Balestriero
Donnerstag, 02.10.25 to be announced with Marcello Carioni
Donnerstag, 09.10.25 to be announced with Baharan Mirzasoleiman