Research Spotlights

Guido Montúfar — Implicit Bias in Wide Neural Networks

Published Jul 19, 2021

We investigate gradient descent training of overparametrized neural networks with rectified linear units and the corresponding implicit bias in function space. For 1D mean squared error regression, the solution found by gradient descent is a function that interpolates the training data and has a small spatially weighted L2-norm of the second derivative relative to the function at initialization. The curvature penalty function is expressed in terms of the probability distribution used to initialize the network parameters, and we compute it explicitly for various common initialization procedures. Based on these results, the training trajectories can be described in function space as trajectories of spatially adaptive smoothing splines with decreasing regularization strength. The results generalize to multivariate regression and to different activation functions. This is joint work with Hui Jin.
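In symbols, and purely as a schematic reading of the abstract (the notation below is ours, not lifted from the paper): write f_0 for the network function at initialization, (x_i, y_i) for the training data, and rho for the curvature penalty function mentioned above. The interpolant selected by gradient descent then approximately solves

\[
\min_{f}\ \int \rho(x)\,\bigl(f''(x)-f_0''(x)\bigr)^{2}\,dx
\quad\text{subject to}\quad f(x_i)=y_i \ \text{for all } i,
\]

and along the way the function at training time t roughly tracks a spatially adaptive smoothing spline,

\[
f_t \;\approx\; \operatorname*{arg\,min}_{f}\ \sum_{i}\bigl(f(x_i)-y_i\bigr)^{2}
\;+\; \lambda(t)\int \rho(x)\,\bigl(f''(x)-f_0''(x)\bigr)^{2}\,dx,
\]

with a regularization strength lambda(t) that decreases as training proceeds; the paper computes rho explicitly for common initialization schemes.

The setting is also simple enough to simulate directly. Below is a minimal sketch (our own illustration, not code from the paper) of full-batch gradient descent on a wide two-layer ReLU network for 1D mean squared error regression; the width, step size, data, and initialization scheme are assumptions chosen for the demo.

```python
import numpy as np

# Minimal sketch (our own illustration, not code from the paper): full-batch
# gradient descent on a wide two-layer ReLU network for 1D mean squared
# error regression. Width, step size, data, and the initialization scheme
# below are all assumptions chosen for the demo.

rng = np.random.default_rng(0)
m = 1000                                    # hidden width ("wide" regime)

x = np.array([-0.9, -0.4, 0.0, 0.3, 0.8])  # 1D training inputs
y = np.array([0.6, -0.3, 0.2, 0.5, -0.4])  # targets

# One common scheme: Gaussian input weights, uniform biases,
# output weights scaled by 1/sqrt(m).
w = rng.normal(0.0, 1.0, m)
b = rng.uniform(-1.0, 1.0, m)
v = rng.normal(0.0, 1.0, m) / np.sqrt(m)

def net(x_pts, w, b, v):
    """f(x) = sum_j v_j * relu(w_j * x + b_j)."""
    return np.maximum(np.outer(x_pts, w) + b, 0.0) @ v

grid = np.linspace(-1.0, 1.0, 201)
f_init = net(grid, w, b, v)                 # function at initialization

lr, steps = 0.1, 10000
for _ in range(steps):
    z = np.outer(x, w) + b                  # pre-activations, shape (n, m)
    h = np.maximum(z, 0.0)                  # ReLU activations
    r = h @ v - y                           # residuals of the squared loss
    act = (z > 0.0).astype(float)           # ReLU derivative
    # Gradients of 0.5 * mean(r**2) with respect to each parameter group.
    gv = h.T @ r / len(x)
    gw = (r[:, None] * act * v * x[:, None]).sum(axis=0) / len(x)
    gb = (r[:, None] * act * v).sum(axis=0) / len(x)
    v, w, b = v - lr * gv, w - lr * gw, b - lr * gb

f_final = net(grid, w, b, v)
# f_final (nearly) interpolates the data; per the abstract, the displacement
# f_final - f_init behaves like a spatially weighted smoothing spline.
print("residuals:", np.round(net(x, w, b, v) - y, 4))
```

Plotting f_final against f_init on the dense grid is one way to eyeball the spline-like displacement the abstract describes.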
