Talk
Ultra-wide Neural Network and Neural Tangent Kernel
- Simon S. Du (University of Washington)
Abstract
I will present results on the equivalence between over-parameterized neural networks and a new kernel, the Neural Tangent Kernel. This equivalence implies two surprising phenomena: 1) the simple algorithm gradient descent provably finds the global optimum of the highly non-convex empirical risk, and 2) the learned neural network generalizes well despite being highly over-parameterized. I will also present empirical results showing that the Neural Tangent Kernel is a strong predictor.
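The equivalence can be illustrated numerically: for a wide network, the empirical tangent kernel (the inner product of parameter gradients at two inputs) concentrates around a closed-form limit. Below is a minimal sketch, not from the talk itself, assuming a two-layer ReLU network in which only the first-layer weights are trained; in that setting the infinite-width NTK has a known arc-cosine form.

```python
import numpy as np

rng = np.random.default_rng(0)
d, m = 5, 200_000  # input dimension, network width (large to approach the limit)

# Two unit-norm inputs.
x = rng.normal(size=d); x /= np.linalg.norm(x)
z = rng.normal(size=d); z /= np.linalg.norm(z)

# Two-layer ReLU net f(x) = (1/sqrt(m)) * sum_r a_r * relu(w_r . x),
# with random signs a_r fixed and only W trained. Since a_r^2 = 1, the
# empirical NTK entry reduces to
#   H(x, z) = (x . z) * (1/m) * sum_r 1[w_r.x > 0] * 1[w_r.z > 0]
W = rng.normal(size=(m, d))
emp_ntk = (x @ z) * np.mean((W @ x > 0) & (W @ z > 0))

# Closed-form infinite-width NTK for this architecture:
#   H_inf(x, z) = (x . z) * (pi - theta) / (2*pi), theta = angle between x, z
theta = np.arccos(np.clip(x @ z, -1.0, 1.0))
inf_ntk = (x @ z) * (np.pi - theta) / (2 * np.pi)

print(emp_ntk, inf_ntk)  # nearly equal for large m
```

As the width m grows, the empirical kernel's deviation from the closed form shrinks at roughly a 1/sqrt(m) rate, which is the concentration behind the "ultra-wide network behaves like kernel regression" claim.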