ELLIS Reading Group on Mathematics of Deep Learning

Meetings

Jul 22, 2025
How Modern Optimization Techniques Balance Learning in Deep Neural Networks
Jun 24, 2025
Understanding Mode Connectivity via Parameter Space Symmetry
May 22, 2025
Do Deep Neural Network Solutions Form a Star Domain?
Dec 5, 2024
Modular Duality in Deep Learning
Oct 8, 2024
Approaching Deep Learning through the Spectral Dynamics of Weights
May 30, 2024
The importance of discretisation drift in deep learning
Mar 12, 2024
Information-theoretic generalization bounds for black-box learning algorithms
Feb 13, 2024
Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
Jan 9, 2024
Predicting grokking long before it happens
Dec 19, 2023
The geometry of neural nets' parameter spaces under reparametrization
Oct 18, 2023
Can Neural Network Memorization Be Localized?
Sep 20, 2023
DINO v1 and v2: Self-Supervised Vision Transformers
Aug 23, 2023
SGD with Large Step Sizes Learns Sparse Features
Jul 12, 2023
Bottleneck structure in large depth networks
Jun 20, 2023
Loss Landscapes are All You Need: Neural Network Generalization Can Be Explained Without the Implicit Bias of Gradient Descent
May 24, 2023
Understanding edge of stability
Apr 26, 2023
Lottery ticket hypothesis and its current state
Feb 3, 2023
Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks
Jan 13, 2023
A Loss Curvature Perspective on Training Instability in Deep Learning
Dec 9, 2022
When Are Solutions Connected in Deep Networks?
Nov 18, 2022
From Gradient Flow on Population Loss to Learning with Stochastic Gradient Descent
Oct 28, 2022
Towards Understanding Sharpness-Aware Minimization
Jun 21, 2022
When Do Neural Networks Outperform Kernel Methods?
May 24, 2022
SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs
Apr 26, 2022
Does the Data Induce Capacity Control in Deep Learning?
Mar 29, 2022
Deep Ensembles: A Loss Landscape Perspective
Mar 8, 2022
Taxonomizing local versus global structure in neural network loss landscapes
Feb 22, 2022
The Effects of Mild Over-parameterization on the Optimization Landscape of Shallow ReLU Neural Networks
Feb 8, 2022
The Geometry of Neural Network Landscapes: Symmetry-Induced Saddles & Global Minima Manifold
Jan 18, 2022
Exploring Generalization in Deep Learning