Marius Zeinhofer - Error Analysis and Optimization Methods for Scientific Machine Learning

Mufan Li - Infinite-Depth Neural Networks as Depthwise Stochastic Processes

Внезапный поворот. Кремль заговорил о победе

Вот какое средство помогает им не болеть #shorts

SCRUB: SpaceX Attempt One - Starship Flight Test

Я решил купить САМУЮ МОЩНУЮ ТАЧКУ! - Миссия НЕВЫПОЛНИМА!

Aditya Varre - On the spectral bias of two-layer linear networks

One world theoretical machine learning

Подписаться 1,9 тыс.

Просмотров 192

50% 1

Видео Поделиться Скачать Добавить в

Abstract: In this talk, we analyze the behaviour of two-layer fully connected networks with linear activations trained with gradient flow on the square loss. We show how the optimization process carries an implicit bias on the parameters that depends on the scale of its initialization. The main result of the paper is a variational characterization of the loss minimizers retrieved by the gradient flow for a specific initialization shape. This characterization reveals that, in the small-scale initialization regime, the linear neural network's hidden layer is biased toward having a low-rank structure. To complement our results, we showcase a hidden mirror flow that tracks the dynamics of the singular values of the weights matrices and describe their time evolution. Towards the end, we discuss the implications for stochastic gradient descent and show some empirical evidence beyond linear networks.

Опубликовано:

26 мар 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 2

@oncedidactic 3 месяца назад

great talk thanks! very interested in more about the effect of discretization on SGD

@javier2luna 3 месяца назад

you says two hidden layers in a neural network, right?

Далее

Marius Zeinhofer - Error Analysis and Optimization Methods for Scientific Machine Learning

55:37

Marius Zeinhofer - Error Analysis and Optimization Methods for Scientific Machine Learning

Просмотров 259

Mufan Li - Infinite-Depth Neural Networks as Depthwise Stochastic Processes

44:50

Mufan Li - Infinite-Depth Neural Networks as Depthwise Stochastic Processes

Просмотров 908

Внезапный поворот. Кремль заговорил о победе

13:24

Внезапный поворот. Кремль заговорил о победе

Просмотров 227 тыс.

Вот какое средство помогает им не болеть #shorts

0:35

Вот какое средство помогает им не болеть #shorts

Просмотров 309 тыс.

SCRUB: SpaceX Attempt One - Starship Flight Test

9:9:58

SCRUB: SpaceX Attempt One - Starship Flight Test

Просмотров 3,4 млн

Я решил купить САМУЮ МОЩНУЮ ТАЧКУ! - Миссия НЕВЫПОЛНИМА!

35:22

Я решил купить САМУЮ МОЩНУЮ ТАЧКУ! - Миссия НЕВЫПОЛНИМА!

Просмотров 728 тыс.

Hemant Tyagi - Dynamic ranking and translation synchronization

1:00:05

Hemant Tyagi - Dynamic ranking and translation synchronization

Просмотров 59

Sebastian Goldt - Gaussian world is not enough: Analysing neural nets beyond Gaussian models of data

53:25

Sebastian Goldt - Gaussian world is not enough: Analysing neural nets beyond Gaussian models of data

Просмотров 213

Effect of Leaky ReLUs on the training & generalization of overparameterized networks - Yinglong Guo

53:56

Effect of Leaky ReLUs on the training & generalization of overparameterized networks - Yinglong Guo

Просмотров 111

Nicolas Boulle - Elliptic PDE learning is provably data-efficient

46:41

Nicolas Boulle - Elliptic PDE learning is provably data-efficient

Просмотров 195

Tan Nguyen - Transformers Meet Image Denoising: Mitigating Over-smoothing in Transformers

39:07

Tan Nguyen - Transformers Meet Image Denoising: Mitigating Over-smoothing in Transformers

Просмотров 263

General Science Quiz - How Many Can You Answer?

26:55

General Science Quiz - How Many Can You Answer?

Просмотров 740 тыс.

Lisa Kreusser - Unveiling the role of the Wasserstein Distance in Generative Modelling

51:17

Lisa Kreusser - Unveiling the role of the Wasserstein Distance in Generative Modelling

Просмотров 330

Ting Lin - Universal Approximation and Expressive Power of Deep Neural Networks

49:20

Ting Lin - Universal Approximation and Expressive Power of Deep Neural Networks

Просмотров 215

SQLite: How it works, by Richard Hipp

1:39:27

SQLite: How it works, by Richard Hipp

Просмотров 2,9 тыс.

Amnon Geifman,Meirav Galun,David Jacobs,Ronen Basri - Spectral Bias of Convolutional Neural Tangent

12:46

Amnon Geifman,Meirav Galun,David Jacobs,Ronen Basri - Spectral Bias of Convolutional Neural Tangent

Просмотров 103

🔥 Лютая вещь для геймеров Да и вообще для тех кто проводит время за компом 💻

0:20

🔥 Лютая вещь для геймеров Да и вообще для тех кто проводит время за компом 💻

Просмотров 4,4 млн

Телефон-електрошокер

0:43

Телефон-електрошокер

Просмотров 390 тыс.

Сколько реально стоит ПК Величайшего?

0:37

Сколько реально стоит ПК Величайшего?

Просмотров 1,6 млн

Не Бери INFINIX NOTE 40 Pro + 5g, Не Посмотрев Это Видео!

12:25

Не Бери INFINIX NOTE 40 Pro + 5g, Не Посмотрев Это Видео!

Просмотров 46 тыс.

ХУДШАЯ ВИДЕОКАРТА ДЛЯ ПОКУПКИ!?🤬

17:22

ХУДШАЯ ВИДЕОКАРТА ДЛЯ ПОКУПКИ!?🤬

Просмотров 76 тыс.

Мощное УСИЛЕНИЕ СВЯЗИ и ИНТЕРНЕТА НА СМАРТФОНЕ Android 👉 КАК УСИЛИТЬ ИНТЕРНЕТ СИГНАЛ на Android ✔

9:08

Мощное УСИЛЕНИЕ СВЯЗИ и ИНТЕРНЕТА НА СМАРТФОНЕ Android 👉 КАК УСИЛИТЬ ИНТЕРНЕТ СИГНАЛ на Android ✔

Просмотров 307 тыс.

5 причин взять именно MAC! #пк #игры #гейминг #сборкапк #игровойпк #pc #mac #apple

0:58

5 причин взять именно MAC! #пк #игры #гейминг #сборкапк #игровойпк #pc #mac #apple

Просмотров 479 тыс.