
Lei Wu - Understanding the implicit bias of SGD: A dynamical stability perspective 

One world theoretical machine learning

Abstract: In deep learning, models are often over-parameterized, which raises the concern that algorithms may pick solutions that generalize poorly. Fortunately, stochastic gradient descent (SGD) always converges to solutions that generalize well, even without any explicit regularization, suggesting that a certain "implicit regularization" is at work. This talk explains this striking phenomenon from a stability perspective. Specifically, we show that a stable minimum of SGD must be flat, as measured by various norms of the local Hessian. Furthermore, these flat minima provably generalize well for two-layer neural networks and diagonal linear networks. In contrast to popular continuous-time analyses, our stability analysis respects the discrete nature of SGD and can explain the effects of finite learning rate and batch size, as well as why SGD often generalizes better than GD.
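
The flatness criterion in the abstract can be made concrete in its simplest form: on a quadratic approximation around a minimum with Hessian H, gradient descent with learning rate eta is linearly stable only if eta * lambda_max(H) <= 2, so training at a large learning rate can only settle at sufficiently flat minima. (The SGD criterion developed in the talk additionally accounts for batch size and gradient noise.) The sketch below is a hypothetical illustration, not code from the talk: it estimates lambda_max by power iteration on PyTorch Hessian-vector products and checks the GD stability condition on a toy quadratic.

import torch

def sharpness(loss_fn, params, n_iters=100):
    # Estimate lambda_max of the Hessian of loss_fn at params (a flat
    # 1-D tensor) via power iteration on Hessian-vector products.
    # At a minimum the Hessian is PSD, so this is the sharpness.
    v = torch.randn_like(params)
    v = v / v.norm()
    for _ in range(n_iters):
        loss = loss_fn(params)
        (grad,) = torch.autograd.grad(loss, params, create_graph=True)
        (hv,) = torch.autograd.grad(grad @ v, params)  # H @ v
        v = hv / hv.norm()
    loss = loss_fn(params)
    (grad,) = torch.autograd.grad(loss, params, create_graph=True)
    (hv,) = torch.autograd.grad(grad @ v, params)
    return (v @ hv).item()  # Rayleigh quotient ~ lambda_max

# Toy quadratic with known curvature: lambda_max = 10.
theta = torch.zeros(2, requires_grad=True)
H = torch.diag(torch.tensor([10.0, 1.0]))
loss_fn = lambda p: 0.5 * p @ H @ p

lam = sharpness(loss_fn, theta)
eta = 0.3
print(f"lambda_max ~ {lam:.2f}; GD stable at eta={eta}: {eta * lam <= 2.0}")

Here eta * lambda_max = 3 > 2, so GD at this learning rate cannot remain at this minimum; it would only be attracted to minima with lambda_max <= 2 / eta, which is the sense in which the learning rate implicitly selects flat solutions.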

Science

Published: 7 Jun 2024
