Тёмный

ConvNet beats Vision Transformers (ConvNeXt) Paper explained 

Soroush Mehraban
Подписаться 3 тыс.
Просмотров 1,4 тыс.
50% 1

The paper presented at the 2022 Conference on Computer Vision and Pattern Recognition (CVPR) details a newly proposed architecture that adopts the design principles of Swin Transformers but replaces them with convolutions to achieve superior performance. In essence, the authors propose a Convolutional Neural Network (ConvNet) architecture that outperforms Swin Transformers while still following the underlying design principles.
Paper link: arxiv.org/abs/2201.03545
Table of Content:
00:00 Introduction
01:09 Training Techniques
01:40 Data Augmentation
04:27 Label Smoothing
06:39 Changing stage compute ratio
08:11 Changing stem to "Patchify"
09:20 ResNeXt-ify
11:11 Inverted Bottleneck
12:57 Larger Kernel Sizes
15:19 Micro Design
19:39 Making it scalable
19:48 Result
Icon made by Freepik from flaticon.com

Опубликовано:

 

30 июн 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 16   
@nadhembenhadjali9063
@nadhembenhadjali9063 3 месяца назад
Nice explanation ! thank you so much !
@williamashbee
@williamashbee 7 месяцев назад
You're clearly very knowledgeable on these topics. Hopefully your channel blows up. 😊
@soroushmehraban
@soroushmehraban 7 месяцев назад
Thanks for the kind words! Hopefully I’ll post better videos in future.
@buh357
@buh357 4 месяца назад
This modified resent has same architecture as efficientnet. Depth wise convolution, inverted block.
@phattailam9814
@phattailam9814 Год назад
Thank you very much. This is very helpful
@soroushmehraban
@soroushmehraban Год назад
Glad you liked it
@user-ui5dg3nr3r
@user-ui5dg3nr3r Месяц назад
usefull
@az-vv3mg
@az-vv3mg Год назад
Great video thanks.
@soroushmehraban
@soroushmehraban Год назад
Glad you enjoyed!
@duongbinh23
@duongbinh23 Год назад
Love your content
@soroushmehraban
@soroushmehraban Год назад
Thanks!
@suesarnwilainuch8429
@suesarnwilainuch8429 5 месяцев назад
deformable convolution and attention please🔥
@soroushmehraban
@soroushmehraban 5 месяцев назад
I read those papers and prepared slides almost a year ago. I will post a video about them if I couldn't find anything more interesting 🙂
@alihadimoghadam8931
@alihadimoghadam8931 Год назад
🤘❤
@deepsingh274
@deepsingh274 Год назад
Hey your content is very good. Can i connect with you?
@soroushmehraban
@soroushmehraban Год назад
Thanks! Sure my LinkedIn name is same as my channel name.
Далее
Why Does Diffusion Work Better than Auto-Regression?
20:18
When You Get Ran Over By A Car...
00:15
Просмотров 3,7 млн
The last one surprised me! 👀 🎈
00:30
Просмотров 3,8 млн
ConvNeXt: A ConvNet for the 2020s | Paper Explained
40:08
CNN Receptive Field | Deep Learning Animated
10:28
Просмотров 1 тыс.
DINO: Self-Supervised Vision Transformers
21:12
Просмотров 2,2 тыс.
ConvNeXt: A ConvNet for the 2020s
11:19
Просмотров 5 тыс.
When You Get Ran Over By A Car...
00:15
Просмотров 3,7 млн