
The U-Net (actually) explained in 10 minutes 

rupert ai
6K subscribers
79K views

Want to understand the AI model actually behind Harry Potter by Balenciaga or the infamous image of the Pope in the puffer jacket? Well, diffusion frameworks such as DALL-E 2, Midjourney, Imagen or Stable Diffusion seem to get a lot of credit, whereas the true unsung hero of the story is the underlying U-Net architecture that they all use under the hood. Don't get me wrong, diffusion models are awesome, but the U-Net is an absolute STAPLE when it comes to computer vision, and this video aims to break it down in an easy way. Originally used for image segmentation, the U-Net has developed into so much more. Happy watching!
U-Net paper: arxiv.org/abs/1505.04597
Many thanks to numerous online resources that helped me create this video.

Science

Published: 4 May 2023

Comments: 97
@salmanzafarsatti1346 9 months ago
Man, this video is such a great explainer. I was confused about the use of skip connections for a long time, but he explained the intuition behind it very nicely.
@Anton_Sh. 8 months ago
This architecture is one of the truly brilliant ones in the world of deep learning in terms of its simplicity and efficiency.
@mayankukani9600 1 year ago
Why didn't I find your channel before? Please upload more content, the best content on Deep Learning I have seen.
@rupert_ai 11 months ago
Thanks a lot :)
@rippingmyheartwassoeasy 4 months ago
Thank you for creating this video! It's the best explanation of how a U-Net works that was easy to understand. The visual animation is superbly done!!
@Natstranaut 9 months ago
Oh my god man. Awesome videos. Keep it up, I'm really enjoying them!
@pushkar9021 9 months ago
Continue this series, very helpful
@thebakareview8009 2 months ago
This channel deserves more subss!! Great content and delivery :)
@mridulsehgal7773 26 days ago
The best ever video you can get on U-Net explanation
@user-kv2pi9mf8r 6 months ago
Extremely useful for beginners like me. This is very good
@jacobidoko3924 4 months ago
Yooo...this is quality content right here. Thank you so much for putting this out
@jayhu2296 2 months ago
your explained under 10 minutes videos are goated
@jsparger 10 months ago
This was extremely helpful. Thank you
@LucaBovelli 21 days ago
dude thankssssss i thought this was another one of these things thatll take me 2 hours of youtube to *not* understand, but u saved me
@JohnZakaria 4 months ago
This was the best unet explanation I have ever seen
@ubanaga 4 months ago
Very nice my friend, this has been most helpful
@transcendingvictor 3 months ago
Thank you very much for the time put into making this video. Interesting and helpful :)
@shubhamarle96 1 month ago
thanks for the video, I am trying to use U-net for anomaly detection in time series and your video gave me the idea.
@user-ux4st6hh2d 6 months ago
Woooooow! Finally I understood it, really great explanation, thank you
@hexeldev 6 months ago
This video has been extremely useful. I subbed.
@sakethsreeram6981 2 months ago
Great presentation! Easy to understand
@TheHopeOfTruth 2 months ago
Thank you for the great explanation. On a basic level it helps to better understand U-Net
@niralpatel5889 1 month ago
This was great, would love a video on diffusion transformers! It looks like they are taking off and replacing U-Nets as the backbone of new diffusion models.
@aligreen786 5 months ago
Very nice explanation. Thanks a lot.
@xarisalkiviadis2162 2 months ago
Amazing video, cleared everything!
@pratyushsahoo4948 2 months ago
Absolutely amazing work 🎉
@miguelxplayer9641 2 months ago
Dude, you're great. I'm from Portugal 🇵🇹 🟩🟨🟥🟥 and I'm learning Machine Learning and Neural Networks. Thank you very much! I loved how you teach. You are intuitive and dynamic. A person is learning a difficult subject and still manages to laugh when watching the videos. I loved it. I already subscribed and liked. I'm going to watch more of your videos now. Hugs from Portugal 😉
@puekai 3 months ago
Still don't know how it works
@vardhan254 6 days ago
me when reading goodfellow all night
@gokulsaisrinivas5312 5 months ago
very good explanation of U-NET
@mincasurong 13 days ago
Great summary, Great thanks
@VikashSingh-vd9cp 1 day ago
Best video for understanding the U-Net model
@coffeestudi0s 7 months ago
Yooo the effort haha. Amazing Video!!!
@willlowtree 8 months ago
i love your presentation style
@ozzafar1982 18 days ago
great explanation thanks!
@nagham96 7 months ago
Thank you that was so helpful and cute! 🤩
@nikhilchouhan1802 1 month ago
You might not find my comment since the video is too old, but man I just want to thank you for this video. I am a student who has always been interested in computer graphics and related fields like game engines, physical rendering, ray tracing, etc., and just didn't get the ML/AI hype everyone was on the past 2 years. I only ever managed to study ML basics for 2 weeks before I left it for good. But recently I got in a team where my friends were working on CNN-based projects, and that made me learn many basics about NNs and DL. This explanation of U-Net seals the deal for me, and I will strive to work on integrating my two interests into one and hopefully create something I love.
@s4lome792 26 days ago
Clearly explained. What caused my confusion in the first place is: in the graphic in the original paper, why does the segmentation mask not have the same dimensionality as the input image?
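On the size question above: in the original paper the 3x3 convolutions are unpadded, so every convolution trims one pixel from each border, and the output mask ends up smaller than the input (572x572 in, 388x388 out in the paper's figure). Below is a rough sketch of that arithmetic in plain Python, assuming four down/up levels, two convolutions per level, 2x2 pooling and 2x upsampling; the function name `unet_output_size` is made up for illustration.

```python
def unet_output_size(input_size: int, depth: int = 4) -> int:
    """Trace the spatial size through a U-Net built from unpadded 3x3 convs."""
    size = input_size
    # Contracting path: two unpadded 3x3 convs (each removes 2 pixels),
    # followed by a 2x2 max pool that halves the size.
    for _ in range(depth):
        size -= 4
        if size % 2:
            raise ValueError("size must be even before 2x2 pooling")
        size //= 2
    # Bottleneck: two more unpadded 3x3 convs.
    size -= 4
    # Expanding path: 2x upsampling, then two unpadded 3x3 convs.
    for _ in range(depth):
        size = size * 2 - 4
    return size


print(unet_output_size(572))  # 388
```

Many later implementations pad the convolutions instead, in which case the mask and the input have the same size.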
@amolkumar1538 9 months ago
This is Just awesome, great video
@Topninja6 1 month ago
Thank you so much. Now I just need to figure out how to implement this for my project lol
@LautaHillkirk 1 month ago
nice video, very helpful
@r.walid2323 1 month ago
thanks, good explanation
@user-ef7je7yw7r 1 month ago
wow awesome video and explanation
@TechHuntBD 20 days ago
Nice explanation
@_the_one_who_asked_ 6 months ago
Hi, thank u for this video. can u pls do a video to explain YOLO?
@usaid3569 29 days ago
Great video champ
@vijaykumarb9622 4 months ago
Great Explanation.
@ny8828 8 months ago
hi its very helpful, how can I reach the PowerPoint of it?
@gregorioosorio16687 8 months ago
Thanks for sharing!
@dfparker2002 5 months ago
This explains inference (I think) by decomposition (dividing) and recomposition (adding) images. Is that accurate?
@alirezasaberi6383 11 months ago
Awesome! Can you also make a similar (actually) video for Unet++ and Unet3+ please??? Thank you so much.
@rupert_ai 11 months ago
Glad you liked it! It's not currently on my list of to-do videos as I like to cover the most popular fundamentals at the moment, but I'll let you know if I get around to it! :)
@kiraqueenyt5161 5 months ago
such a well made video
@BooleanDisorder 3 months ago
What's the background music called in this video?
@JohnVinchi-bk2dw 9 months ago
this is extreeeemely helpful,and funny
@rupert_ai 9 months ago
Thanks John!
@atifadib 17 days ago
If you want to just use the Decoder how would you do it?
@PAHADIBABAJI 4 months ago
Very helpful
@Grapemaid 9 months ago
Thanks a lot lot. I understand it!
@Ngochi-ff7hk 2 months ago
I still don't understand whether the output is x2 or x3 or x4. I don't understand why that is the case?
@sisami2109 7 months ago
very nice dude thank you so much
@Nerthexx 8 months ago
If downsampling works by max-pooling, how does upsampling work? In traditional image processing, we would just interpolate image colors, but how does the network apply its "convolution" in this process? I would understand "deconvolution", but in my mind it wouldn't work here.
@AyushGupta-fv1lx 1 month ago
Maybe transposed convolution
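To make the transposed-convolution idea in this thread concrete: each input pixel is multiplied by the kernel and "stamped" onto a larger output grid, with overlapping stamps summed. The original paper calls its upsampling step an "up-convolution", and many implementations use a transposed convolution for it. The sketch below is a minimal single-channel version in plain Python, assuming stride 2; `conv_transpose2d` is a hypothetical helper written for illustration, not a library call, and in a real network the kernel weights would be learned.

```python
def conv_transpose2d(x, kernel, stride=2):
    """Single-channel transposed convolution on nested lists (no padding)."""
    h, w = len(x), len(x[0])
    kh, kw = len(kernel), len(kernel[0])
    out_h = (h - 1) * stride + kh
    out_w = (w - 1) * stride + kw
    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(h):
        for j in range(w):
            # Stamp a scaled copy of the kernel at the strided location;
            # overlapping stamps (when stride < kernel size) are summed.
            for a in range(kh):
                for b in range(kw):
                    out[i * stride + a][j * stride + b] += x[i][j] * kernel[a][b]
    return out


small = [[1, 2],
         [3, 4]]
ones = [[1, 1],
        [1, 1]]
for row in conv_transpose2d(small, ones):
    print(row)
```

With an all-ones 2x2 kernel and stride 2 this reduces to nearest-neighbour upsampling (each pixel becomes a 2x2 block); other kernel values give other upsampling behaviour, which is what lets the network learn it.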
@ingenuity8886 1 month ago
Thank you very much bro...
@user-xm1zy3pj5k 3 months ago
Hi. I find the video very interesting. As I'm at the beginning, I'm a little confused. Please, can you also provide a PDF file? Thank you. Nicely
@ajipboy 2 months ago
bro , immediate subscribe!
@poggiesgw 8 months ago
good stuff
@Englishwithshima1993 5 months ago
Perfect
@abhishekkanojia2816 10 months ago
cool videos
@notrito 1 month ago
If anyone wonders how to concatenate the features if they don't match the size... they crop it.
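The cropping mentioned above can be sketched as a center crop of the encoder feature map to the decoder's spatial size, followed by concatenation along the channel axis. This is a minimal plain-Python illustration on nested lists laid out as [channel][row][column]; the helper names `center_crop` and `skip_concat` are assumptions made for the example, not taken from the video or the paper's code.

```python
def center_crop(feature, target_h, target_w):
    """Crop every channel of a [C][H][W] nested list around its center."""
    h, w = len(feature[0]), len(feature[0][0])
    top = (h - target_h) // 2
    left = (w - target_w) // 2
    return [
        [row[left:left + target_w] for row in channel[top:top + target_h]]
        for channel in feature
    ]


def skip_concat(encoder_feat, decoder_feat):
    """Crop the encoder map to the decoder's size, then stack the channels."""
    target_h, target_w = len(decoder_feat[0]), len(decoder_feat[0][0])
    return center_crop(encoder_feat, target_h, target_w) + decoder_feat


# One 4x4 encoder channel holding 0..15 and one 2x2 decoder channel.
encoder = [[[r * 4 + c for c in range(4)] for r in range(4)]]
decoder = [[[100, 101], [102, 103]]]
merged = skip_concat(encoder, decoder)
print(len(merged))   # 2
print(merged[0])     # [[5, 6], [9, 10]]
```

Implementations that pad their convolutions keep encoder and decoder maps the same size, so the crop becomes a no-op there.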
@yyww4267 9 months ago
Really impressive video! And fun work at the end!!!!! LOVE LOVE LOVE!!!
@rupert_ai 9 months ago
Thank you very much! :)
@1.4142 8 months ago
Dalle 3 is coming to gpt 4 and it can write text!
@MrMadmaggot 1 year ago
Now how did they code it?
@rupert_ai 11 months ago
Hahaha well there are actually plenty of online code implementations available but I will see if I can get round to a code tutorial on the u-net sooner rather than later!
@rishabhbhardwajiitb178 5 months ago
@rupert_ai can u provide one
@timanb2491 7 months ago
goodgood
@007bindass007 6 months ago
Nice Comment: Useful 👍👍😎😎
@LucaBovelli 21 days ago
bro why did u stop making videos i need you lmao (its a painful lmao.)
@linamallek6900 2 months ago
nice video, but ideo i hate the music in the background ( so disturbing )
@luisluiscunha 7 days ago
You are very funny!
@jaybrodnax 20 days ago
I feel like this is more a description to experts than an actual explanation of how and why it works. Questions I'm left with: What is the purpose of downsampling/upsampling (I'm guessing performance?) How is segmentation actually done by the u-net? How is feature extraction actually done? What are max pooling layers? What does "channel doubling" mean, and what does it achieve? How does the encoder know "these are the pixels where the bike is"? Why is it beneficial to connect the encoder features to the decoder features at each step, versus in the last step? How does unet achieve anything other than downscaling/upscaling performance efficiency? Where are the actual operations to derive features? How is u-net specifically applied for various use cases like diffusion? What does diffusion add or change, for example.
@abansalah4677 20 days ago
(Disclaimer: I am a beginner, and this is not intended to be a complete answer.) You should read about convolutional layers and pooling layers to better understand this video. At any rate: A colored image has three channels: R, G, and B. A convolutional layer is specified by some spatial parameters (stride, kernel size, padding) and how many filters there are - the number of filters is the number of channels of the output. You can think of each filter as trying to capture different information. Doubling the channels, therefore, means using double the number of filters when using a stride of 2. The segmentation is done just like any ML task - the training data consists of pairs of images and their annotated versions. I think it's often hard to decipher the inner workings of a particular neural network, and your question can/should be asked in a more general way - how do neural networks learn?
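The point about filters and channels in the reply above can be seen in a toy example: a convolution with N filters produces N output channels, and a 2x2 max pool then keeps the largest value in each 2x2 window, halving the height and width. In the original U-Net the downsampling is done by 2x2 max pooling with stride 2, and the number of feature channels is doubled at each downsampling step. This is only a sketch in plain Python under simplifying assumptions (unpadded, stride-1 convolution, no bias or activation); `conv2d` and `max_pool2x2` are illustrative helpers, not library functions.

```python
def conv2d(image, filters):
    """image: [C_in][H][W]; filters: [C_out][C_in][k][k]. Unpadded, stride 1."""
    c_in, h, w = len(image), len(image[0]), len(image[0][0])
    k = len(filters[0][0])
    output = []
    for f in filters:  # one output channel per filter
        fmap = [
            [
                sum(
                    image[c][i + a][j + b] * f[c][a][b]
                    for c in range(c_in)
                    for a in range(k)
                    for b in range(k)
                )
                for j in range(w - k + 1)
            ]
            for i in range(h - k + 1)
        ]
        output.append(fmap)
    return output


def max_pool2x2(fmaps):
    """Keep the maximum of every non-overlapping 2x2 window in each channel."""
    return [
        [
            [
                max(ch[i][j], ch[i][j + 1], ch[i + 1][j], ch[i + 1][j + 1])
                for j in range(0, len(ch[0]) - 1, 2)
            ]
            for i in range(0, len(ch) - 1, 2)
        ]
        for ch in fmaps
    ]


# A 3-channel (RGB-like) 4x4 image of ones and two all-ones 3x3 filters.
rgb = [[[1] * 4 for _ in range(4)] for _ in range(3)]
two_filters = [[[[1] * 3 for _ in range(3)] for _ in range(3)] for _ in range(2)]
features = conv2d(rgb, two_filters)
print(len(features), len(features[0]), len(features[0][0]))  # 2 2 2
print(max_pool2x2(features))  # [[[27]], [[27]]]
```

Here two filters turn a 3-channel input into a 2-channel feature map; using twice as many filters in the next block is all that "doubling the channels" means.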
@user-mn2bj1hw1vdtfhgh 1 month ago
Me seeing the video at 1.5x 😂😅
@MacProUser99876 3 months ago
nice explanation. but why distracting background music?
@endlesshybrids 1 month ago
Agreed. Good explanation but I wish people would stop using background music.
@leoyu6400 7 months ago
hope you can come back to life
@c.e1187 6 months ago
Is he dead?
@BooleanDisorder 5 months ago
@c.e1187 nah, just busy I imagine. He was active on github in December so
@truck.-kun. 4 months ago
@c.e1187 maybe yes. Only on RU-vid
@jonathangallagher3116 1 month ago
TIGHT TIGHT TIGHT
@jcpouce 3 months ago
music is too distracting... :(
@alteshaus3149 3 months ago
no
@SarraAissaoui-sp3sm 1 month ago
I clicked on thumb down for wasting one minute of my precious time in the intro. Get to the F point !!
@websterfenoff8936 1 year ago
Promo_SM ✅