Since these videos take an enormous amount of time (this one took about 300 hours), would you also like to see paper explanations in the style of Yannic Kilcher (www.youtube.com/@YannicKilcher)? I could cover papers quickly after they are released and also cover topics I wouldn't make an animated video for. Let me know what you think :)
Sure! But I would prefer a deep dive once in a while over many quick paper explanations. There aren't many (video) resources on diffusion that go into such depth. So this is really great, thanks a lot for making the video!
Wow! I did not expect this video to go this deep. But this is awesome! Please make more in-depth explanations like this. It's clear a lot of hard work went into it, and the animation is sooo elegant.
I absolutely love how you started from scratch, i.e. from what the underlying PDF was. I'm working on a project on diffusion models without knowing anything about them, and every resource I found was catered towards people with prerequisites I don't have yet, until this one. I haven't watched the whole thing yet, but I'm going to keep coming back until I understand everything in this video. Cheers mate!
Thank you for your work! I have started learning about diffusion models and found the idea more complex than the ideas behind VAEs and GANs. The people who manage to explain these complex concepts to others are very impressive!
Amazing video, thank you. I learned most of it a year ago at university, but this was a great refresher that also gave me new insights into some of the material. I really liked the conclusion of the Denoising Score Matching part, very beautiful.
Regarding your pinned comment: no offense to Yannic, but your explanations are 10x better. You actually understand the topics you cover, and you explain not only what is going on but also why. That, plus the fact that you go into the mathematical derivations, is really appreciated. Don't worry about quantity; it's easy to read a paper and put out surface-level explanations for more views, but what you're doing is more valuable. Your videos are a treasure for amateur deep-learning hobbyists like me who want to dig deeper into this field.
The mathematical derivation and explanation is such a lifesaver. I never really understood the underlying meaning when reading the diffusion-model papers, but now everything clicked. Thank you so much for the videos, really enjoyed it. Please make more videos like this. Liked and subscribed : ).
What an amazing video! I did not expect it to contain the derivations I have personally struggled to find. If it's not too much to ask, could you do a PyTorch implementation of VP-SDE or SDE-based DDPM/DDIM? Your previous video on DDPM in PyTorch was extremely useful, and I would appreciate a similar video for this if possible. Finally, I love the work you put into this. This channel is a gem for AI enthusiasts.
There was another sign mistake, which cancels this one out. He got a sign wrong after integrating by parts: after that step the sign should have flipped, so it should be plus instead of minus.
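For readers tracing where the sign flips: in the standard score-matching derivation (written in 1D for brevity, with model score $s_\theta$), the integration-by-parts step looks as follows, under the usual assumption that $p(x)\,s_\theta(x) \to 0$ as $|x| \to \infty$:

$$
\int p(x)\, s_\theta(x)\, \frac{d}{dx}\log p(x)\, dx
= \int s_\theta(x)\, p'(x)\, dx
= \Big[\, s_\theta(x)\, p(x) \,\Big]_{-\infty}^{\infty} - \int p(x)\, s_\theta'(x)\, dx
= -\,\mathbb{E}_{p(x)}\!\big[\, s_\theta'(x) \,\big].
$$

The minus sign appears exactly at the integration-by-parts step, which is where the two sign errors mentioned above can cancel.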
32:38 To correct myself here: the paper does explain how to derive the sampler. I just find that approach much harder to understand, and papers generally don't go into much detail in their derivations.
Thanks for your hard work! Amazing explanation! I just want to check the squared equation at 5:55. Can you explain why $\mathbb{E}[p(x)] = \int p(x)\, dx$? I feel like the equation is missing something...
@@swaystar1235 Unfortunately, even doing Würstchen-style video models is still super expensive, and there are many things you have to solve first outside the model :/
Calling it ∇s stretches terminology a bit, right, given that s is a gradient vector field itself? Cool effort, thanks for going through all the manipulations. As for the red thread of the video: I'm not fully sure why you spend 10 minutes on the E[s^2]+... term, but then it doesn't really show up anymore in the denoising approach you explain afterwards. Last note: unlike Lagrang-ian dynamics, Langevin dynamics is not Langev-ian dynamics. But I think Langevin is still on the easier side to pronounce - don't be afraid.
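Since the comment above brings up Langevin dynamics: a minimal sketch of the unadjusted Langevin update the video discusses, x ← x + ε·score(x) + √(2ε)·z with z ~ N(0, 1). The target, step size, and chain lengths here are illustrative assumptions; the score of a standard normal is known in closed form, d/dx log p(x) = −x, so we can check that the chains actually end up distributed like N(0, 1).

```python
import math
import random

def langevin_sample(score, n_particles=5000, n_steps=500, eps=0.01, seed=0):
    """Run n_particles independent unadjusted Langevin chains for n_steps."""
    rng = random.Random(seed)
    xs = [0.0] * n_particles  # all chains start at 0
    for _ in range(n_steps):
        # Langevin update: drift along the score plus Gaussian noise
        xs = [x + eps * score(x) + math.sqrt(2 * eps) * rng.gauss(0, 1)
              for x in xs]
    return xs

# Score of N(0, 1) is -x; after enough steps the samples should look standard normal.
samples = langevin_sample(score=lambda x: -x)
mean = sum(samples) / len(samples)
var = sum((x - mean) ** 2 for x in samples) / len(samples)
```

With a small ε the stationary distribution is only slightly inflated relative to the target (variance ≈ 1/(1 − ε/2)), which is why the Metropolis-adjusted variant exists; for intuition the unadjusted version is enough.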
I know it's a short video, but some of the syntax may be confusing, e.g. the subscript p(x) on the \mathbb{E} (timestamp 8:17). In a financial context we often use things such as \mathbb{E}_t [ h(X_T) ], the conditional expectation of h(X_T), where X is a stochastic process generating a filtration, so it equals \mathbb{E} [ h(X_T) | \mathcal{F}_t ]. I know it's a totally different domain, but notation like this can be dripping with meaning. So: what is the _meaning_ of the subscript p(x), and what is the _meaning_ of the double bar ( ||\cdot||_2^2 ) inside the expectation? Is that the L2 norm?
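For comparison with the filtration notation above, the usual reading of this notation in the score-matching literature (which I believe is what the video intends) is that the subscript names the distribution the expectation is taken over, and the double bars are the squared Euclidean (L2) norm:

$$
\mathbb{E}_{p(x)}\big[ f(x) \big] = \int f(x)\, p(x)\, dx,
\qquad
\lVert v \rVert_2^2 = \sum_i v_i^2 .
$$

So the subscript is not a conditioning time here; it just says which density x is sampled from.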
If you want videos with quicker production, maybe you could use a whiteboard-style setup and write everything out, so you can still explain things intuitively but more quickly.
@@outliier To ask more clearly: have you been working on the basics of score matching and diffusion models for the last year? I assume that since you are using diffusion models at Luma, you have also studied more advanced topics on the subject.