Тёмный
No video :(

06 - Latent Variable Energy Based Models (LV-EBMs), training 

Alfredo Canziani
Подписаться 39 тыс.
Просмотров 10 тыс.
50% 1

Опубликовано:

 

28 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 59   
@hamedrashidi2199
@hamedrashidi2199 3 года назад
Cant watch enough of your videos man. Checking youtube for a new upload everyday. Nice work. Keep it up. Wish we had more men like you👌👌👌
@alfcnz
@alfcnz 3 года назад
🥰🥰🥰
@kshitizmalhotra1394
@kshitizmalhotra1394 3 года назад
This is GOLD!! The DL course in my university is completely different from this. This course just gives you a new perspective. Thanks to you Alfredo and whoever else was responsible for making this public :)
@alfcnz
@alfcnz 3 года назад
💛💛💛
@gnorts_mr_alien
@gnorts_mr_alien Год назад
I'm an outsider to ML, CS and even maths and thought I'd never understand this (EBM) yet decided to persist. Glad I did, thanks to your work I feel I'm about 60% there. Just have to play with the math myself but my intuition is solid, which is what matters to me. You're a legend.
@alfcnz
@alfcnz Год назад
🥳🥳🥳
@anondoggo
@anondoggo 2 года назад
Watching this for the second time I finally understood the inference step of LVM. thank you!
@alfcnz
@alfcnz 2 года назад
💪🏻💪🏻💪🏻
@user-co6pu8zv3v
@user-co6pu8zv3v 3 года назад
I watched a video from spring 2020 about this. I watched this video now. And only when I took a pen and paper and painted everything neatly I understood everything. I'm getting old
@alfcnz
@alfcnz 3 года назад
Cool, cool. It took me 3 years to understand this. 😅😅😅
@user-co6pu8zv3v
@user-co6pu8zv3v 3 года назад
I have a great teacher :))
@andrewkatz9678
@andrewkatz9678 3 года назад
Absolute unit in the background. Video is pretty legendary, too 🤩
@alfcnz
@alfcnz 3 года назад
Unit? 😮😮😮 I'm glad you enjoyed it. It took _some_ time to craft this 😅😅😅
@andrewkatz9678
@andrewkatz9678 3 года назад
@@alfcnz not just a unit but an absolute unit (r/AbsoluteUnits). 🙀 Was today the day I get to return the favor and teach Alf something after the countless days of him teaching us?! 🤞Seems too good to be true 😺
@alfcnz
@alfcnz 3 года назад
@@andrewkatz9678 😮😮😮 🐻
@giovanni59128
@giovanni59128 3 года назад
Coming from a physical chemistry background turned to work on ML, I am increasingly finding the two worlds coming together. No wonder why I called my company Softmax😅 Cool stuff + I did not know you studied in Trieste too!
@alfcnz
@alfcnz 3 года назад
Hopefully, it means a softer-max rather than a softer-argmax! Electrical engineering background here. So, applied physicist? Yup, born and raised in Trieste. Escaped to London. Changed my personality. Left for the States.
@anondoggo
@anondoggo 2 года назад
(I think youtube lost one of my comments? Sorry if this comment is duplicated) Hi Dr. Canziani, thank you so much for sharing this course with us! Many DL course I encountered so far is a jumble of different model architectures with no overarching insight. This course showed me for the first time how (seemingly) different DL models relate to each other and this was extremely valuable. I just hope to say that I cannot thank you enough for your time and dedication. And lastly sorry I do have one small question / request: would it be possible for us to access the notebook for this tutorial as well, so we can play with the code a little bit? Thank you so much and looking forward to hearing from you!
@alfcnz
@alfcnz Год назад
I'm glad you found my work of some value. The notebook you're referring to was not yet available (there's a difference between teaching notebooks and notebooks I create for my own to generate illustrations). Nevertheless, since I've already written this book's chapter, the teaching notebook is now available. It will be released around Christmas together with the first draft of the book. So, a few months and you'll have it!
@JaskaranSingh-hp3zy
@JaskaranSingh-hp3zy Год назад
Thanks for this amazing video Dr Alfredo. I have a question, during the training of the model, do you use actual-softmin with value of Beta = 1 to do the minimization process and backpropagate through that? and if i want to use Beta = infinity would you do the same with actual-softmax or simply take the minimum distance point for z and backpropagate through it? I could not find training notebook for the same video.
@alfcnz
@alfcnz Год назад
You're welcome 😊 I did use both, β = 1 and gradient descent for the zero temperature limit. The notebook was not a teaching one (I made it for creating the illustrations). The teaching one now do exist and will be released with the first draft of the book.
@kalokng3572
@kalokng3572 2 года назад
Hi Alfredo thanks for the clear explanation on a not so intuitive topic :)) I've got a question which might be a bit silly to ask lol At the end of the video you've mentioned that there are 2 things to research/work on: 1. How to find the right decoder 2. How to determine the constraint of the latent variable May I ask for point number 2, apart from the dimension of the latent, should the subset of value of latent also be learnt? Or actually by constraint you mean constraint of all kind no matter it is dimension of z or subset of z or other weird stuff. For example 59:37, should the range of z be learnt so that it is between 0 and 2pi? I understand that in fact z can range from [-inf: inf] in this case coz eventually it is fed into a sin and cos function.
@kalokng3572
@kalokng3572 2 года назад
I think I might have figured that out. It should be all kind of constraint, like for VAE we learnt the mean and variance of a normal distribution. For other type of architecture it can be some other weird stuff...
@alfcnz
@alfcnz 2 года назад
The VAE adds some regularisation over the code, such that there's no infinite low energy space. So, point 2 says that you need to figure out a way to limit the information content of z.
@kalokng3572
@kalokng3572 2 года назад
Cooool seems information theory is a must-learn…
@humanintheloop_official
@humanintheloop_official 2 года назад
Hi Alfredo, firstly thanks for letting people learn from these outstanding resources! Is there a way to get the homework/exercise materials of the course? Thank you! Grazie (:
@alfcnz
@alfcnz 2 года назад
Have you checked the course website? 👀👀👀
@-mwolf
@-mwolf 2 года назад
@@alfcnz Hi, are there also solutions available? I couldn't find it on the website.
@alfcnz
@alfcnz 2 года назад
No, they are not.
@youtugeo
@youtugeo 2 года назад
Probably naively, I would assume that as a 1d z corresponds to a 1d zero-energy space, a 2d z would correspond to a 2d zero-energy space... But at 1:02:44 you said that if z is 2d, there is no known constraint on the zero-energy space and that the zero-energy space would be flat. Is there an intuition on why is that? Or maybe I misunderstood?
@alfcnz
@alfcnz 2 года назад
*If the ambient space is 2d,* a 2d latent would let you reach all locations, effectively giving you a flat free energy, unless you limit the volume of low energy by using a regulariser.
@youtugeo
@youtugeo 2 года назад
@@alfcnz Ah I see... If I understand correctly, the ambient space is the y space which is 2d in this case (y1, y2). And this would imply that in other cases, z can be high dimensional so long as it has less dimensions than y (the ambient space), so that the zero energy space is a subspace of the ambient space.
@alfcnz
@alfcnz 2 года назад
Doesn't necessarily need to be of smaller dimensionality. For example it could be larger but sparse.
@blackcurrant0745
@blackcurrant0745 2 года назад
Isn't average kinetic energy 3/2k_BT, each degree of freedom having 1/2k_BT energy, as opposed to 2/3k_BT at 10:30?
@alfcnz
@alfcnz 2 года назад
Absolutely! Thanks for catching this! 😅😅😅
@fernandocossio3055
@fernandocossio3055 3 года назад
This is great, thanks for the video. Do you have a public place to take a look at the exercises left as homework and the notebook you are using for the example presented (I'm not a NYU student)?
@alfcnz
@alfcnz 3 года назад
All notebooks I'm covering are on GitHub. The link it's below the video. This semester's homework are not out (yet).
@LiamchKim
@LiamchKim 2 года назад
Hello, Professor Canziani. First of all, thank you for sharing this great video. I have a question about Free energy with beta (Video around 8:58). You and Dr.Lecun mentioned the lowest energy will survive. When we multiply any number with infinity, how they survive? Sorry for the dumb question. But to understand this, I revisited lecture video several times.. Thank you! Best regards, CK
@alfcnz
@alfcnz 2 года назад
Multiply the integral by exp(-β E_min) and the term inside by exp(β E_min). You'll have that the minimum value of E will get a contribution of exp(β E_min - β E_min) = exp(0) = 1. All other terms will be exp(-β × (>0) ) → 0, β → +∞.
@LiamchKim
@LiamchKim 2 года назад
@@alfcnz Thank you so much! Now I totally understand the point. And the relationship between infinite free energy and beta free energy when beta goes to infinity. Thank you again!
@hamedgholami261
@hamedgholami261 2 года назад
Hi alf, I have a question from you: can I share my answers to the homeworks on GitHub so that others could use it if they needed, or do you think it is better to not have any answers around so that everyone could think deeply? anyways, whatever your preference is I would obey because It was nice of you to share the homeworks and you certainly didn't have to do it. tnx for your kindness.
@alfcnz
@alfcnz 2 года назад
Plz, don't share the solutions. Not only this would give me troubles for the next editions of the course, but would make online students not think deeply about the questions. We post the solutions every time the students receive their graded assignments.
@anondoggo
@anondoggo 2 года назад
I don't quite understand the difference between conditional and unconditional and their relationship to unsupervised and self-supervised training... I think I missed the formal definitions of un- and self-supervised learning was introduced in the lectures :/ Must go back to slides again...
@anondoggo
@anondoggo 2 года назад
Wait, is the only difference between the conditional and unconditional case the fact that we have a new predictor? Before we're learning the latent factors from y only (unconditional), now we're learning how x can help predict y_hat as well (conditional)? But why would it be classified as self-supervised?
@anondoggo
@anondoggo 2 года назад
tbc after I watch the video on self-supervision
@alfcnz
@alfcnz 2 года назад
Conditional: there is an x. Unconditional: there is no x. Supervised: conditional and y is human-provided. Self-supervised: conditional and y is just part of the _raw_ data. Unsupervised: uncoditional.
@anondoggo
@anondoggo 2 года назад
@@alfcnz thank you so much!
@user-gh6wd5cv1h
@user-gh6wd5cv1h Год назад
Great video. At 15:22 why is there a delta z?
@alfcnz
@alfcnz Год назад
I'm using not infinitesimal dz.
@user-gh6wd5cv1h
@user-gh6wd5cv1h Год назад
@@alfcnz yeah, if it's summation, there shouldn't be any dz right?
@alfcnz
@alfcnz Год назад
I've discretised the integral as sums of areas.
@sanskarkumar1614
@sanskarkumar1614 3 года назад
Hi, can you do some videos on solving machine learning problems (approaches and stuff) ?
@alfcnz
@alfcnz 3 года назад
For example…?
@sanskarkumar1614
@sanskarkumar1614 3 года назад
@@alfcnz Like the problems in Kaggle competitions, but just the approaches on how to go about the problem.
@alfcnz
@alfcnz 3 года назад
@@sanskarkumar1614 hum… Like practical stuff / engineering? I've never done a Kaggle competition, so I'm not really sure what problems they typically offer.
@henridehaybe525
@henridehaybe525 3 года назад
Is it me or the content of this video is exactly the same as last week's ?
@alfcnz
@alfcnz 3 года назад
Yup, this is part two. Last episode was about interference, this episode is week is training. 🙂🙂🙂
Далее
The Fan’s Fang Skin🔥 | Brawl Stars Sneak Peek
00:16
나랑 아빠가 아이스크림 먹을 때
00:15
Просмотров 4,5 млн
07 - Unsupervised learning: autoencoding the targets
56:42
02L - Modules and architectures
1:42:27
Просмотров 22 тыс.
06L - Latent variable EBMs for structured prediction
1:48:54
01 - History and resources
50:18
Просмотров 99 тыс.