
Text-to-video models explained 

Google Research
398K subscribers
6K views

Published: 28 Sep 2024

Comments: 25
@GoogleResearch 1 year ago
Subscribe to the Google Research Channel → goo.gle/GoogleResearch
@sabaokangan 1 year ago
Thank you Laurence for making this accessible to dummies like me
@LaurenceMoroney 1 year ago
You're not a dummy! But I'm happy you can enjoy it! :)
@xidchen 1 year ago
Wonderful video! But why is the orchestration order two space then two time, instead of one space then one time?
@stefan-bayer 1 year ago
Awesome explanation: just the right length and good abstraction. The host did a really great job. I am subscribed now!
@LaurenceMoroney 1 year ago
That's wonderful, thanks!
@user-wr4yl7tx3w 1 year ago
What's confusing is how you get a sensible image when you denoise the image. That part I don't quite see.
@MattiaCeccopieri 1 year ago
The magic of the "black box"
@LaurenceMoroney 1 year ago
Check the earlier episode.
@kevinbuehler9213 1 year ago
The second episode of Hidden Layers, “Text to video models explained,” maintains the same high standard as the first episode. Many thanks once again to Laurence Moroney and Google Research! Any chance we could cover Google’s LaMDA next? Perhaps there is another breakthrough conversation model you might touch upon as well. The whole idea of RLHF (Reinforcement Learning from Human Feedback) would be a great topic to dive into.
@LaurenceMoroney 1 year ago
At some point, definitely. I filmed these back in December, and things have been moving fast since then :)
@kevinbuehler9213 1 year ago
Fair enough! Question for you: you described Google’s Imagen and Parti in episode 1. I was curious as to how Google’s Muse fits into the picture.
@SMASH_REVIEWS 1 year ago
REALLY AWESOME almost Unbelievable 💯💯💯
@LaurenceMoroney 1 year ago
Thanks!
@sotasearcher 1 year ago
Awesome! I'm reacting to this live. I feel that these 2 Hidden Layers videos now beg the question: have we tried the autoregressive approach for text-to-video?
@LaurenceMoroney 1 year ago
There are so many techniques in flight and at various stages. It's hard to keep up!
@tomoki-v6o 1 year ago
Cool! Are the super-resolution models also trained with text, or just labels?
@asatorftw 11 months ago
I would love to dive deeper into this to learn how it works!
@avi12 1 year ago
That's pretty cool, though the last few models for upscaling and time lengthening sound very inefficient. Like, it would be much better to have a single model that upscales the video to resolution X×Y @ Z FPS.
@LaurenceMoroney 1 year ago
Models are generally very static in their operation: data of one shape in, and data of one shape out. Thus using multiple models, each for a specific task, and pipelining them together is generally more efficient than trying to build a single, more generic model.
@neel4fun 1 year ago
Awesome, crisp explanation ... even suitable for high schoolers and AI beginners
@LaurenceMoroney 1 year ago
Thanks! :)
@scottmiller2591 1 year ago
Does no one know the difference between "amount" and "number" anymore?
@LaurenceMoroney 1 year ago
I think I do. But sometimes, when getting excited while speaking, it's easy to use one when you mean the other. Did that happen here?
@scottmiller2591 1 year ago
@LaurenceMoroney Perhaps. Some YouTubers do it consistently.