Тёмный

The Absurd Evolution of AI Video Generators 

bycloud
Подписаться 160 тыс.
Просмотров 11 тыс.
50% 1

Опубликовано:

 

21 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 55   
@bycloudAI
@bycloudAI День назад
You can try out Luma AI's Dream Machine here! luma.1stcollab.com/bycloudai I am really good at having great timing. MovieGen came out when I nearly finished the video. I'm sad. So here's a quick definition of DiT: A diffusion transformer (DiT) is a model that combines elements of diffusion models and transformers to generate data like image synthesis, audio generation, or text generation. Diffusion models are a class of probabilistic generative models that create data by iteratively denoising a latent variable, which starts from pure noise and is gradually transformed into a coherent sample. Transformers on the other hand, are neural network architectures known for their ability to model long-range dependencies in data, primarily through self-attention mechanisms. You could ultimately say that, a diffusion transformer is just a transformer with the goal of denoising. Yum. Here's MovieGen's paper: arxiv.org/abs/2410.13720 it contains a better run down to crafting the latest near SoTA video generation
@El_Carlangas
@El_Carlangas 34 минуты назад
Thanks a lot for this video, it was really helpfull to start to understand how all these ai technology works. All the people working behind this is literal geniuses.
@tannenbaumxy
@tannenbaumxy День назад
Yes, a deep dive into diffusion transformers for one of the next videos would be awesome!
@cdkw2
@cdkw2 День назад
that bread analogy really got me hooked, nice work and animation!
@MilesBellas
@MilesBellas День назад
A video on Diffusion Transformers = 😊👍
@andrey2001v
@andrey2001v 6 часов назад
This video is so cool, a literal gold mine of information on how modern AI models work Bread analogy was extra nice - I finally understand why diffusion models struggle with different resolutions
@m_e_m_es4649
@m_e_m_es4649 День назад
Could you possibly make the same video for Openai's advanced voice mode?
@Words-.
@Words-. 20 часов назад
I second this
@authenticallysuperficial9874
@authenticallysuperficial9874 9 часов назад
Upvote
@nilaier1430
@nilaier1430 День назад
Hey, bycloud, even if you fell off, I won't stop watching your nerdy videos because they're cool ❤
@thenoblerot
@thenoblerot 20 часов назад
Yes please, a video on diffusion transformers! Great channel
@DeepakSingh-ji3zo
@DeepakSingh-ji3zo 21 час назад
This is just excellent!! Animations and Analogies were pure gold.
@lex_darlog_fun
@lex_darlog_fun 12 часов назад
Diffusion transformers in general? Yes, please!
@Nazrininator
@Nazrininator 20 часов назад
I like how you added the Physics Simulation clip. I like it.
@huraqan3761
@huraqan3761 23 часа назад
De-noised bread, got it!
@Words-.
@Words-. 20 часов назад
Thank you for finally explaining!
@MilesBellas
@MilesBellas День назад
Baking Bread = great metaphor
@TankorSmash
@TankorSmash 16 часов назад
That bread analogy was 100% chatgpt
@niklase5901
@niklase5901 7 часов назад
You are my fav AI channel so it would be great to hear your take on Yann LeCun idea on how to build human level intelligence. He held a talk about this on the Hudson forum recently. Instead of LLM:s he wants to build models that truly models works by predicting the state of the world given some action. I can see how that would be a very effective model, but I suspect it will be easier to get around all the short falls of LLM, than to build this fancy model LeCun suggests. What do you think?
@dpactootle2522
@dpactootle2522 8 часов назад
I watched half of the video to remind myself that life can suck a lot sometimes.
@TahuRock
@TahuRock 13 часов назад
GOATED VIDEO 💪🏾💪🏾💪🏾
@iknowsolittle
@iknowsolittle День назад
How are you this smart and knowledgeable? Dont answer that. I just think ur super cool dude haha
@TheDreamFx
@TheDreamFx 23 часа назад
Hey! Great video! It would be nice if you cloud link your blog in the video description :)
@kingki1953
@kingki1953 9 часов назад
In summary: put noise dough to oven and cook it to become AI video generator 🗿
@n45a_
@n45a_ День назад
wth i just thought that i need an explanation for diff transformers erlier today
@Нокии
@Нокии День назад
3 views in 2 mins bro fell off🔥Shout out my favorite nigerian tech youtuber
@bycloudAI
@bycloudAI День назад
going for the "ranking by views: 10 of 10" for this one 🔥🔥🔥🗣️🗣️🗣️
@DynamicLights
@DynamicLights День назад
​@@bycloudAIlol
@DynamicLights
@DynamicLights День назад
He is Nigerian how do u know?
@StefanReich
@StefanReich День назад
Bro does NOT sound Nigerian
@Нокии
@Нокии День назад
@@DynamicLights I personally met him in abuja
@pedrogorilla483
@pedrogorilla483 6 часов назад
Just one day after you release this video we have Allegro, new open source video model. Check it out.
@albert123a
@albert123a 14 часов назад
Just put the fries in the bag bro
@snylekkie
@snylekkie 21 час назад
@bycloud do you know if anyone encoded math statements as integers like Gödel did, and used that as a custom LLM encoder for math proofs?
@ulamss5
@ulamss5 12 часов назад
at some point the bread analogy was harder to understand than the actual math
@LonewolfeSlayer
@LonewolfeSlayer 18 часов назад
Someone mentioned it but is the algorithm just messing with you at this point. You used to get a lot of views.
@starbez
@starbez 21 час назад
Shouldn't sponsored content be mentioned within the first minute of a RU-vid video?
@haukauntrie
@haukauntrie 10 часов назад
Why would it?
@Eric-yd9dm
@Eric-yd9dm 4 часа назад
> I am really good at having great timing - cloud,By on making videos about an area with research speed bonus modifiers correlated to the number of youtube videos about it =P
@abhrodipsingharoy4508
@abhrodipsingharoy4508 18 часов назад
All i learnt how to make bread.
@B-gj3tj
@B-gj3tj 19 часов назад
Does open sora have a huggingface?
@LumiLumiLumiLumiLumiLumiLumiL
@LumiLumiLumiLumiLumiLumiLumiL День назад
Can u cover Neuro Sama? How she's made etc. how one could re-create her?
@raspberryjam
@raspberryjam 23 часа назад
Vedal isn't making that information public. Maybe one day, but for now it's under lock and key
@LumiLumiLumiLumiLumiLumiLumiL
@LumiLumiLumiLumiLumiLumiLumiL 10 часов назад
@@raspberryjam well its easy to Kind of guess! Its clearly a LLM and maybe some tts like sovits... The llm will prolly be something like Mistral as Qwen needs commercial and Llama the 'Built with Llama' etc. He said there is an LLM as a filter and a way for the Ai to feel emotions. He said something about watching movies and having feelings.
@the2bros693
@the2bros693 11 часов назад
you better name it "some nerd shit"
@awaisamin3819
@awaisamin3819 18 часов назад
450 th like
@trymleiknesbruvik2052
@trymleiknesbruvik2052 День назад
first
@DynamicLights
@DynamicLights День назад
Correct.
@GamingCoderzX
@GamingCoderzX День назад
damn bro fell off, can i get a pin :3
Далее
Handsoms😍💕
00:15
Просмотров 4,8 млн
Меня знают уже все соседи😅
00:34
The Largest Unsolved Problem in VR.
25:43
Просмотров 955 тыс.
How AI Theft is Killing Free Speech
34:38
Просмотров 259 тыс.
BlackRock: The Conspiracies You Don’t Know
15:13
Просмотров 3,5 млн
The Unreasonable Effectiveness of Prompt "Engineering"
15:12
Replacing god with GPT-4o (in Minecraft)
28:30
Просмотров 192 тыс.
5 Languages I Will NEVER Learn
12:24
Просмотров 243 тыс.
How Math Becomes Difficult
39:19
Просмотров 77 тыс.
Why AI Simulated DOOM Is Actually Absurd
13:20
Просмотров 93 тыс.
everything about color (literally)
24:56
Просмотров 286 тыс.