Тёмный

A GPT-3 for Images? Dall-E is the most impressive AI ever created! 

Sebastian Schuchmann
Подписаться 10 тыс.
Просмотров 36 тыс.
50% 1

DALL·E / Dall-E is a model based on GPT-3 but for generating images. In the realm of Machine Learning or AI, this has to one of the most impressive models ever released. OpenAI again pushes the boundaries of what's possible.
Support me on Patreon: www.patreon.com/user?u=25285137
ML-Agents Discord Channel: / discord
Keep in touch: / sebastianschuc7
Original Article: openai.com/blog/dall-e/
Music by Lemmino: soundcloud.com/lemmino/encoun...

Наука

Опубликовано:

 

6 янв 2021

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 118   
@randyjordan1320
@randyjordan1320 3 года назад
2:42 a green... what?
@SebastianSchuchmannAI
@SebastianSchuchmannAI 3 года назад
Oh lol :D
@akuma7616
@akuma7616 3 года назад
They're memeing us
@adamklam1
@adamklam1 3 года назад
i read it with a pause after red gloves- sounds DALL-E has some opinions about penguins.
@randyjordan1320
@randyjordan1320 3 года назад
@@adamklam1 lmao
@NicksStuff
@NicksStuff 3 года назад
So...the neural network didn't offer any good answer
@zarthy4169
@zarthy4169 3 года назад
3:57 For the one on the top left, the ai just said. “No, S P H E R E”
@jonatan01i
@jonatan01i 3 года назад
2:37 That penguin wears some green shit!
@KlaudiusL
@KlaudiusL 3 года назад
Most prediction say: AI will reach the singularity at the 40s. Looks like will happen before the 30s
@vtopia.1679
@vtopia.1679 2 года назад
@@puppydragongirl imagine playing a hyper realistic game on an old smartphone
@McDonaldsCalifornia
@McDonaldsCalifornia 3 года назад
I love the concept of prompt engineering becoming a new kind of coding/computer job. I feel like it plays more to my personal strengths than writing code
@qcdatabasevideos3282
@qcdatabasevideos3282 3 года назад
Just found your channel. Great stuff. Keep up the good work!
@pathaleyguitar9763
@pathaleyguitar9763 3 года назад
Everyone noticed the "green shit" at 2:42, but I would also like to draw everyone's attention to the penguin at the bottom of the center column who seems to be a tad angry at us....
@digitalspecter
@digitalspecter 3 года назад
We should use crowdsourcing for data and a distributed computer program to get the computing power from volunteers (it should work well with machine learning because amount of data to computation power is low) like they used for dna analysis / seti program. It's dangerous to let only a select few big companies with the data and resources to develop the next step in computer algorithms...
@Neceros
@Neceros 3 года назад
Copyright free product suggestions with a photo of the item. Heckn.
@genericusername1243
@genericusername1243 3 года назад
long live yt recommendations AI
@whnvr
@whnvr 3 года назад
with the ‘openai storefront’ one i understand why they had to write it in that way though. when i’m working with gpt-3 i often have to use abstract, convoluted wording that makes more sense to the model than to me in order to coax the right results out. i often feel like i’m developing this completely unique, new skillset of ‘communicating w/ ai’ due to how many unforeseen interpretations it has of my instructions. kinda lends weight to the theory that ai will destroy us simply by misunderstanding the purpose we give it haha
@a-ragdoll
@a-ragdoll 3 года назад
how did u access gpt-3 tho
@whnvr
@whnvr 3 года назад
@@a-ragdoll i applied and gave them a strong use-case for what i’m looking to achieve w/ it
@bestofthebest3812
@bestofthebest3812 2 года назад
@@whnvr what are you doing? Just curious and intrigued
@itsjustthatsimple628
@itsjustthatsimple628 3 года назад
That's sick!!
@markusbuchholz3518
@markusbuchholz3518 3 года назад
Great video as a whole your YT channel. Your performance is outstanding and effort impressive. Yes the GPT-3 is very cool and promising, State of the of work and many fascinating application can be build on this knowledge. However I prefer to capture your awesome ML agents ... first! You are amazing man who has to be cloned so our planet will be even better. Keep fingers for your great success!
@isd4154
@isd4154 3 года назад
I love the vintage one because you can put something that wasn't even invented during that time and see what it'll look like
@juanmanuelcirotorres6155
@juanmanuelcirotorres6155 3 года назад
The best channel that I found today
@zarthy4169
@zarthy4169 3 года назад
2:52 It looks like for some of those images, the AI took “in the shape of a square” literally.
@FarfettilLejl
@FarfettilLejl 3 года назад
How else could it have been interpreted?
@zarthy4169
@zarthy4169 3 года назад
@@FarfettilLejl Well, for four of them, they literally just put a light bulb inside of a square.
@a-ragdoll
@a-ragdoll 3 года назад
i like how one of them is a sort of hexagon (top right)
@zarthy4169
@zarthy4169 3 года назад
@@a-ragdoll Yeah.
@kenneth7239
@kenneth7239 3 года назад
I've noticed, It's REALLY good with hands, and lighting / texture.
@moved8575
@moved8575 3 года назад
2:43 look at the green shirt part
@shilohv
@shilohv 3 года назад
Lol. Pretty sure it’s a typo in the video. Now I’m curious what would happen is you actually ran that text. Would the penguin be standing on a green poop emoji?
@PS0DEK
@PS0DEK 3 года назад
@@shilohv Dall-e tries to make sense of what's in the context, even if you add some noise (a.k.a. typos or out-of-context text).
@shilohv
@shilohv 3 года назад
@@PS0DEK I was wondering about that. That makes sense. Kind of like Google’s "did you mean" filter. What would happen if you turned that off? Would it get confused, or come up with something completely different? For that matter what if you typed "and now for something completely different"?
@PS0DEK
@PS0DEK 3 года назад
@@shilohv We lack a proper paper to explain how exactly it works. But it may be impossible to turn this feature off anyway since neural networks are non differentiable, you cannot separate the funcions into smaller blocks.
@akuma7616
@akuma7616 3 года назад
Oof... I thought you had half a million subscribers but hen I noticed a dot in the middle... And I'm like: man, really? This channel deserves much more. I can't help but say that it's sad seeing someone like you getting so little views and support for amazing work that you're doing. I think you'll get to 50k subs this year. Best of luck.
@adrianv.v.4445
@adrianv.v.4445 3 года назад
Nope, it can in fact generate reflections, its just that the network was given a cut-out image of a mirror where it was virtually impossible for it to make something coherent
@Desertpunk1986
@Desertpunk1986 3 года назад
Puppetmaster will come out of the primordial soup that is GPT-3.
@OneArmDan
@OneArmDan 3 года назад
Okay, I can just feel the rise of this channel.
@HarrisonBorbarrison
@HarrisonBorbarrison 3 года назад
Okay, I can just feel the rise of this comment.
@jayknox339
@jayknox339 3 года назад
I agree. Give it some time. Itll happen.
@simonstrandgaard5503
@simonstrandgaard5503 3 года назад
Incredible
@MindSweptAway
@MindSweptAway 3 года назад
Wow!
@laurasmith9135
@laurasmith9135 3 года назад
but how do you use this? do you have to install it on your computer first?
@sgt391
@sgt391 2 года назад
Can't wait in 20 years when phones will be able to run the training for this model in seconds
@Guytron95
@Guytron95 3 года назад
don't suppose you have a link to the clone of the network architecture?
@canaldoapolinario
@canaldoapolinario 3 года назад
It seems like there is room for yet another layer of abstraction between natural language from the prompts and the actual model, maybe training a neural network to get natural language prompts and "translate" to the weird english that the AI seems to work best on
@KlimovArtem1
@KlimovArtem1 3 года назад
5:05 - every mirror has the same shape for some reason)
@connormc4050
@connormc4050 3 года назад
It's because they all have the same seed image
@KlimovArtem1
@KlimovArtem1 3 года назад
@@connormc4050 what do you mean by "seed image"? Is it not taking only the text string as an input?
@serta5727
@serta5727 3 года назад
Subbed
@Phil_AKA_ThundyUK
@Phil_AKA_ThundyUK 3 года назад
How do you use it? All the options seem locked.
@ConnoisseurOfExistence
@ConnoisseurOfExistence 3 года назад
Sharing here and there...
@alan2here
@alan2here 3 года назад
A lizard is practising calligraphy.
@mykulpierce
@mykulpierce 3 года назад
I was just reading about this today. Are there any plans for this being a tool for developers or artists? I'd really love to give it a try
@BigOlSmellyFlashlight
@BigOlSmellyFlashlight 3 года назад
probably not considering ClosedAI's stinginess
@StagnantMizu
@StagnantMizu 3 года назад
How do I get acces? I have a gpt3 key.
@JACBoyJesse
@JACBoyJesse 3 года назад
I love the animal - food/objects hybrids.
@TrueValience
@TrueValience 3 года назад
This video was great. You should experiment with videos like two minute papers does
@TrueValience
@TrueValience 3 года назад
its kind of like this
@IconoclastX
@IconoclastX 3 года назад
i cant wait until 2mp does a vid on this and we get to have this model for ourselves c:
@MindSweptAway
@MindSweptAway 3 года назад
I think the reason why this is a demo is because it’s still In beta, and if you use the model it would break.
@ilzhukov-art-copy
@ilzhukov-art-copy 3 года назад
Hello! how can we use it on personal pc? Where is the soft?
@florianschneider3982
@florianschneider3982 3 года назад
6:38
@sumdud2129
@sumdud2129 3 года назад
So if the API isn't actually open source I can't just download this and start making images myself?
@a-ragdoll
@a-ragdoll 3 года назад
its open source, but it doesnt have the thingy that lets u generate pictures
@Hennesg
@Hennesg 3 года назад
If the public gains access to this a few million people will loose their jobs over time. Stockimage creators, illustrators, product designers
@a-ragdoll
@a-ragdoll 2 года назад
if it was released in its current state it would be easy to see that its made by ai
@findahuman6110
@findahuman6110 3 года назад
Thank you for the great video and content as always! The noise transitions were quite jarring though
@635574
@635574 3 года назад
Just wait for the video dal-EE
@boknonoyski
@boknonoyski 2 года назад
2:38 they spelt shirt wrong
@mrquackface
@mrquackface 3 года назад
Our AI OVERLORD ARE ALMOST COMMING
@vitotonello261
@vitotonello261 3 года назад
bring Pokemon to the next level!
@godofthecripples1237
@godofthecripples1237 3 года назад
We all know what this is really going to be used for once it's stable and publicly available.
@a-ragdoll
@a-ragdoll 3 года назад
nightmare fuel?
@godofthecripples1237
@godofthecripples1237 3 года назад
@@a-ragdoll I was thinking along the lines of something more NSFW, but yeah, plenty of nightmare fuel will be out there
@a-ragdoll
@a-ragdoll 2 года назад
@@godofthecripples1237 if someone tries to generate nsfw stuff on this thing its still gonna be nightmare fuel, maybe in 10 years it will look better
@VincentFischer
@VincentFischer 3 года назад
This is shockingly near AGI level isn't it? I mean the multi disciplinary understanding of all things baffles me.
@robo1540
@robo1540 3 года назад
is that the mf cicada 3301 song by lemmino
@k8ieone
@k8ieone 3 года назад
Yup! I was looking for a comment like this.
@hward1973
@hward1973 3 года назад
would be great for confused girls trying to explain what they want in a tatoo shop
@Michaelf122
@Michaelf122 3 года назад
So basically you're ai is an incredible google image search
@maythesciencebewithyou
@maythesciencebewithyou 3 года назад
All the images are created by the AI and haven't existed before.
@megaheroes3611
@megaheroes3611 3 года назад
How can I use this?
@florianschneider3982
@florianschneider3982 3 года назад
6:38
@cadenitadelnazareno6717
@cadenitadelnazareno6717 3 года назад
Human text=Green shit, I’m sure that should’ve looked different
@XetXetable
@XetXetable 3 года назад
Those mirror results look weird. Why is it the same mirror every time? There don't seem to be similar constants in the other examples.
@SebastianSchuchmannAI
@SebastianSchuchmannAI 3 года назад
A big Part of the Image was given in this Case. Sorry, it isnt shown in the Video
@LaPapaya
@LaPapaya 3 года назад
That one had a prompt image like the old gpt-image
@zarthy4169
@zarthy4169 3 года назад
@@LaPapaya Oh, so you can set a specific prompt image that will show up for each image.
@LaPapaya
@LaPapaya 3 года назад
@@zarthy4169 Exactly, it will generate from that prompt image.
@TheGeekosDen
@TheGeekosDen 3 года назад
„Green Shit” lol
@zarthy4169
@zarthy4169 3 года назад
2:41 The middle one looks perfect. It looks like an actual emoji.
@robo1540
@robo1540 3 года назад
yoo is that 1:57 the default minecraft grass top texture
@robo1540
@robo1540 3 года назад
holy shit it pixel-by-pixel is how much minecraft does one have to play to be able to tell that texture apart from all other random noise
@abrampainter3764
@abrampainter3764 3 года назад
@@robo1540 Yeah just googled it. That's crazy
@dadthelad
@dadthelad 3 года назад
A penguin wearing a green shit???
@NicksStuff
@NicksStuff 3 года назад
Where's the snail?
@magicjuand
@magicjuand 3 года назад
it's just fancy search
@zarthy4169
@zarthy4169 3 года назад
2:19 This looks like Minecraft.
@EricRogstad
@EricRogstad 3 года назад
You don't say the "p" in "OpenAI"? Sounds like "O'en AI"
@workflowinmind
@workflowinmind 3 года назад
Mom? I'm scared
@myuniquehandle
@myuniquehandle 3 года назад
Is it nothing more than a image search? Most of the images are designed by humans, it would be interesting to highlight which changes GPT-3 did (if any)...
@florianschneider3982
@florianschneider3982 3 года назад
are they designed by humans?
@nekomimicatears
@nekomimicatears 3 года назад
Where did you hear that?
@maythesciencebewithyou
@maythesciencebewithyou 3 года назад
It's not an image search. All the images are created by GPT3
@maxziebell4013
@maxziebell4013 3 года назад
It’s a decoy... these results are just attention bait while “Next” is ravaging through the world ;-)
@ziquaftynny9285
@ziquaftynny9285 3 года назад
Next? Can you elaborate?
@maxziebell4013
@maxziebell4013 3 года назад
It is a tv series about an rough AI
@vsiegel
@vsiegel 3 года назад
Wait, what? That model is an order magnitude smaller than GPT-3 and an order of magnitude more scary than GPT-3. I use "scary" here as a unit of AI performance. What irritates me is that my intuition tells me that generating images needs a very abstract understanding of objects and other concepts. Yes, there is the problem: I used the word "understanding".
@StagnantMizu
@StagnantMizu 3 года назад
GPT-3 is scary too, had some conversation in playground with it which made me almost doubt if it was sentient or not lmao
@maroon9138
@maroon9138 3 года назад
German english
@JordanPriede
@JordanPriede 3 года назад
Excessive use of the digital noise transition, and the very slow excessive blur transition to bring in the example photos. It takes away from the rest of the video. Great content, though.
Далее
Why GPT-3 changes everything (and how it works)
13:09
Каха заблудился в горах
00:57
Просмотров 826 тыс.
A.I. in Minecraft
9:20
Просмотров 54 тыс.
GPT3: An Even Bigger Language Model - Computerphile
25:57
These Neural Networks Have Superpowers! 💪
7:30
Просмотров 148 тыс.
AlphaFold: The making of a scientific breakthrough
7:55
Is OpenAI still open?
7:24
Просмотров 10 тыс.
Light Fields - Videos From The Future! 📸
5:13
Просмотров 134 тыс.
iPhone 15 Pro в реальной жизни
24:07
Просмотров 446 тыс.
How to Soldering wire in Factory ?
0:10
Просмотров 4,1 млн