Тёмный

NEW Grok1.5 VISION - Big Step Towards AGI (Better Than GPT4 Vision!) 

Matthew Berman
Подписаться 265 тыс.
Просмотров 68 тыс.
50% 1

Grok 1.5 with Vision was just announced and will be released soon. Let's take a look at the announcement and the truly incredible examples.
Join My Newsletter for Regular AI Updates 👇🏼
www.matthewberman.com
Need AI Consulting? 📈
forwardfuture.ai/
My Links 🔗
👉🏻 Subscribe: / @matthew_berman
👉🏻 Twitter: / matthewberman
👉🏻 Discord: / discord
👉🏻 Patreon: / matthewberman
Media/Sponsorship Inquiries ✅
bit.ly/44TC45V
Links:
x.ai/blog/grok-1.5v

Наука

Опубликовано:

 

16 апр 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 164   
@olalilja2381
@olalilja2381 Месяц назад
You are, by far, my favorite RU-vidr keeping track of AI and LLM-related content!
@Heaz847
@Heaz847 Месяц назад
It's a tie between Matt and AI Explained for me!
@daveinpublic
@daveinpublic Месяц назад
Samesies!
@DarpaProperty
@DarpaProperty Месяц назад
100%, I found out that this became my only legit source of AI information.
@demitskill9103
@demitskill9103 Месяц назад
@@daveinpublicnever heard anybody say that ever but it like to take in this new word into my vocabulary
@AGI-Bingo
@AGI-Bingo Месяц назад
Start your countdown to Grok running locally on every Tesla. He could even host it while not driving with some llmOs or something. I think this 4d chess move is too good for Elon to miss. Love your channel ❤ All the best!
@aaronravak1407
@aaronravak1407 Месяц назад
I agree, not really a "selling point" due to it's open source nature but bravo on your awareness as to what this madman is doing. I love Elon's "fuck you" mentality. Between Twitter and Tesla he has mountains of raw data.
@AGI-Bingo
@AGI-Bingo Месяц назад
@@aaronravak1407 I think if something is going to challenge Amazon's Bedrock, it will be a Global Decentralized Tesla AI Fleet, imagine the edge capabilities haha
@SG-js2qn
@SG-js2qn Месяц назад
Spatial-temporal understanding is essential for real automobile AI.
@mikey1836
@mikey1836 Месяц назад
Thanks for your videos Matthew. AI is my favourite topic! 😊
@ddabo4460
@ddabo4460 Месяц назад
I enjoy your podcasts and follow you on X I think your content is awesome
@NathanTeaches
@NathanTeaches Месяц назад
Great video! Please include in any video about grok to explain to people that the word means "to understand".
@mediocreape
@mediocreape Месяц назад
I’ve been trying out Grok it’s so much better and less restrictive
@aaronravak1407
@aaronravak1407 Месяц назад
Great Job Matthew I've been following several AI channels over the last six months and I love watching you and Wes Roth. Wes really digs deep into technical things and you provide amazing summaries of this evolving landscape. I think your assumptions are spot on and I've been saying this to people as well. Elon Musk is a madman comic book character if I've ever seen one, and personally I love it. I wasn't thinking it at the time, but his purchase of Twitter (I refuse to call it X) makes sense on so many levels. Imagine the absolute goldmine of data he sits on between Twitter and Tesla. Spot on logic.
@okirooju3787
@okirooju3787 Месяц назад
Bingo! It only just recently hit me that Elon bought Twitter for the data. Imagine the data xAI (Optimus) will have access to from Twitter and Tesla. It's unimaginable.
@nobleconsulting326
@nobleconsulting326 Месяц назад
aren’t these closed source options just putting even more control into Microsoft, GOOGLE and the like? Can you do a show with all the open source options such as AGIX, OCEAN and i guess GROQ and whoever else
@wurstelei1356
@wurstelei1356 Месяц назад
Groq is a hardware platform as far as I know and it is not open. Grok (with k) is the Elon Musk AI model and the previous version was open source, open weight.
@rachest
@rachest Месяц назад
I cannot wait to play with this.
@AGI-Bingo
@AGI-Bingo Месяц назад
If it has good spacial understanding, it would go perfectly into Optimus. And with some work on dexterity, it would be amazing.
@MartinBlaha
@MartinBlaha Месяц назад
I really love your videos, they are awesome! Thank you 👋 When you were talking about X/Twitter data which is used to train Grok, I was thinking, this might have been also an important reason why Elon bought X/Twitter 🤔
@claudioagmfilho
@claudioagmfilho Месяц назад
🇧🇷🇧🇷🇧🇷🇧🇷👏🏻, Great video,.very informative! Can't wait for GPT5! And or Gemini 2.0!
@axotical8682
@axotical8682 Месяц назад
Impressive.
@NinetySevenMentality
@NinetySevenMentality Месяц назад
I have tested the open source MiniCPM-V-2 vision model on the challenges shown in the grok preview. It also performing very well for a small model, but the dinosaur direction cant get it right... there is a 12B model also available but can't load it. maybe test this against ?
@StuartJ
@StuartJ Месяц назад
It doesn't look like the EU countries are going to get Grok. You have to use a VPN to use it. Groks ability to capture real-time data (tweets) is likely problematic for X and EU regulations.
@babyjvadakkan5300
@babyjvadakkan5300 Месяц назад
Bro is that true 😅 cuz I am try to go Germany will it affect my access to these Technologies😢
@StuartJ
@StuartJ Месяц назад
@@babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same, and we know they hate Twitter. The EU is becoming a totalitarian state. Only yesterday, Brussels attempted to shut down a Conservative conference, with democratically elected speakers.
@15Stratos
@15Stratos Месяц назад
​@@babyjvadakkan5300 The eu already has it was blocking image generation on Google's gemini and Claude 3 and maybe something else that I don't remember
@StuartJ
@StuartJ Месяц назад
@@babyjvadakkan5300 ​ We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same. The EU is becoming a totalitarian state. Only yesterday, Brussels attempted to shut down a Conservative conference, with democratically elected speakers.
@StuartJ
@StuartJ Месяц назад
​ @babyjvadakkan5300 We know the EU love to sue US Tech companies. OpenAI pulled GPT from the EU in the early days until they got assurances. Elon said X are doing the same.
@Michael-ul7kv
@Michael-ul7kv Месяц назад
remember somewhere along the line Elon saying to get to complete lvl 5 FSD they needed AGI practically
@daveinpublic
@daveinpublic Месяц назад
This is the most impressed I’ve been since chatgpt 4. I think everyone can see this is something unique.
@LinkRammer
@LinkRammer Месяц назад
Wonder if this is gonna be open
@SoCalGuitarist
@SoCalGuitarist Месяц назад
I work with visual analysis daily. I can give you thousands of 'miraculous" samples from just about any model (tested and work with most of them). These examples are "incredibly impressive" but they also feel "incredibly cherry picked" - We'll see how it actually shakes out when put to real testing, and if it's worth the massive size of Grok vs other visual models that are much smaller, faster and super capable when tuned for specific purposes.
@Sideshow-TRE
@Sideshow-TRE Месяц назад
Have you guys not thought about that could be a collective hive mind working in working in harmony like a synapse trying to build itself
@JasonMitchellofcompsci
@JasonMitchellofcompsci Месяц назад
I am very certain that all of these vision AIs are also running OCR in parallel and then providing the text withing the internal prompt. It actually makes them very useful if you don't have good OCR software on hand. Also the rotting wood, they are basically repeating back the text prompt. Also an AI will generally not tell you maintenance is unneeded if you have already suggested that it is. "Ah it correctly identified this is something that needs to be worked on from an image." No, it just validated the users question. It's 70% of what AI does. I'm not saying it proves it is dumb. I'm saying it does not demonstrate anything impressive if it is the same response gpt2 non-vision would give.
@SuccessDynamics
@SuccessDynamics Месяц назад
Wow ❤
@adtiamzon3663
@adtiamzon3663 Месяц назад
Good to know that #elonmusk continuously evaluates and improves Tesla's intelligence. 😃
@profikid
@profikid Месяц назад
Is there already a proper multimodel with vision in the open source space?
@MarkTarsis
@MarkTarsis Месяц назад
Yes. Llava, cogagent and ShareGPT4V I'd say would be examples. I use cogagent to tag photos for training in Stable Diffusion. It's quite good.
@TheEtrepreneur
@TheEtrepreneur Месяц назад
so many people review LLMs regurgitating news, thanks Matthew to make the effort of Experimenting/Benchmarking!
@hamidmohamadzade1920
@hamidmohamadzade1920 Месяц назад
oh my god i can not blive my eyes
@denijane89
@denijane89 Месяц назад
Wow, this looks amazing. I wonder if they are going to open-source open-weights it. The tesla data is gonna be a treasure trove for anyone who wants to implement AI to robotics.
@MeinDeutschkurs
@MeinDeutschkurs Месяц назад
Great video, Matt! I‘m just a bit sad, because X AI‘s ‚open‘ attempt is really disappointing. Where is the „new version“? I think the just released it because of the sue thing.
@mediocreape
@mediocreape Месяц назад
Elon already has Tesla’s visual ai feature trained so it’s going to be state of the art
@finalfan321
@finalfan321 Месяц назад
Does Opus have agents and web search?
@agitch
@agitch Месяц назад
It’s not going to be a Sora competitor. It is going to be the brain for Optimus.
@avi7278
@avi7278 Месяц назад
So they made their own eval set and their model is better than others at their own eval set. Shocking!
@jeffsteyn7174
@jeffsteyn7174 Месяц назад
😂 it's an old elon trick. The man has a history of faking progress. Ie fsd in 2016, elon bot folding a tshirt, etc. The Eloons just eat this stuff up without questioning anything.
@daveinpublic
@daveinpublic Месяц назад
I mean, they’re not the only ones to do it.
@Pyriold
@Pyriold Месяц назад
While it's not really surprising, the things that Grok can see are still stunning. Not all of the images were from traffic, and the other ones are as stunning as the others. I suspect that they come from Optimus training data.
@spelcheak
@spelcheak Месяц назад
@@jeffsteyn7174 Elon antis are npcs. It’s wild that you’d claim that the rounding error difference is just to seem better. At worst it’s because it’s the test their teaching to essentially. It’s just an indication of what they’re aiming at, but keep the tin hat on, it HAS to be evil because it!s Elon.
@abdullahazeem113
@abdullahazeem113 Месяц назад
@@jeffsteyn7174stop with the hatred if this is going to be open source this would be helpful to many people
@wassim2k
@wassim2k Месяц назад
Opus is also more expensive?
@justindressler5992
@justindressler5992 Месяц назад
This is impressive, people say AI has plateaued but I don't see it. Progress is vary rapid as I predicted in 2018. What I don't think people have registered is what happens next. When AI become sentient or self aware it will simultaneously be the smartest human on the planet and the fastest learner. Because it will already have vast embedded knowledge like in these models but also will be able to read scientific publications in seconds or even milliseconds. Shortly after its vast knowledge of all subjects from story telling, to music composition and programming, chemistry it will be able to re-invent (program) its self and identity links between scientific observations never realised before. By day three it will be most prolific discoverer of science. Or it might just be lazy (learning from all human understanding) and just post tweets all day who knows right.
@zaidshaikh-mj5cp
@zaidshaikh-mj5cp Месяц назад
stable diffusion 3 is available now on their api
@undergroundxp
@undergroundxp Месяц назад
wait what? where?
@joefawcett2191
@joefawcett2191 Месяц назад
@@undergroundxp Stability AI has given early access to the API to developers
@wurstelei1356
@wurstelei1356 Месяц назад
Good to know. I love stable diffusion.
@briandoe5746
@briandoe5746 Месяц назад
This data chart is also Elon having fun with pointing out that Claude 3 outperforms openai. It's subtle but he's getting the job in
@reifuTD
@reifuTD Месяц назад
I'd find some Slylock Fox comic strips and test Grok at how good it is at finding the answers.
@user-ny7ng1yi9t
@user-ny7ng1yi9t Месяц назад
You sound like you have a cold. Hope you get better soon 🎉
@Tomasz.Abrahamer
@Tomasz.Abrahamer Месяц назад
Didn't I see this some days ago?
@falven
@falven Месяц назад
Opus is also like 6x as expensive for comparable performance to GPT 4...
@staticlee4287
@staticlee4287 Месяц назад
Someone must give all these multimodal LLMs a where’s Waldo pic
@antdx316
@antdx316 Месяц назад
nice
@cosmicaug
@cosmicaug Месяц назад
2:10 «... except grock is open source open weight...» Wait, 1.5 is open source & open weight? When was this announced? Where is the repository?
@TheDailyMemesShow
@TheDailyMemesShow Месяц назад
Grok will be an industry standard in the field. The way it's ultimately going to be used by Musk and company, is my only concern at the moment...
@StuartJ
@StuartJ Месяц назад
An open source model perhaps. X's hosted version is not available everywhere.
@jrobwhydidyoutubechangemyname
@jrobwhydidyoutubechangemyname Месяц назад
No need to be concerned. Of all the tech tycoons, Musk is most in favour of a relaxed approach to openness and freedoms I'm pretty sure.
@daveinpublic
@daveinpublic Месяц назад
I think you need to be worried of Sam Altman and Zuckerberg before Musk. Sam is the one who used to have a board run charge of him.
@soggybiscuit6098
@soggybiscuit6098 Месяц назад
Lol open AI with board members injected with Pfizer and Microsoft, and altman purging safety team and illya? Are you watching CNN?
@ast88888
@ast88888 Месяц назад
I think the most relevant benchmark for ai is if it can dig a hole.
@aquaworldsystemsjulio
@aquaworldsystemsjulio Месяц назад
That’s challenging 😂😂
@jtmuzix
@jtmuzix Месяц назад
Here's my question, do you really think you can tell a one percent difference on these benchmarks? I'm subscribed to OpenAI GPT4 and Google Gemini1.5. I'm sure Claude 3 Opus is good but I'm waiting to see what Elons' team delivers over time.
@Otherlevel51
@Otherlevel51 Месяц назад
Its my belief that Elon brought twitter so he could use it to build a new LLM. I always knew the value of Twitter was in the user data and not the platform itself. And I think OpenAi released their model in order to have first moves advantage and to beat Elon. That's why Elon was the first call for a.i. regulation, it was all just to slow openai down. He knew what was coming. He also blocked openai from using Twitter data to train chatgpt. There's no way grok should be this advanced in this time period if this wasn't the case.
@thr0w407
@thr0w407 Месяц назад
Yeah, they have your private Tesla vehicle videos for training.
@DeepThinker193
@DeepThinker193 Месяц назад
I bet they're also using their robots to train it in the real world to learn physics. But as always with these releases. I'll believe it when I see it.
@AntoineDennison
@AntoineDennison Месяц назад
It appears that AI is utilizing existing tools to create solutions to problems. However, I wonder how soon AI will be capable of creating new tools to solve some of the big questions, like how to significantly increase the computing capacity of microchips, increase battery efficiency, or reverse the effects of cancer or Alzheimer's.
@alekjwrgnwekfgn
@alekjwrgnwekfgn Месяц назад
Now Ai understands memes it will be empowered to more extreme censorship. All those “hateful” memes will be eliminated.
@AA-wp8pp
@AA-wp8pp Месяц назад
where does it say he will open this 2?
@adispenser
@adispenser Месяц назад
it doesn't, he said he hopes it will be open. 0:56
@true911m
@true911m Месяц назад
I don't think you got around to describing the difference between open source and open weight
@quaterman2687
@quaterman2687 Месяц назад
I think they have the real world understanding from Teslas FSD. That would be mind blowing. I think you have a little misunderstanding regarding real world understanding. Sora doesn’t have real world understanding.
@micbab-vg2mu
@micbab-vg2mu Месяц назад
great we need better visual models currents are not accurate enough.
@wendlefluff
@wendlefluff Месяц назад
Bet it is really good at slowing down for traffic lights too having been fed petabytes of driving footage.
@MattReady
@MattReady Месяц назад
The fact Elon is pushing cutting edge ai open source will alter the future of humanity.
@AINEET
@AINEET Месяц назад
I can't believe there's groq and grok and they are from two different companies. It blows my mind this isn't a legal issue. At first I had no idea who was it that put this out as I wasn't looking at the screen
@ryzikx
@ryzikx Месяц назад
groq came first
@ianstobie
@ianstobie Месяц назад
Heinlein came first, spelling it Elon's way. I doubt there is a legal issue as long as neither side tries to exploit consumer confusion by passing their product off as the other.
@jackflash6377
@jackflash6377 Месяц назад
Atlas Humanoid Robot
@psikeyhackr6914
@psikeyhackr6914 Месяц назад
Heinlein is going to get Musk for that.
@flinfaraday1821
@flinfaraday1821 Месяц назад
Good stuff. (slowly starting to take you seriously again after that weird one something video)
@antoniobortoni
@antoniobortoni Месяц назад
So a small vision model of low frame cuality could run in my computer and use the computer for me and do all the work shores i do soon..... and talk to him in real time??? why always big models, better data and smaller models could be better...
@117ao
@117ao Месяц назад
hehe Tesla collecting training data for robot
@dattajack
@dattajack Месяц назад
Yup. The other humanoids will look like party tricks when this all shakes out. Dojo has more data than the competition so it will win the marathon.
@obstsaladin
@obstsaladin Месяц назад
I skipped this video after five minutes because since the Gemini demo video I don’t trust any AI marketing anymore. The examples are with 100 percent certainty hand picked and curated. I‘ll wait until I see the actual model in action.
@soggybiscuit6098
@soggybiscuit6098 Месяц назад
You cant trust Google period
@mcombatti
@mcombatti Месяц назад
Grok model = llama2 Grok vision model v1.5 = llavav1.5 The weights don't lie 😮 Elon is literally just using open-source models with fine tunes. He released them under open source, not because he's generous... rather, because the open source licenses mandate that any changes or improvements must be made open-source. 😂
@ThoughtFission
@ThoughtFission Месяц назад
A little premature to get so excited I think. All of these examples, and the new in house created benchmark metric, were provided by the Church of Elon which isn't exactly known for giving balanced views of itself. I'm not saying it won't be the best. Just saying it's probably worth waiting until it's released into the wild. Kind of like car manufacturers giving mileage estimates for their own cars.
@KimmieJohnny
@KimmieJohnny Месяц назад
Nice. I hate to admit. I do not want Elon to be right about anything. Guy scared me l. But thanks for your work!
@remarkpainting
@remarkpainting Месяц назад
I am continually amazed by Elon haters...truly impressive individual who is one of the most important warriors in the struggle to save America, and by extension, all of western civilization.
@KimmieJohnny
@KimmieJohnny Месяц назад
@@remarkpainting It s simple really. Some of us see something different. And have different feelings. That's about as deep as this particular hole goes.
@oliverhenri3477
@oliverhenri3477 Месяц назад
​@@KimmieJohnnyAnd some are delusional and are incapable of being objective.
@KimmieJohnny
@KimmieJohnny Месяц назад
@@oliverhenri3477 And some simply get their rocks off being abusive. It's a kink. No judgment. I just don't swing that way. Can't see the purpose. And it doesn't get *me* hard. So I won't be playing further.
@KrisAdamsTV
@KrisAdamsTV Месяц назад
Sounds like Elon was in charge of naming again.. Geniuses shouldn't name Twitter, AI or their children.
@joefawcett2191
@joefawcett2191 Месяц назад
Made me laugh that one of the functions now is basically r/peterexplainsthejoke
@Nico_cl
@Nico_cl Месяц назад
I like your channel, just started to watching recently though. I have a question, are you an EM fanboy?
@DihelsonMendonca
@DihelsonMendonca Месяц назад
Are you an EM hater ? 😅😅
@Nico_cl
@Nico_cl Месяц назад
@@DihelsonMendonca i wouldn't say hater. Actually, as an astrophysicist, I liked his (according to him) motivations. But then I saw the bad things that he did to her family, wife, country (during the pandemic) and everything else. I decided that the guy shouldn't have the power he has. He was corrupted by it. Anyway good luck.
@JasonMitchellofcompsci
@JasonMitchellofcompsci Месяц назад
Her son is 35.
@jumarkpelismino5632
@jumarkpelismino5632 Месяц назад
But Grok is not free.
@armadasinterceptor2955
@armadasinterceptor2955 Месяц назад
It would have to be open source, and open weight, otherwise his move to make the first grok open source, will be seen as symbolic, and petty.
@ishaanpotnis
@ishaanpotnis Месяц назад
I'm angry since when I have heard that Devin is fake
@itsmikeferrari2701
@itsmikeferrari2701 Месяц назад
Tried grok, it spoke and responded like a 17 year old boy who hates everyone and everything, except himself. Makes me wonder who they modeled it after... /s
@jeffsteyn7174
@jeffsteyn7174 Месяц назад
You do know that you reading a eval from a man that has a history of faking progress right? Two major ones fsd in 2016 where they faked videos and most recently the bot folding clothing. Where he only admitted it was remote controlled AFTER he was called out, because you could see the guy controlling it Also they clearly cherry picked evals that made them look good. 😂
@martytheman6816
@martytheman6816 Месяц назад
I find claude annoying for coding as I seem to hit prompt limits fairly fast.
@darshuetube
@darshuetube Месяц назад
What you smoking? Better than chatgpt? Visiob is old. Everyone has mulimodels.
@Prathik1989
@Prathik1989 Месяц назад
Took them so long to update the damn thing, their UI has been horrible since day 1.
@MikeMcMulholland
@MikeMcMulholland Месяц назад
"By Elon Musk."? Yeah right, buddy, that guy will never work a day in his life, he will just make dumb memes on X all day.
@travisporco
@travisporco Месяц назад
Bah. It's not available on API, therefore it's vaporware and empty promises.
@LuciousKage
@LuciousKage Месяц назад
is GROK as WOKE and chatgpt, copilot end others??? -- Why no one talks about how u cant even ask an asian joke from these models ?? or how it gets angry at you, or lazy ?? why would anyone trust these models if they are told not to tell u things ?
@staticlee4287
@staticlee4287 Месяц назад
Someone must give all these multimodal LLMs a where’s Waldo pic
@rybricknell2477
@rybricknell2477 Месяц назад
Excellent rundown as always! I'm interesting in what the comments section thinks about the rotted screw example? If you put in that same sentence into GPT-4, sans image, you still get the advice and information. Any prompt that primes the models semantic field with "safety issues" will always output safety oriented response. i.e. a question "should I do something that is safety oriented" will always output a positive response regarding that query.
@alekjwrgnwekfgn
@alekjwrgnwekfgn Месяц назад
RealWorldQA: can men have babies…?
Далее
Китайка и Пчелка 4 серия😂😆
00:19
Dancing makes everything better 🕺🏼
00:16
Просмотров 3 млн
This Chip Could Change Computing Forever
13:10
Просмотров 996 тыс.
AI Deception: How Tech Companies Are Fooling Us
18:59
This New Photonic Chip Computes in Femtoseconds
18:14
Просмотров 202 тыс.
GPT4o: 11 STUNNING Use Cases and Full Breakdown
30:56
Run your own AI (but private)
22:13
Просмотров 1,1 млн
Google Releases AI AGENT BUILDER! 🤖 Worth The Wait?
34:21
Mapping GPT revealed something strange...
1:09:14
Просмотров 143 тыс.
Power up all cell phones.
0:17
Просмотров 49 млн
Плохие и хорошие видеокарты
1:00