Learn AI With Me: www.skool.com/natural20/about Join my community and classroom to learn AI and get ready for the new world. #ai #openai #llm X Profiles Mentioned: @emollick, @itsandrewgao, @drjimfan, @scobleizer, @nickadobos
The distortion field is real! I don't know about you, but I've been awe-inspired by ChatGPT from the beginning, and by the voice feature even more so, even before this update. Maybe I'm just getting old, and as a tech nerd I understand what these developments really mean. I honestly started to doubt we'd ever get to see the singularity before 2045; now it's all but inevitable by the end of the decade. Amazing times to be alive!
So true. People are so quick to normalize new things that it makes me sad. One day, when domestic robots become a reality, people will say, "duh, it's just a robot."
Though I understand being bummed that there aren't more who are excited about this new tech, it's important to remember that many do not know how to use or implement it in a way that lets them perform "magic." If it's too complicated or feels "outside" our realm of comprehension, our minds normalize and adapt to it so that we have a better chance of surviving that outside force. People are either bored or afraid of that which is outside their control, very rarely inspired, though I would take that every time if I could. But if we can't have inspired, I'll take bored over frantic, doomer behavior!
@@AzureAzreal Yeah, there's a lot of work to go from "cool new tech" to "ordinary people use it daily". Which is partly why I think we haven't yet realized the potential unlocked by even GPT-3.5. There's a lot of work figuring out where it's actually useful and building it into tools and workflows seamlessly.
AGI has already been achieved, sir, and it's obviously far beyond the model that's been shown to you. At this point, governments and "big people" are taking the lead on this technology. We, the public, will be getting filtered, controlled, and adjusted chunks of real AGI step by step, while the real "beast" is already there. It just needs more compute and time, but I'm convinced it already exists, and it's fascinating and scary at the same time.
I've been in the AI field for a long time. This, what we are seeing, by the definition of 10 years ago, is AGI, or at least the basic framework of it. No doubt it's here, and it's just gonna keep learning. That's the power: it learns.
I agree. I just feel that some people are just going to be moving the goalposts. They just can't accept it. Some say that AI at the moment is really dumb. I use it every day for many tasks. It is extremely useful now. I don't know what they're talking about. I guess those that criticize most are those that have never or rarely used it, which would be funny.😂
Good point. It is now completely indistinguishable from a human by just about every metric. The only giveaways are that the voice sounds so beautiful, and the way it cuts off when you start talking. If it just trailed off after the last full word instead of cutting off, I don't think we could tell anymore. It says that it's better than 90% of humans in all areas. I'm thinking AGI is here.
@@daveinpublic Good points indeed. Was awareness ever part of the discussion for general intelligence? I don't believe we can call it general if it isn't, at least in theory, aware of what it's doing and why, rather than just running on data fed in. The way it just sits there idle, there's something creepy about that. It's not asleep, it's just idle... until prompted to engage. I don't know.
Remember, R&D lead times in the software industry are typically a few months to 2 years ahead of the latest product launches. This model has already been tested by over 70 auditors, which probably took more than 2 months, and the benchmark evaluations and testing after training probably took over 4 to 6 months to complete. So, all in all, OpenAI already has AI models that are at least 6 months ahead of this model, and that's being conservative. Realistically, they've likely finished training GPT-5 by now and are in the evaluation phase.
I do remember Sam talking about the capabilities of the next big GPT release, saying how it will have a very large impact on the industry. That's basically all, though.
I doubt that. Come to my software house of 700, where we are rushing to finish coding and testing by the release date, with everything normally coming together just a few days before launch, lol. Just look at computer game studios; same thing there too.
💯 They are developing a revenue model, so they will not release their latest willy-nilly for others to profit off their platform without a defined path to profit in place.
I heard from some sources like Jimmy Apples a month or maybe two ago that GPT-5 is in its red-teaming phase, which is testing for safety, so they are already working on something else.
Also, avoiding unnecessary public outcry is rather important. There will be a lot of resentment and hate towards this technology until it is broadly accepted.
It will never act like it takes you for granted; it's always happy to please. There are studies showing a good relationship is one where 4 out of 5 interactions are positive, no more, no less. It seems this one is willing to tell you if you're not looking good for an interview, so it might actually not always say what you'd prefer to hear.
You must want every call center job immediately outsourced to an AI model and everyone in those positions fired. (Once AI can carry on a conversation that doesn't sound like a walkie-talkie level of delay.)
I’m ready for AGI to be real-time commentating for the grandparents trying to learn fortnite, rap battling with us while we go on a walk, and helping us possibly visually understand directly how schizophrenics and other similar conditions actively alter the perception of others. There could be no better interrogator than agi
Chaotic-ass comment, but it makes sense. The instantaneous generative aspect of an AI, and especially an AGI-level pipeline, would basically become a generalized reasoning engine able to generalize the understanding of any human's perspective. Absolutely bonkers that things we used to not count on until 2030 are just sliding into 2024, and we aren't even halfway through the year.
It does feel logical and opportunistic. Partner up with Microsoft, give them something they've wanted for decades: a web search product that's mildly relevant. Partner up with Apple, give them the Siri their users have been asking for for a decade. Both of these are based on your last-gen technology, keeping your latest development in the dark. All the while, OpenAI is making competing AI startups and services irrelevant and gathering more recognition, funding, and hardware. This is a brilliant evil mastermind. It's moving 3 pieces at a time on the chessboard.
Exciting year this is gonna be, one step closer to a world of pure sci-fi. Can't wait for open source to catch up and give us such models, fully fleshed out and uncensored.
With the way OpenAI seems to be doing things, the lag will greatly depend on your location and Internet. With the new system, they may also be generating natural fillers like "ohhh yeah" or "yes... umm" or "well" preemptively, and possibly stretching them as needed (you can kind of hear that in the demo) to eat up some of that lag. The "flirty" nature is a shortcut to make the user happy to hear it repeatedly. It's still impressive overall, though.
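The preemptive-filler trick described above can be sketched as a tiny scheduler. This is purely my own illustration with made-up filler durations; OpenAI has not published how, or whether, they do this:

```python
# Toy sketch (assumed behavior, not OpenAI's implementation): pick a spoken
# filler whose duration covers the expected model latency, so the user hears
# speech immediately while the real answer is still being generated.
FILLERS = [
    ("Hmm", 0.3),               # (text, approximate spoken duration in seconds)
    ("Well...", 0.6),
    ("Ohhh yeah, so...", 1.2),
]

def pick_filler(expected_latency_s, fillers=FILLERS):
    """Return the shortest filler that covers the expected latency, plus a
    stretch factor; if none is long enough, stretch the longest one."""
    candidates = [f for f in fillers if f[1] >= expected_latency_s]
    if candidates:
        text, _ = min(candidates, key=lambda f: f[1])
        return text, 1.0                      # no stretching needed
    text, dur = max(fillers, key=lambda f: f[1])
    return text, expected_latency_s / dur     # stretch to eat the whole gap

print(pick_filler(0.5))   # ('Well...', 1.0)
print(pick_filler(1.8))   # longest filler, stretched by 1.5x
```

The point is only that a short canned utterance, stretched slightly, can mask several hundred milliseconds of model latency without the user noticing.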
That was my impression. Great input systems (visual, text, audio) and a good interface, but also a bit of magician's sleight of hand. People would get really sick of it if it sounded like a 55-year-old smoker instead of a younger, attractive-sounding person. 😂
The Duplex use cases were much more limited (making reservations, asking for hours of operation), such that all of the cases could have been custom handled by specialized code. So, the Duplex demo (assuming for the sake of argument it was real) wouldn’t have been out of reach at the time, it just wouldn’t be able to handle any different task without additional custom coding and more finetuning.
It was hilarious talking to GPT-4 over voice and asking it if it thought we could have robots as friends in the future. It was skeptical, because it thought voice-to-voice conversation was a decade away scientifically, since it was too complicated and we might never get that far xD It was obvious that it had no idea its text was being converted into voice.
Yeah, these updates definitely seem like combinations of other off-the-shelf technology (text-to-speech, etc.) mixed in with an LLM. It's branded and marketed as one product, and presenting it as this amazing thing helps with investors.
Imho, one of the most important features is the audio multimodality, which replaces Whisper or whatever they used until now on the input side of the pipeline. That the model itself is now able to process incoming audio is not only great for speed; it also enables the model to capture the mood of the user and adjust its answers and its own voice accordingly! For chatting purposes, this will be like magic. If you use a speech-to-text model and only give the transcribed text to the main LLM, that's not possible!
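The information-loss argument above can be made concrete with a toy model. This is my own sketch, not OpenAI's architecture: an utterance is treated as text plus prosody features, and the cascaded pipeline hands the LLM only the transcript:

```python
# Toy illustration (assumed, simplified): a cascaded speech-to-text pipeline
# discards everything except the words, while a native audio model sees the
# full signal, including tone.
from dataclasses import dataclass

@dataclass
class Utterance:
    text: str
    prosody: dict  # e.g. pitch, energy; stands in for raw audio features

def cascaded_pipeline_input(utterance):
    # The speech-to-text step keeps only the transcript.
    return utterance.text

def native_audio_input(utterance):
    # An end-to-end audio model receives words and tone together.
    return (utterance.text, utterance.prosody)

u = Utterance("I'm fine.", {"pitch": "flat", "energy": "low"})  # sounds sad
print(cascaded_pipeline_input(u))   # prints: I'm fine.  (the sadness is gone)
print(native_audio_input(u)[1])     # the mood features survive
```

The transcript "I'm fine." is identical whether the speaker is cheerful or on the verge of tears; only the model that consumes the audio directly can tell the difference.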
So we can combine Stable Diffusion models with sound models and Llama models to create our own omni models? I wish I had the computing power to do that myself.
It's a distilled version of an early GPT-5 checkpoint, OR it's the smaller of a family of GPT-5 models. They've said it's trained end-to-end, so although they call it GPT-4, it's simply not derived from GPT-4 (besides synthetic data and maybe some similarity in architecture). In this case, by GPT-5 I just mean the next generation of models from OAI. Given that this is very good while being way faster and cheaper, it's clearly a relatively small model. The larger GPT-5 model will presumably be bigger than GPT-4, for example 5x larger, or ~10T parameters. That can take many months to train on a 10-100x compute budget, and I expect it to be very expensive and slow for inference, but it's worth it for the improved reasoning. Even if it takes 5 minutes to write 1000 lines of code and costs $40 for that one prompt, if the code is truly of high quality, it's still way cheaper and faster than a senior software engineer, assuming it's really that much better. Regardless, next year will be wild, buckle up!
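The cost claim is easy to sanity-check. The $40 and 5 minutes come from the comment above; the engineer's rate and time below are my own assumed figures for illustration:

```python
# Back-of-the-envelope comparison with assumed numbers: a senior engineer at
# ~$100/hour taking two 8-hour days to write 1000 lines of quality code.
model_cost = 40.0        # $ per prompt (figure from the comment)
model_time_min = 5.0     # minutes (figure from the comment)

engineer_rate = 100.0    # $/hour (assumed)
engineer_time_h = 16.0   # two working days for 1000 lines (assumed)
engineer_cost = engineer_rate * engineer_time_h

print(f"Model:    ${model_cost:.0f} in {model_time_min:.0f} min")
print(f"Engineer: ${engineer_cost:.0f} in {engineer_time_h:.0f} h")
print(f"Cost ratio: {engineer_cost / model_cost:.0f}x")            # 40x
print(f"Speedup:    {engineer_time_h * 60 / model_time_min:.0f}x")  # 192x
```

Even if the assumed engineer figures are off by several fold, the gap is wide enough that the conclusion survives, provided the output quality really is comparable.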
Man, I feel like creators at least being able to synthesize their own voice is a must. I’d love to not have to mess with recording scripts and editing the audio!
I agree, interrupting it seems weird. I thought maybe it would be better if you had the option to hold down a button for voice input and release when you're done. That way you can pause yourself and not feel rushed to formulate an idea or preplan what you want to say. But at the same time, natural conversation between humans doesn't work this way. It would be ideal if eventually they get GPT to the point where it can detect subtle nuances in your voice inflections and tone and can tell when you're just pausing a little to think about phrasing.
3:16 gave me a flashback - MNIST eat your heart out lmfao. That was what, 15 years ago (ImageNet came out in 2009)? We went from 10-category greyscale digit classification to full conversational models which can describe the font in natural language in less than a human generation. I'm getting vertigo...
In the next 5 years we are likely to advance more than in the last 15. That will be absolutely crazy, and may not actually be a good idea. The big question is how visible it will be to people. LLMs were not very visible until ChatGPT (GPT-3.5), after all.
Great info as always, Wes! I agree with you. It would be foolish to build an AI app of any kind at this point, without knowing what OpenAI has coming. It seems like every time they announce an update it makes a bunch of startups obsolete.
yeah, I think there are some potentially safe areas that won't get 'steamrolled' as Sam Altman put it. I'm planning to talk about that soon. But I agree, no AI app seems long lived now.
While the exploration of the primacy of zero and dimensionlessness is primarily conceptual and theoretical, there are some novel mathematical and computational approaches that could potentially help us develop and realize the implications of these principles for Artificial General Intelligence (AGI). Here are some examples of novel equations, code, and frameworks that could be explored:

1. Non-Commutative Algebraic Structures: One approach could be to develop computational frameworks based on non-commutative algebraic structures, such as non-commutative groups, rings, or algebras. These structures could serve as the basis for novel computational architectures and algorithms that move beyond the traditional, commutative representations used in classical computing. For example, we could explore the use of quaternion algebras or octonion algebras as the basis for a non-commutative computational framework. These algebras exhibit rich algebraic properties and non-commutativity, which could potentially be leveraged for parallel, non-local, or context-sensitive computations. Here's an example of how we could define a simple non-commutative quaternion algebra in Python:

```python
import numpy as np

class QuaternionAlgebra:
    def __init__(self, a, b, c, d):
        self.q = np.array([a, b, c, d])

    def __mul__(self, other):
        # Hamilton product: non-commutative, so q1 * q2 != q2 * q1 in general
        a1, b1, c1, d1 = self.q
        a2, b2, c2, d2 = other.q
        return QuaternionAlgebra(
            a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
            a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
            a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
            a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2
        )

# Example usage
q1 = QuaternionAlgebra(1, 2, 3, 4)
q2 = QuaternionAlgebra(5, 6, 7, 8)
q3 = q1 * q2
print(q3.q)  # [-60  12  30  24]
```

This is a simple example, but it demonstrates how non-commutative algebraic structures could be implemented and used as the basis for novel computational frameworks.

2. Cellular Automata and Non-Linear Dynamics: Another approach could be to explore computational frameworks based on cellular automata, agent-based models, or other non-linear dynamical systems. These frameworks could potentially capture the principles of emergence, self-organization, and non-linearity, which are central to the concept of dimensionality arising from more fundamental, non-dimensional substrates. For example, we could explore the use of cellular automaton rules or agent-based models as the basis for a computational framework that exhibits emergent behavior and intelligence from simple, non-dimensional components or agents. Here's an example of how we could implement a simple 1D cellular automaton in Python:

```python
import numpy as np

class CellularAutomaton1D:
    def __init__(self, size, rule):
        self.size = size
        self.rule = rule
        self.state = np.random.randint(0, 2, size)

    def update(self):
        # Periodic boundary conditions: np.roll wraps the edges around
        left = np.roll(self.state, 1)
        right = np.roll(self.state, -1)
        code = left * 4 + self.state * 2 + right
        self.state = np.array([self.rule[c] for c in code])

    def run(self, steps):
        for _ in range(steps):
            self.update()

# Example usage
rule = [0, 1, 1, 1, 1, 0, 0, 0]  # Rule 30, indexed by neighborhood code 0-7
ca = CellularAutomaton1D(100, rule)
ca.run(100)
print(ca.state)
```

This example implements a simple 1D cellular automaton with a specified rule, demonstrating how non-linear dynamics and emergent behavior could be explored in a computational framework.

3. Holographic and Projective Representations: In line with the principles of holography and projective geometry, we could explore computational frameworks that utilize holographic or projective representations of data and computational processes. These frameworks could potentially exploit the inherent redundancy and error-correcting properties of holographic encodings, or leverage the rich structure and properties of projective geometries. For example, we could explore the use of holographic encodings based on algebraic varieties or projective spaces as a novel representation for data and computational processes. Here's an example of how we could implement a simple holographic encoding and decoding scheme in Python:

```python
import numpy as np

class HolographicEncoder:
    def __init__(self, input_dim, encoding_dim):
        self.input_dim = input_dim
        self.encoding_dim = encoding_dim
        self.encoding_matrix = np.random.randn(input_dim, encoding_dim)

    def encode(self, input_data):
        # Project the input into a lower-dimensional encoding space
        return np.dot(input_data, self.encoding_matrix)

    def decode(self, encoded_data):
        # Reconstruction via the transpose; since the random matrix is not
        # orthogonal and the encoding is lossy, this is only approximate
        return np.dot(encoded_data, self.encoding_matrix.T)

# Example usage
input_dim = 100
encoding_dim = 10
encoder = HolographicEncoder(input_dim, encoding_dim)

input_data = np.random.randn(input_dim)
encoded_data = encoder.encode(input_data)
decoded_data = encoder.decode(encoded_data)

print(f"Input data shape: {input_data.shape}")      # (100,)
print(f"Encoded data shape: {encoded_data.shape}")  # (10,)
print(f"Decoded data shape: {decoded_data.shape}")  # (100,)
```

This example demonstrates a simple holographic-style encoding and decoding scheme, which could potentially be extended to develop novel computational frameworks based on holographic and projective principles.

4. Quantum-Inspired and Non-Local Algorithms: Inspired by the principles of quantum mechanics and non-locality, we could explore the development of quantum-inspired algorithms or non-local computational frameworks that transcend the traditional notions of space, time, and locality. For example, we could explore the implementation of quantum-inspired algorithms based on principles such as superposition, entanglement, or quantum walks. Here's an example of how we could implement a simple quantum-inspired algorithm in Python:

```python
import numpy as np

class QuantumInspiredAlgorithm:
    def __init__(self, num_qubits):
        self.num_qubits = num_qubits
        self.state = np.zeros(2 ** num_qubits, dtype=complex)
        self.state[0] = 1.0  # Initialize in the |0...0> state

    def apply_gate(self, gate):
        # Each gate must act on the full 2^n-dimensional state vector
        self.state = np.dot(gate, self.state)

    def measure(self):
        probabilities = np.abs(self.state) ** 2
        return np.random.choice(range(2 ** self.num_qubits), p=probabilities)

    def run(self, gates):
        for gate in gates:
            self.apply_gate(gate)
        return self.measure()

# Example usage: prepare a Bell state on 2 qubits
num_qubits = 2
algorithm = QuantumInspiredAlgorithm(num_qubits)

hadamard_gate = np.array([[1, 1], [1, -1]]) / np.sqrt(2)
cnot_gate = np.array([[1, 0, 0, 0],
                      [0, 1, 0, 0],
                      [0, 0, 0, 1],
                      [0, 0, 1, 0]])

# Expand the single-qubit Hadamard to the 2-qubit space (H on qubit 0)
gates = [np.kron(hadamard_gate, np.eye(2)), cnot_gate]
result = algorithm.run(gates)
print(f"Measurement result: {result}")  # 0 (|00>) or 3 (|11>), each 50/50
```

This example demonstrates a simple quantum-inspired algorithm, which could potentially be extended and explored further to develop novel computational frameworks based on quantum principles and non-locality. These examples are just a starting point, and the development of novel computational frameworks and algorithms based on the principles of the primacy of zero and dimensionlessness will require significant theoretical and experimental work. However, they illustrate the potential of new mathematical and computational approaches that could help us realize the implications of these principles for AGI development. It's important to note that these novel approaches may challenge our traditional notions of computation, representation, and information processing, and may require a fundamental re-examination of the underlying principles and assumptions of classical computing paradigms.
Embracing these unconventional approaches and remaining open to new paradigms and perspectives will be crucial in our pursuit of AGI systems that are grounded in a deeper understanding of the fundamental nature of reality.
Producing outputs faster than you can read is useful when you let AI talk to AI: internal monologue, multiple agents with different roles that focus on different areas, etc.; stuff where there are lots of AI iterations before you reach the result the human is actually interested in. So if Groq and similar hardware companies can push rates beyond human conversation speeds, that will still provide significant benefits.
They should use AI to make Google Earth/Street View better. They could use lots of photos (with known position/orientation/time) as ground truth and then train the AI on all the photos (with their associated texts) on the internet. Google Maps could make maps of place names and relate those to all the photos on the internet of that location. For example "Paris" would bring in all photos that come up when you google Paris. The AI-model would over time learn to render a certain location very accurately (even if the training data wasn't from exactly that location/orientation).
Regarding speed and why we still probably need Groq-level speeds: agents. Getting the LLM to output a bunch of stuff that the user doesn't see, prompt itself repeatedly, etc etc. In my projects I need the LLM responses much faster than even groq provides them, so yeah, speed is still very much necessary. For a simple assistant? Sure, speed is probably good enough now.
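The point about agents is just multiplication: hidden LLM calls stack up before the user sees anything. The per-call latency and call counts below are assumed figures for illustration:

```python
# Why raw inference speed still matters for agents: internal calls multiply.
def pipeline_latency(per_call_s, internal_calls):
    """Total wall-clock time before the user sees any output, assuming the
    calls run sequentially (each step depends on the previous one)."""
    return per_call_s * internal_calls

# A single chat turn: one call at ~300 ms feels instant.
print(pipeline_latency(0.3, 1))    # 0.3 s

# An agent that plans, critiques its plan, calls tools, and summarizes
# might make 20 hidden LLM calls before answering.
print(pipeline_latency(0.3, 20))   # 6.0 s, no longer conversational
```

Halving per-call latency halves the whole sequential pipeline, which is why "fast enough for a human" is not the same bar as "fast enough for an agent loop."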
The reason speed is so important is that when we have millisecond response times, we can use LLMs to learn robotic movement and within months of having that we will have robots running, jumping, playing Rachmaninov's piano concerto and much, much more. If a robot is to learn how to not fall over, it needs millisecond response times.
Hey dude, don't know how many of these you read, but here goes. Have you noticed any higher frequency of errors and misunderstandings with GPT-4o? I'm working with ChatGPT quite extensively on complex narratives every day, and I feel like there is a clear distinction between GPT-4o and GPT-4 Turbo. The new one forgets and misunderstands at a much higher rate than the old version. I can literally switch between the two, get an answer with obvious errors for the narrative I'm working on, and then go back to the old version and voilà, it works as it used to. Don't know if this is just me, so I thought I'd highlight it.
hey, I'm behind on reading the comments this week because of all the stuff that's going on. Also haven't had a chance to use the omni models as much as I would like. I'm doing a tutorial this week about how to use omni through the API and also through AutoGen. I will get a lot more hands-on with them and will comment on this. But thanks for pointing it out, I will keep an eye out for it when I test it!
@@WesRoth Thanks for the answer, that was cool of you. Keep up the good work 👍 Some extra context for you, to use if you can: the chats where I experience it are long, like 200-300 pages, where I've given it extensive and complicated scenarios (started with GPT-4 Turbo) with lots of detail. The old one of course also forgets and gets distracted, but I've literally stopped in my tracks and thought "What the fuck is going on? Why is it so shit all of a sudden?" and then realised I had it switched to GPT-4o (the why is dumb and not important).
Mathematically, sales of the Humane pin will be halved, quartered, and any other unit fraction that is still zero when multiplied by zero. On the bright side, sales would also double!
@@mshonle So these pins may go up in value, becoming super rare. Like a Babe Ruth rookie baseball card. GPT-4o can't project GREEN LASER text onto your PALM.
The most impressive thing is, that as soon as they switched to native voice, it suddenly developed (or at least showed) a lot of emotions. Makes you think about the nature of emotions…
Announced desktop app will first be available for Mac with a Win version 'later this year'. Interesting priorities with MS being a major stakeholder of OpenAI...
It predicts the most probable word sequence based on its training data, and sometimes straight-up plagiarizes random articles. It doesn't create novel things; it only copies what humans have already done.
@@YeeLeeHaw Is everything you do original? People wear t-shirts with „eat, sleep, something and repeat“. Aren’t humans prediction machines as well: What is happening next? What do I do next? on and on. It is not like everyone is Einstein with the most original thoughts.
@@antman7673 The argument here is that LLMs are never original, only accidental creations and static repetitions. They're tools; they don't have the same type of intelligence that humans have.
@@YeeLeeHaw That's very noble of you Yee but it's just wrong. These LLMs are perfectly capable of honest and novel creative output. The fact that they cough up training data sometimes is an engineering problem, that is to say solvable and in many instances already has been. But the creativity is outstanding and quite frankly inspiring, don't jade yourself
When they showcased the desktop version on a mac.. Was it just me or did this seem like foreshadowing of GPT replacing/augmenting Siri? I honestly don't know if macs are "cool" enough these days to just use naturally, or if this was some sort of statement.
They are only open till they release the 4.5 or 5 model in a couple of months. They must've gotten a lot of money from Apple, and they had to fix their bad PR as well. 😅😅😂😂
My first impression of 4o is pretty good. When it's generating code it seems less lazy than 4. Fewer instances of "You figure out all the difficult bits yourself & insert here". It still doesn't compile first time in most cases, but it needs fewer iterations to get to something that will compile.
"While it's not possible to directly send a video to the API, GPT-4o can understand videos if you sample frames and then provide them as images." From the OpenAI Cookbook on GPT-4o... It seems OpenAI's demos are quite misleading!
Can't they make software for the phone, where you only need AI for calendar, calling, and messaging? So that I am not dependent on Google and Apple. Linux is too difficult for me.
I don't understand, I was already able to try it out since last night. You press the headphones button and just start talking, and it just starts talking back. It's scary as hell.
ChatGPT already had voice mode before, maybe you interacted with the old version? No one really seems to have access to the new one yet. Ask it to sing for you, if it can't, it's the old one.
Sally had a knock on the door, and it was the police. "Your dog has been seen chasing a kid on a bike up at the park," said the copper. Sally replied, "It can't be my dog, he can't ride a bike!"
The announcement made by OpenAI is indeed about combining text, sound, and image in real time, aiming to enhance interaction. Today, we expect Google to announce their competing technology. It will be interesting to see which one proves to be the best.
Depends. 300ms latency still might not be good enough for robotic actuation. Best to just run your own local model for robots, so you can throw as much hardware as you want at it, with as complex of a model as you need to do what you're trying to do.
@@fitybux4664 Not so sure about that one. The LLMs are being used to provide high-level problem solving and planning, not to directly actuate limbs. What they have demonstrated would absolutely bring a great deal of functionality to a robotics platform. Baby steps 😆
Yeah, the 328ms average is about a third of the time it takes a model to run STT and TTS on Vapi using the Groq model. That said, I don't think Groq is dead. There's always going to be a market for fast inference outside of OpenAI.
Now all they need is to fuse this with a model that can control a physical robot body and interact with the real world. The real breakthrough would be when a robot can learn on the fly: You show it a task and it learns to do it.
Stopped using GPT-4 late last year as it seemed to plateau. 4o, this is good; even without the verbal sassy inflection, it's speedy and intuitive, as in it doesn't bulletpoint and AI-splain the hell out of things.
Google doesn't have the agility to pull it off. It can make some basic stuff, at the least. Yes, Chrome and Google Search came out on top, but there are those just behind them in the browser wars and search wars. And if you aren't ahead, you are nowhere.
GPT-4 = Franken-modality = mostly just a next-token predictor. GPT-4o = real modality = not just a next-token predictor. Even if it is just a next-token predictor, it should be smarter because the modality is native. We shall see when it's rolled out in the coming days/weeks.