Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time. Learn more here: www.openai.com/index/hello-gp...
The whole translation and interpretation industry is sweating right now, and has been for a long time. The only thing stopping them getting wiped out is that you cannot hold AI accountable.
@@ldkevin2 I strongly doubt automatic fabrication and advanced fertilizers will come from a chatbot. It's good technology, but not an industrial revolution.
@@Kaienhere Not quite to the same capability. I've used Google translate in clinic and every now and then I have to hit the microphone button to keep the conversation going. It's not efficient at all when I'm looking at their eyes with the slit lamp and then I have to move all the equipment to go tap the microphone button. Also, if you say the wrong words or want to change what you said, you have to wait for it to finish speaking. And on top of that, it's really not that great of a translator. This is way better and I'm relieved I can finally freaking communicate with the patients.
There are two types of people: 1) people who see AI and say "Wow, what an amazing tech, I'm so lucky to be alive now!" 2) people who see AI and say "Wow, this is very scary, AI will doom us all!"
@@pisky5067 unfortunately they are both right. Do utopian view would be that AI and automation can take over dangerous, boring or repetitive tasks and basically start doing the work that is necessary for survival for us, us freeing up humans for better pursuits, such as creating and personal connection and the sorts of things that technology won’t ever be able to do. Unfortunately, that would require some sort of universal basic income possibly funded through taxes on automation or something of that sort, and that won’t happen until things are dire. We have structured society in such a way that we are woefully unprepared for such a massive shift.
I love how people make fun of awkward tech bros, while they use all the technology made by tech bros. Tell me Mr. social butterfly, how would you interact with a person who spoke a different language than you. Does your super awesome social skills allow you to overcome language barriers?
Yes, finally! This is actually something useful. Thank you. I saw a demo from Google doing live translations years ago, but I have no idea what happened to it. Also, we need to translate text and audio files as well so that anybody could access any knowledge they want regardless of language barriers.
As Interovert, I can imagine myself have more confidence to learn a new language in a more convenient and responsive way, can't wait, it looks so good. 🔥
@@mohammadiaa Thankfully, I don't have to resort to artificially inflating my own comments or posts. My contributions can stand on their own merits without needing to give myself an insincere thumbs up or upvote. Self-promotion of that kind comes across as insecure and desperate for validation. I'd rather let the quality of my words and ideas speak for themselves and be judged authentically by others. A little self-confidence goes a long way - no need to pat yourself on the back artificially.
You don't learn a second language to necessary be a translator, you want to talk face to face to other people not by using a phone, and u also want to engage with them, so no, it's worth it learning a second language.
No because google translate has already been able to do this exact thing for years now, no matter which language you talk in, it will instantly read it out loud in the other. It even shows the translation on the screen as you talk. It also responds at the same speed. It’s going to be more accurate since it’s programmed to be a translator and this is not.
Definitely, the way it speaks Spanish is much less natural and fluid. Also, the guy is clearly speaking Spanish from Spain, but ChatGPT is translating with a pretty strong Mexican accent.
I love how you were struggling to rotate the camera the whole time 😂 It was kind of weird that the table was detected as a stylish person with a leather jacket.
Very useful for traveling to countries you don't know the language, but I think real time translators and interpreters are still needed because conversations depends a lot of the context, and sometimes you don't need to translate literally all the information to the other person to understand the message. Anyway, nice tool
This is amazing, but only if there isn't any background noise. Great for one on one meetings in a quiet room, but this will not work well outside of that.
Hi OpenAI, first off, thank you. Thank you for continuously providing not only Opened Sourced ChatGPT for more technically inclined individuals to continue in the research and development and enabling such an upgraded GPT-4o for users to fully grasp AI’s rapid expansion and evolution in the industry. With that said, thanks again for the extended free prompts for testing such features. As mentioned within a WIRED article recently published, “Just know that you’re rate-limited to fewer prompts per hour than paid users, so be thoughtful about the questions you pose to the chatbot or you’ll quickly burn through your allotment of prompts.” Kudos. I can’t wait to be able to pick and choose between consented personality voices of individuals partnering with OpenAI for a more personalized “voice” of preference when interacting with GPT and enabling its latests search and translation features.
Thank you for the video. I am currently living in Osaka, Japan and I am very interested in Instant Translation with AI models. However, what I understand by "Instant Translation" is not: "I say a sentence - The model translates it after a few seconds and I can hear it - I say another sentence - The model translates it after a few seconds and I can hear it..." What I understand by Instant Translation is: "You are talking in Japanese and, while you are talking in Japanese (with a delay of a few senconds), I listen your speech in Spanish. No matter how long it is the speech. May be the Japanese speech is 10 minutes long and I can begin to listen to it after 5 seconds in Spanish and will end 5 seconds after finishing in Japanese". Basically it is like having a interpreteur by your side who doesn't have to wait until the end of the speech to begin translating. That way, the conversation gets more fluid. I know this is not an easy task, as there are SOV and SVO languages. However, I think that Seamless m4t model is able to take this into account aswell. Do you think is it possible to implement such a thing with this model?
The rippling in the water, would seem to indicate that a feeding frenzy of some sort is underway and things might start to get, what was that heathen's 'theory' called again. I remember the theory was eponymously titled. Ahhhh, now I get it. "It" being exactly just what, I'm not so sure about."
I'd be curious to see how well this works as a private language teacher. I could see myself use this for hours trying to learn and practice a language. My worry is it not pronouncing words correctly or it not being able to switch between english and another language mid sentence
damn thats pretty impressive/cool no ones gonna have to learn a language lol! that being said the spanish one does sound slightly robotic compared to the english one
Uh to get the right amount of intonation in Spanish is real crazy / people talk about job lost - but imaging having an universal translator at all times
I tried this with the current voice model and it forgets to translate after a short time and starts responding directly to what is being said. Has anyone else experienced this? I'm wondering if it will work better with the new voice mode. The demos always show only very short conversations.
Soon they will have translation automatic where you can talk to someone on the phone and it will instantly translate to you and vice versa. We'll have the real-life Star Trek voice interpreter. In fact, it's already here.
There are still plenty of words and sayings that don't have direct translations. I wonder how this would handle those situations. Also, what if someone is speaking sarcastically? Will it convey that to the other person?
It can summarize what they're saying, it doesn't have to directly translate it, it depends on the context. Just pretend you're talking to a human in regard to it's ability to understand sarcasm.
Hi, very interesting but which app I can use to achieve what you showed in this video. I've downloaded official ChatGPT app but not able to succeed to what you did in this video
Suddenly my mobile interface looks like desktop version of ChatGPT, which is to say, not at all interactive. Instead of the orb, there's a text entry field. But it worked before. Now it doesn't. I wonder waht gives.