OpenAI Realtime API - The NEW ERA of Speech to Speech? - TESTED 

All About AI
168K subscribers
14K views

Published: 15 Oct 2024

Comments: 35
@KCM25NJL 11 days ago
Yeah, no basement-dweller devs are gonna be messing with that API until the costs drop by at least 100x, which I honestly only see as a near-term incentive for Meta to get a Llama Voice model cookin'.
@jamesjonnes 9 days ago
I'll use it, but I can't wait for an uncensored open-source version. Text only is too boring. I lack the patience to use text only for long on the tasks I want, like learning languages.
@boxeemusic 10 days ago
Where can I find the code? Please help.
@DarrenJohn10X 11 days ago
Looking forward to seeing your alleged "spaghetti" code! (Right now your latest repo is from 2 weeks ago.)
@sykexz6793 11 days ago
I don't think this is the same model as advanced voice mode.
@khanhhq2044 5 days ago
Can you share the repo link?
@三川富資訊股份有限公 9 days ago
The Realtime API cost is high. I suggest a cheaper way: 1. Use Google STT to transcribe the user's speech. 2. Send the text to GPT. 3. Get the response from GPT. 4. Send the response to Google TTS. 5. The user gets the AI response as both text and voice. The response time is longer, but the cost is lower.
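The five steps in the comment above can be sketched as a single function that chains three stages. This is only a minimal illustration, not the commenter's or the video author's code: the names `cascaded_turn`, `Turn`, `fake_stt`, `fake_llm`, and `fake_tts` are hypothetical, and the stand-in stages would have to be replaced by real calls to a speech-to-text service, a chat model, and a text-to-speech service.

```python
from typing import Callable, NamedTuple


class Turn(NamedTuple):
    """Result of one conversational turn (hypothetical container)."""
    user_text: str
    reply_text: str
    reply_audio: bytes


def cascaded_turn(
    audio_in: bytes,
    stt: Callable[[bytes], str],
    llm: Callable[[str], str],
    tts: Callable[[str], bytes],
) -> Turn:
    """Run one turn through STT -> LLM -> TTS, one stage after another."""
    user_text = stt(audio_in)      # step 1: speech to text
    reply_text = llm(user_text)    # steps 2-3: send the text, get the response
    reply_audio = tts(reply_text)  # step 4: response text to speech
    # step 5: the caller receives the response as both text and audio
    return Turn(user_text, reply_text, reply_audio)


# Stand-in stages (assumptions) so the sketch runs without any cloud account.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


if __name__ == "__main__":
    print(cascaded_turn(b"hello", fake_stt, fake_llm, fake_tts))
```

Because each stage waits for the previous one to finish, the delays add up, which matches the comment's point that this route responds more slowly than a single speech-to-speech call.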
@OliNorwell 11 days ago
Great work! You must have had a busy couple of days getting it working
@meetsummdev 10 days ago
You can really implement it in a few hours.
@drewpeer 6 days ago
Does everyone have access to this beta? Anything we have to do?
@pjm17 7 days ago
Could you achieve these results in an app using just text-to-speech and speech-to-text with native iOS features alongside OpenAI's non-realtime APIs?
@benbrahimjamil1976 19 hours ago
How do I get the repo?
@jamesyoungerdds7901 11 days ago
Great video, thanks Kris! I'm interested in the function calling and structured output from the voice websocket return. Can you use agents or agentic flows with constrained and structured outputs with the voice mode? 🤔
@Akander20 11 days ago
Where can I get the repo?
@Bangs_Theory 11 days ago
Which function controls the interruption?
@gaijinshacho 11 days ago
VAD
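The reply above points to voice activity detection (VAD): once the user's speech is detected, the assistant's playback can be cut off. The thread does not describe how the detector in the video works, so the following is only a generic energy-threshold sketch and not the Realtime API's detector. The function names, the `threshold` and `min_frames` defaults, and the float sample format are all assumptions made for illustration.

```python
import math
from typing import Iterable, Optional, Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one audio frame (samples in [-1.0, 1.0])."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def detect_speech_start(
    frames: Iterable[Sequence[float]],
    threshold: float = 0.1,
    min_frames: int = 3,
) -> Optional[int]:
    """Return the index of the first frame in the first run of `min_frames`
    consecutive frames whose energy reaches `threshold`, or None if no such
    run occurs. Requiring a run filters out single-frame clicks."""
    run = 0
    for i, frame in enumerate(frames):
        if rms(frame) >= threshold:
            run += 1
            if run >= min_frames:
                return i - min_frames + 1
        else:
            run = 0
    return None


if __name__ == "__main__":
    silence = [0.0] * 4
    speech = [0.5] * 4
    print(detect_speech_start([silence, silence, speech, speech, speech]))
```

In an app, a non-`None` result would be the signal to stop playing the assistant's audio and start listening again.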
@tommoves9935 12 days ago
Happy to be the first to comment. Kris, you are always up to date. Once again, cool stuff from you. Spaghetti code... 🤣. Great that you talked about the costs as well. I like your creative and often really funny ideas. Please keep up the great work! Regarding your phone call: I saw a video from a guy in the US weeks ago (no Realtime API) where he let his AI order a pizza, and it worked great. Latency even back then was good enough, so it should work perfectly. Maybe try it with an Italian accent 😉. Thx from Tom!
@alarconfilms1 12 days ago
What is the code used?
@khalifarmili1256 11 days ago
It's not out yet
@romera9662 11 days ago
@khalifarmili1256 How long will it take?
@ibrahimaba8966 9 days ago
I just integrated it on Twilio, it changes everything, but it took me a bit of time.
@MagagnaJayzxui 12 days ago
What is AVA?
@dievas_ 11 days ago
I still don't have access to it :/
@contentfreeGPT5-py6uv 11 days ago
I tested it yesterday, but got: Error connecting: 403 Access denied. Check your API key and your permissions to use the Realtime API.
@elprox1290 11 days ago
Try checking your API key or just making a new one.
@contentfreeGPT5-py6uv 11 days ago
@elprox1290 again, thanks
@Dea07thox 11 days ago
Can't you just prompt it better to give less talkative output, so you don't have to interrupt its response that often? That would make a big difference and make everything more seamless :)
@DesignDesigns 11 days ago
This is mindblowing...
@saksham3 11 days ago
Doesn't it have emotions?
@micbab-vg2mu 11 days ago
Thanks :)
@AI_Escaped 11 days ago
No one is even going to be able to develop at these prices other than those with deep pockets. Just testing and figuring things out would be too expensive to even try.
@thenoblerot 11 days ago
By telling it it is playing a game with the user, it might be failing on purpose to let you win!
@DhairyaMarwah-l1u 10 days ago
Can you share the repo link?
@almirkaza 10 days ago
Can you share the URL to the repo?