Тёмный
No video :(

GPT-4o Low Latency Screen to Voice Tutorial - SUPER IMPRESSIVE OCR! 

All About AI
Подписаться 163 тыс.
Просмотров 11 тыс.
50% 1

Опубликовано:

 

28 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 24   
@JohnSmith762A11B
@JohnSmith762A11B 2 месяца назад
Dude. That thumbnail is terrifying. 😂
@Ms.Robot.
@Ms.Robot. 2 месяца назад
Legit shit. A real coder pwning the Ai matrix❤.
@Ginto_O
@Ginto_O 2 месяца назад
yeah he wrote so much code
@watchdog163
@watchdog163 2 месяца назад
@@Ginto_O Where is your code?
@3choff
@3choff 2 месяца назад
Pretty cool project idea. If you don't mind, I stole it and use Gemini Flash to analyze the images; it's pretty fast too. You should try it.
@ksem1337
@ksem1337 2 месяца назад
I need of tech like that for my desktop virtual 3d assistant. I have a 3d model of a character (AI agent) that has to interact with computer in many interesting ways up to controlling pixels of the screen by itself, for example if it want to impose a an object to interact with virtual space. I hope soon enough we will have enough speed and power for AI agents to be sentient and working seamlessly with any type of information.
@BThunder30
@BThunder30 2 месяца назад
Cool. You projects are always amazing. The local open source projects are the most amazing and interesting to me.
@lokeshart3340
@lokeshart3340 2 месяца назад
U know always here to support u
@enthuesd
@enthuesd 2 месяца назад
This is great. Can we add voice prompt?
@protimaranipaul7107
@protimaranipaul7107 2 месяца назад
Being a member I have been trying to access the github repo, I have sent multiple emails to the provided email address, yet to receive a response it has been 48hrs. Please advise.
@pedrorafaelnunes
@pedrorafaelnunes 2 месяца назад
Im from Portugal, the portuguese is a mixture of mostly Portuguese from Brasil and a lil bit of Portuguese from Portugal heheh Spanish is not my primary language but it is not that bad also !
@pedrorafaelnunes
@pedrorafaelnunes 2 месяца назад
Btw nice project ! :D
@dniliveact
@dniliveact 2 месяца назад
Amazing stuff 😮
@3-deez
@3-deez 2 месяца назад
is there a copy of the code you used in the documentation you sent to OpenAI in your first prompt?
@branislannjemec9050
@branislannjemec9050 2 месяца назад
Do you know when will be having an access to gpt 4o voice api
@PTHastings
@PTHastings 2 месяца назад
🎯 Key points for quick navigation: 00:00 *🖥️ Overview of the project setup* - Setting up for screenshot analysis using GPT-4o - Detailing the low latency approach for image understanding - Collecting documentation and writing the initial iteration of the script 02:18 *🛠️ Implementing functions and configurations* - Fetching documentation from OpenAI for implementing GPT-4o with image inputs - Inclusion of functions from prior projects to streamline the process - Utilizing EnV files to fetch the OpenAI key for configuration 07:21 *🔊 Integrating text-to-speech functionality* - Obtaining OpenAI documentation for speech-to-text-to-speech functionalities - Implementing a feature to read out responses using TTS - Troubleshooting and fixing errors in the TTS APIs and configuration 10:55 *🎛️ Controlling the main function with a trigger key* - Adding a feature to control the main function trigger using a key command - Testing the control setup with screen prompts for AI responses - Demonstrating the capability of the system to respond effectively with controlled triggers Made with HARPA AI
@tumbalasu3718
@tumbalasu3718 2 месяца назад
Is that need gpu?
@taoxu1798
@taoxu1798 2 месяца назад
Awesome
@abhishekrakhe2788
@abhishekrakhe2788 2 месяца назад
Hey how do i get access to git and discord?
@protimaranipaul7107
@protimaranipaul7107 2 месяца назад
Can you please share the code
@MudroZvon
@MudroZvon 2 месяца назад
What is Anal Ysing?
@luisvictorf
@luisvictorf 2 месяца назад
Spanish isn't really Spanish if it's speaking with an US accent...
@lokeshart3340
@lokeshart3340 2 месяца назад
I am 3rd
Далее
Dark AI Agents: The Most Dangerous AI Today?
18:27
Просмотров 7 тыс.
Oh No! My Doll Fell In The Dirt🤧💩
00:17
Просмотров 10 млн
Run your own AI (but private)
22:13
Просмотров 1,4 млн
Make an Offline GPT Voice Assistant in Python
24:29
Просмотров 12 тыс.
Two GPT-4os interacting and singing
5:55
Просмотров 2,9 млн
Have You Picked the Wrong AI Agent Framework?
13:10
Просмотров 62 тыс.
Oh No! My Doll Fell In The Dirt🤧💩
00:17
Просмотров 10 млн