Тёмный

Create a ChatGPT Voice Assistant in 8 Minutes (Python Tutorial) 

Ai Austin
Подписаться 18 тыс.
Просмотров 464 тыс.
50% 1

Опубликовано:

 

24 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 659   
@Ai_Austin
@Ai_Austin 4 месяца назад
Build Your Own GPT-4o Voice Assistant in Python with Groq, Llama3, OpenAI-TTS & Faster-Whisper ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-pi6gr_YHSuc.html
@Ai_Austin
@Ai_Austin Год назад
Try the updated tutorial with GPT 3.5 Turbo, OpenAI Whisper and an open sourced Bing AI API: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-aokn48vB0kc.html
@zeshanasad1414
@zeshanasad1414 Год назад
Bro Can You Please make a Tutorial video of making The Avatar Video Like You ❣️
@grahamulax
@grahamulax Год назад
INSANE.....already an update.... hahaa this is nuts man.
@memerified
@memerified Год назад
J
@HallucinogenUsin
@HallucinogenUsin Год назад
that ending had me cracked up, thank you for that
@HabiburRahman-lq2kt
@HabiburRahman-lq2kt Год назад
Can you make it for Android?
@yurialves6237
@yurialves6237 Год назад
Can you make a video showing how you doing this avatar talk. Cheers
@mulin-ms6qg
@mulin-ms6qg Год назад
+1
@oladejiolaoluwa9155
@oladejiolaoluwa9155 Год назад
I would love to know how....
@kevinspyxboy9930
@kevinspyxboy9930 Год назад
It's actually easy you first generate a face of your choice on midjourney or dall-e and you import that picture in an app like facedance, ifunface, speakpick, etc..
@xlr555usa
@xlr555usa Год назад
Unreal Engine Metahumans can do it but it is intense, Metahumans looks more realistic also
@highwaynumber1265
@highwaynumber1265 Год назад
Cheers
@Bluem0onJ
@Bluem0onJ Год назад
I had some problems with the speak and talk part, so it ended up like a chatbot that works with "hotkeys"/command triggers for input to make specific things. Like, command trigger "music" opens a youtube playlist and things like that. I'm happy with the results :) edit: now It can "talk"... I generated some phrases on viocevox, downloaded the audio files and made it play along with the texts in the code at some key points
@axystorm5269
@axystorm5269 Год назад
can you please share your code with me? im interested in the music part. thank you :D
@SwapnilAngarkhe
@SwapnilAngarkhe Год назад
you prolly doing it in windwos. the say function doesnt work in windows its the same case as mine.
@thegamingdemon4788
@thegamingdemon4788 Год назад
Bro, can you please share your code? I kind of need it for a school project. I'll make it worth your time I swear!
@LexifromZargon
@LexifromZargon Год назад
idk if i commented before but i really enjoy this. its simple to understand and easy to follow especially youre clean code and the use of comments makes it verry easy to code allong and to customize stuff as needed.
@elijahonewheaton
@elijahonewheaton Год назад
This is great, and I'd love to try it, but the text is so small and kind of blurred that it's a challenge to make out the code. Will you add it to the description or pinned comment? That'd be really helpful.
@MarcDimmick
@MarcDimmick Год назад
If you are looking for a topic for your next video, I would love to see you take this to a web interface using flask. I have been trying so many different ways from other videos but always end up in a dark rabbit how with a chatbot, unable to find something that works. It keeps recommending code that breaks in so many ways and loses the original function that was working.
@MarcDimmick
@MarcDimmick Год назад
Great example, well explained and acutally works. I have tried multiple youtube examples and forever end up in a rabbit hole spiral with chatgpt providing corrections to then only create further errors. I really liked how you explained each function and process. I was a great tutorial in provide clear and precise instructions that were very informative. Thank you.
@Ai_Austin
@Ai_Austin Год назад
Very glad that helped, Thanks for the feedback Marc! More python tutorials coming!
@PhoenixVids123
@PhoenixVids123 Год назад
@@Ai_Austin Hi, i just wanted to ask about an error I am getting! I have done pip3 to install everything, and when I run it I get an error saying pyaudio wants installed. I go and do everything "pip3 install pyaudio" "pip install pyaudio" etc. Nothing is working, it does like half of it then says that "Could not build wheels for pyaudio" blah blah blah. Then it says that there's an error with "#include 'portaudio.h' ". Do you know how to fix this?????
@jahnaviaddanki8762
@jahnaviaddanki8762 8 месяцев назад
@@PhoenixVids123 in windows it will show for me like that, so i am using raspberry pi the pip install pyaudio worked on the pi
@r34ct4
@r34ct4 Год назад
Great tutorial, Austin. Simple, to the point. Would it make sense to upgrade this to the Turbo model now? Also, could you do a tutorial about fine-tuning {prompt: x, response: y} to clone your friends using chat history data?
@Ai_Austin
@Ai_Austin Год назад
Absolutely. If you just change the engine variable in the open ai function of the code, you can just specify “gpt-3.5-turbo”. Then it will send your prompts to the new version of the API. Fine tuning is absolutely in the video pipeline. Have a few others ahead of it but will be creating a fine tuning tutorial here soon.
@r34ct4
@r34ct4 Год назад
@@Ai_Austin looks like fine tuning is not yet available for Turbo. I've tried with DaVinci without much success
@sebaccimaster
@sebaccimaster Год назад
@@Ai_Austin Hmm I'm getting " Engine does not exist" error... What am I overlooking?
@Gotiplex
@Gotiplex Год назад
@@sebaccimaster it s not working like that you have all another syntax for the completion. Look in internet you will find your answers
@LE6ENDTV
@LE6ENDTV Год назад
Not sure if anyone asked you this... How did you create your AI visual in place of your face? I'm totally intrigued!
@mcfubumaki8359
@mcfubumaki8359 Год назад
Sure, here's an improved version of your statement: "I used ChatGPT to analyze the script of this video and engaged in a conversation where ChatGPT3 generated a micro-detailed strategy to guide you through every last detail that you might need to know. In summary, during our conversation, you asked about creating a GPT-3 powered voice assistant with Python. I provided you with a step-by-step guide that covers everything from importing necessary libraries and setting up the OpenAI API key to defining functions for transcribing audio to text, generating responses, and speaking responses. We also discussed the importance of error handling and adding additional features to improve the accuracy and usefulness of the voice assistant."
@deussayajin1441
@deussayajin1441 Год назад
Opa
@sarkis3791
@sarkis3791 Год назад
help please it says An error occured: You exceeded your current quota, please check your plan and billing details.
@marekpalczowski9722
@marekpalczowski9722 Год назад
I have just tried to do something like this for my program, but you are the first one, thank you very much, great job. Now I will use it for my program. Thank you.
@defaultHandle1110
@defaultHandle1110 Год назад
You can make a iPhone shortcut with the api. It’s one response though, but works really well. Using your api key. No programming needed.
@mailoisback
@mailoisback Год назад
How did you create your avatar and his speech? It sounds much better than the pyttsx's generated voice?
@KunjaBihariKrishna
@KunjaBihariKrishna Год назад
It's 11labs probably
@kubapuchalski8633
@kubapuchalski8633 Год назад
I understood nothing, but damn IT stuff and programing is fascinating, it would probably take me 1000 years to learn it, that is why all I can do is admire people like you.
@Ai_Austin
@Ai_Austin Год назад
I bet you could learn it. Its not reserved for some super high iq humans. Checkout the free online book “automate the boring stuff with python”. Give yourself a month. Study it 1-2 hour 3 times a week, this program will look like fluent english!
@kubapuchalski8633
@kubapuchalski8633 Год назад
@@Ai_Austin thx buddy, I'll give it a try :)
@sebastianbolo2480
@sebastianbolo2480 Год назад
The avatar is so well made, every one i saw blinks so much but this one does it at a reasonable phase.
@TungjangpoMusic-yq4rf
@TungjangpoMusic-yq4rf Месяц назад
Can you edit or train speedchrecognition library so that it will able to convert our dialect/unknown language to text
@Mansyno
@Mansyno Год назад
WOW you are an incredible tutor i have been an instructor/teacher for 30 years now and i NEVER seen code writing and concepts explained so clearly and understandable like you just did here explaining and teaching code is not so trivial as many would thing and there are plenty examples for that on the net GREAT video (and note that im not even talking about the specific content itself) keep up the good work, SUBSCIBED
@obanjespirit2895
@obanjespirit2895 Год назад
c'mon bro really? you've never in you're 30 whole years of teaching never seen it explained better amongst professional teachers? I mean sure the video is informative but c'mon.
@Gotiplex
@Gotiplex Год назад
I made a similar code, it s very easy, but you can improve yours by saving in a txt file all the questions and answers so it can memorize what you said before. You just have to give all the content of the file for each request
@MikeKleinsteuber
@MikeKleinsteuber Год назад
The big problem with this is that chatGPT is only relevant for many queries up to 2021. You really need to make this to interact with Bing Chat which has access to current data.
@Ai_Austin
@Ai_Austin Год назад
Great idea Mike, I got a Bing AI Voice Assistant Tutorial coming soon. You are right, having access to current data for our voice assistant is a huge improvement and I’m working on getting that out for you guys now! The bing voice assistant I am making will be completely free if you have beta access to bing as well. Unlimited questions.
@MikeKleinsteuber
@MikeKleinsteuber Год назад
@@Ai_Austin Sounds good, though I noticed this morning I already have both voice input and output available on Bing Chat. Don't know when Microsoft added that. Sadly you have to press the microphone icon to activate whereas it would be much more useful to be able to start with some sort of voice activation like Google assistant (especially if it could be customised). What we really need is something as interactive as in the movie 'Her' (I and many like me would pay a monthly fee for that btw) ....Keep up the good work
@VirginMostPowerfull
@VirginMostPowerfull Год назад
Even in the comments you talk like a robot my guy.
@jakesmith7428
@jakesmith7428 Год назад
@@VirginMostPowerfulllmaooo
@RealWorldMaverick
@RealWorldMaverick Год назад
Great video. As someone else mentioned, the code is a little small. In future videos would you be able to make it larger so it's easier to read.
@jordanbirley3074
@jordanbirley3074 Год назад
This can be very beneficial for those who may have a disability. Very cool
@PdWOLFG4NG
@PdWOLFG4NG Год назад
can you make this with GPT4all? would love to see a video on how to get this running on a offline system since you dont want to be depending on their model, if it gets out of hand we need backup models
@HeistAlex
@HeistAlex 10 месяцев назад
this is a great tutorial!. I really love it if you upgrade it. What i mean by upgrade is that, import the python programme in to any type of device such as arduino or raspberry pi ( If possible). Make it wireless.
@oklahomajefferson4625
@oklahomajefferson4625 Год назад
Thanks laddie! I've been scratching my head with assemblyAI for days to make this to work, this went seemingly!
@itsmeark
@itsmeark 6 месяцев назад
i know right assembly ai is so much money
@michelleokolocha5374
@michelleokolocha5374 Год назад
Hi, how do we change the voice to sound a bit like normal voice. And how do we make this work like google AI. For it to come up on our phones when we say 'Hey Genius' Or just call her name.
@braayevangelista1895
@braayevangelista1895 Год назад
SOOOOO sick dude!! can you teach us how to implement an avatar to integrate it with the gpt responses? such as yours in the video
@aryang1976
@aryang1976 9 месяцев назад
Can u help me? The code is all correct but there is a huge error saying that google is not an attribute in the module recognizer
@eajahmed
@eajahmed Год назад
I watched your video and really enjoyed it. Please make another project like this where it will be a mobile application and whenever I call genius it will respond like Siri or google assistant. And if you make a video let me know with a little reply. In the end, I will say one thing, you are a wonderful teacher
@pranshusahu7633
@pranshusahu7633 Год назад
the video was good and i followed it but al last what files Did you download while you were running the programm can you tell and if i want to convert the voice to jarvis's voice how can i do it
@yeetoner8431
@yeetoner8431 Год назад
Thanks for the great video :) One nit-pick, the text is so small it's a struggle to read, I'm constantly leaning into the screen just to know what I'm looking at. There's heaps of dead space around your avatar, maybe consider zooming in a bit on your next vid.
@nat.serrano
@nat.serrano Год назад
How did you make the avatar to talk? (The guy talking). Do you have a guide for yhat?
@DexFlex_YT-
@DexFlex_YT- 10 месяцев назад
with sr.Microphone() as source: recognizer sr.Recognizer() audio = recognizer.listen(source) its highlightin "sr" as an error and when i run it it says invalid syntax, and when i try to pip install the library it says that its already installed
@NonnyStrikes
@NonnyStrikes Год назад
I actually learned some things completely unrelated to the video. Thanks dude.
@alooflemur1492
@alooflemur1492 Год назад
Hello, loved the video works wonders. Would you be able to make a video series on how to add other features? such as opening apps, opening websites, setting alarms, adding a todo list & having it speak at cirten times of the day, say you want an alarm at 7am the bot would say good morning (name) today is (Date) with the weather being (weather info) & so forth I think it would be really cool
@GraveUypo
@GraveUypo Год назад
that is the best animated avatar ive ever seen
@Ali-Houssene
@Ali-Houssene Год назад
Thanks Austin and increase the font size in the editor next time
@un0RRS
@un0RRS Год назад
Had to get rid of the underscore in speech_recognition to get that to work. And I had to run pip install pyaudio to get it to work, but it works. Does this thing have contextual memory? Will it remember by conversations with it? I don't see any logging or context, so I don't think it does.
@roxforgegames4548
@roxforgegames4548 Год назад
This code works, but it is not optimal. Using speech_recognition to detect the initial command is slow because it requires sending the audio to a server, waiting for the server to process it with a large model, and then receiving the result. Ideally, a pre-trained KWS model that can recognize a single command and runs locally should be used instead.
@hugosilva5842
@hugosilva5842 Год назад
how would you do that ?
@gaurav_0369
@gaurav_0369 Год назад
Yes we're curious how to do that
@Gotiplex
@Gotiplex Год назад
@@hugosilva5842 you can use speech recognition library and it s source code to run it locally, it would be faster, but not that faster...
@alexsyld5410
@alexsyld5410 Год назад
@@hugosilva5842 this requires only a lil bit of machine learning skills and a PC that costs about 2k
@orangesunlabs
@orangesunlabs Год назад
Very cool, make it and I'll use it, especially would love it if we could upload a Mid-journey etc talking avatar of our choice (or photo that could be adapted).
@todef4ult
@todef4ult Год назад
how did you create the host animation..please explain
@maut276
@maut276 Год назад
Bro you're just awesome, Can U please make an app like this...
@Ai_Austin
@Ai_Austin Год назад
Coming very soon! 🫣
@maut276
@maut276 Год назад
@@Ai_Austin thanks broooo
@alberttakaruza5612
@alberttakaruza5612 Год назад
Can some explain were we are headed with but me its gonna be wild 😂 loved the Dis-song😅🎉
@thokevinay
@thokevinay Год назад
Nice. Is it possible to create a Telegram bot using OpenAI's latest model released 3 days ago? Using the chat endpoint? It would be nice if you create a tutorial for that.
@LEHeak
@LEHeak Год назад
Thanks Mate! Through this i was able to completly copy famous Chatbots like Siri or Alexa and thanks to the python statement "in", i was able to create a bot, who can filter my commands from whole and variable sentences. My Bot almost feels like a human teacher i can ask any question 😁 ... well ... almost ... davinci seems not to be able to tell the correct date and time since both is created from learning and not from actual live data (i asked GPT directly, Davinci refused to give me a usefull answer 😂)
@galaxielion7896
@galaxielion7896 Год назад
Hi ! I love this video, but once I have the code, how I can launch the program ? (run the program don't work) 🥺
@mcfubumaki8359
@mcfubumaki8359 Год назад
Great video! I found it really informative and helpful. Thanks for sharing your knowledge and expertise with us. Looking forward to more videos like this in the future!
@devawratvidhate9093
@devawratvidhate9093 Год назад
Which AI tool you use avtar and voice-over Thanks for video
@esotericist
@esotericist Год назад
Very interesting. Thanks. What software do you use to create teh talking head? Thanks.
@HealerSama
@HealerSama Год назад
Hey! How did you make your avatar? Can you make a tutorial on that?
@MrThePavlik
@MrThePavlik Год назад
ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-yWRx-jCDBqo.html
@InspiredInsights4U
@InspiredInsights4U Год назад
This is a good video it could be even better though with a release of GPT 3.5 turbo if you would take and show this again using GPT 3.5 turbo and whisper I think he would have a lot better response and a lot of people will really jump on wanting to do this. Thanks.
@Ai_Austin
@Ai_Austin Год назад
I have been researching Whisper. Its barrier to entry is a lot higher. Meaning if you want to run Whisper without having to pay for every question to transcribe, it needs to be done locally. Which puts you in the position of either needing a PC with 10+ GB of video ram. I also have not seen any evidence that the whisper api performs better in transcription than google speech recognition. OpenAI is the hype but I don’t want to make people feel obligated to shell out money for something that is currently possible for free. If one needed offline transcribing and has a beast of a pc to power the python program, Whisper would be a great choice.
@InspiredInsights4U
@InspiredInsights4U Год назад
I think today’s computers are probably powerful enough to handle text to speech I am a blind individual and I use several apps on my phone on my computer that dude just this kind of conversion and they’re not high power apps or high power computer. Some of them sent off to the Internet for processing but one of the things that could be done. If CPU horsepower is a real concern is push it off to the GPU most computers have Decent graphics processing units that would process much faster than a CPU ever could and it doesn’t take a lot of code to do that. I do think there’s a little more involved in writing code but I don’t think it’s any strong barrier. I think it’s just something Hass to be learn how to do. I’m in the process of trying to learn some of these things myself and I don’t see it as difficult as what you think it might be Again being blind it’s a little hard for me to quickly ramp up to the stuff but I’m getting there
@catonup
@catonup Год назад
I like your delivery style, however to be really effective the code needs to be legible, at least for those of us that are great coders. Even after magnification itsome of it was just a blur. It would be excellent if you could provide a file with the code in it.
@Ai_Austin
@Ai_Austin Год назад
I appreciate the feedback, is that happening even on 1080p with a computer monitor? Either way Ill make sure zoom in on the code and start linking a github repo for the projects. Thanks Nicholas!
@hanksimon1023
@hanksimon1023 Год назад
@@Ai_Austin Thanks! Great video. ... Yes, text is fuzzy on an iMac 27" monitor.
@ingmarxhoftovningsr6144
@ingmarxhoftovningsr6144 Год назад
@@Ai_Austin On my MacBook Pro Retina from 2015, the code is very readable at 1080p, only slightly blurred. Still, it would be convenient not having to write the code, but it might be a better learning experience writing the code myself.
@StoutProper
@StoutProper Год назад
@@ingmarxhoftovningsr6144 or you could get chat gpt to write it
@ingmarxhoftovningsr6144
@ingmarxhoftovningsr6144 Год назад
@@StoutProper Yes, that's the way to go, I guess!
@RomitKhilari
@RomitKhilari 8 месяцев назад
Great video austin but my program cannot access the voice of the microphone. Do I need to save my voice as a file for this program to recognize
@arjund1173
@arjund1173 Год назад
cannot access local variable 'audio' where it is not associated with a value (error) How do I fix
@sourabhagrawal4098
@sourabhagrawal4098 Год назад
will it run the same on python or do I need to change something? because I'm trying to run it on Python and it does not run like VSC.
@Art2om
@Art2om Год назад
This is all cool, but what if you wanted to do it with ChatGPT-4? Or are the libraries for it not available to the public yet?
@richardtosuto
@richardtosuto Год назад
What was the process you used to create the speaking animation of your avatar?
@TitanfangArtChannel
@TitanfangArtChannel Год назад
I'm also interested in how this is done.
@mukondekafwimbi3503
@mukondekafwimbi3503 Год назад
Wow. Are you also able to create an artificially intelligent speech app that can describe pictures fed into it in order to help the blind understand what is happening in the picture
@afifahapriliani574
@afifahapriliani574 Год назад
Great video !. what if you add the talking avatar feature to the ai assistant like the one in the video?
@Ai_Austin
@Ai_Austin Год назад
Thank you 🙏 That is the future but for now the tools to do it would make an extremely expensive and slow assistant
@deussayajin1441
@deussayajin1441 Год назад
Opa quero ver
@mazentahhan2255
@mazentahhan2255 Год назад
i'm facing some troubles running it, it requires FLAC conversion utility. any suggestions ? btw: i'm using mac..
@arjund1173
@arjund1173 Год назад
same
@ashrafshaik79
@ashrafshaik79 11 месяцев назад
Hey, nice tutorial, but can you let me know if there is a way to change the voice of the model? If yes, how? Thanks!
@intensanium8320
@intensanium8320 10 месяцев назад
Yes we can modify parameters of the tts. Ask chatgpt how you can modify the parameters of the tts and you will have a little code snippet. just copy and paste the three lines after the initialization, you can modify the values for testing different voices and speech rates
@heavenly2463
@heavenly2463 Год назад
Hello, Austin. Please answer me, who did you study for?
@Competitiveai
@Competitiveai Год назад
awesome, I love this. how you do the animated face?
@AlvinSony
@AlvinSony Год назад
Bro please reply How can I slow down the reply of this AI and change the voice. I appreciate you It worked like a charm!
@Ai_Austin
@Ai_Austin Год назад
I show how to change the speed settings in the new Bard voice assistant video on my channel.
@adithsuresh3444
@adithsuresh3444 Год назад
This is amazing 💯, it would be cool if you could also create a video on how to control other applications, using this module, btw is it possible ?
@Ai_Austin
@Ai_Austin Год назад
Yes, absolutely possible.
@kevinsandoval9818
@kevinsandoval9818 Год назад
Awesome :), will this work on a raspberry pi?
@aiortairaan5458
@aiortairaan5458 Год назад
what are the dependencies that are required to be installed for this project?
@roxforgegames4548
@roxforgegames4548 Год назад
pyttsx3, pyaudio, openai and SpeechRecognition
@cammoplayz
@cammoplayz 4 месяца назад
i need help, it says "Python was not found; run without arguments to install from the microsoft Store, or disable this shortcut from Settings > Manage App Execution Aliases." what do i do??
@cryptombt5880
@cryptombt5880 Год назад
This is a really good video compared to most on here. Cheers
@indy2l
@indy2l Год назад
Damn I was thinking about it this morning, but it appears that someone already did it 1 month ago before me.
@jonathanrose9912
@jonathanrose9912 Год назад
Currently working on integrating some Amazon API's to make the a much more usable experience. Including no wake word. I have 0 technical background but in the last 6 hours with the help of chatgpt I have a working model
@Ai_Austin
@Ai_Austin Год назад
My goal exactly with these tutorials is you add your own preferences and upgrade upon these. Super cool to hear you’re doing it with no coding background man!
@skylier
@skylier 6 месяцев назад
In case anyone is wondering as of today (2024) basically everything is outdated in this video unfortunately ;-; Hopefully this can save some people from trying and failing.
@Ai_Austin
@Ai_Austin 6 месяцев назад
Watch the newer tutorials. Code tutorials don't last for ever when they use 3rd party API's! My newer tutorials are far better than this one anyways!
@Daniel.Kovacs
@Daniel.Kovacs 6 месяцев назад
That's me 😂
@whitneydesignlabs8738
@whitneydesignlabs8738 Год назад
Great! Good coding,
@GreyWind182
@GreyWind182 Год назад
Is there anyway to attach this kind of virtual assistant code to a virtual avatar that can respond like the one in this video?
@FragZ999
@FragZ999 Год назад
"That sounds like NLE CHOPPA" ahahah you got me there m8 love the video
@seb_gibbs
@seb_gibbs Год назад
I've done the same in PHP using a few different APIs and streaming the data as to reduce the latency as much as possible, but its still laggy. Reducing the lag between a question and response is the tricky bit.
@Ai_Austin
@Ai_Austin Год назад
Id check out my new Bard voice assistant tutorial! Its faster than openai's api's and free. The past week I have been using Bard way more than chatgpt. Its just better for fact based responses that need to check recent internet data to verify its answers. And somehow faster than chatgpt without back-searching google.
@sarkis3791
@sarkis3791 Год назад
@@Ai_Austin help please it says An error occured: You exceeded your current quota, please check your plan and billing details.
@littlerocks3984
@littlerocks3984 Год назад
Bro you have used gpt-3 not chat gpt. Chatgpt have extra features such as it can answer question based on previous question and responses. For example on chatgpt Question :-Creat a basic html page Answer:- *code* Question:- now add the background colour to black Answer:- * modified code with background colour black * But when u use gpt-3 for the same it will treat both the questions differently.. and not give the upgraded HTML code.
@Ai_Austin
@Ai_Austin Год назад
Sounds like you found some feature you want to add. Feel free to add a fork to the github repo linked in the new voice assistant if you wanted to actually contribute :)
@littlerocks3984
@littlerocks3984 Год назад
@@Ai_Austin thanks for the help
@ricardocosta9336
@ricardocosta9336 Год назад
Beultiful pipeline this channel.
@Acervorochagames
@Acervorochagames Год назад
I am creating perfect javas myself.
@bananaear23
@bananaear23 Год назад
This is really helpful stuff, this level of quality get an insta like from me
@moshimoshi_04
@moshimoshi_04 Год назад
Amazing man....
@1975nikola
@1975nikola Год назад
After Whisper API release, does it make sense to use Python function for transcribing audio? How much better is Whisper in understanding?
@Ai_Austin
@Ai_Austin Год назад
That is a great question. Ive yet to find the need to learn whisper. Its my understanding that its superior for language translation and perception of accents. It also isn’t free like the speech recognition method i showed.
@1975nikola
@1975nikola Год назад
@@Ai_Austin I was testing Whisper over the weekend. It works great - English is excellent, while even small languages are acceptable with an editor. API is not that expensive, you can transcribe a movie for around 0,50 EUR. However, there is also possibility to install it on your server, running it locally and with that it will only cost the price of the infrastructure.
@PdWOLFG4NG
@PdWOLFG4NG Год назад
this is good bro, thank you
@ferhf824
@ferhf824 Год назад
please show us how to make a vatar like yours that's really cool
@sushantkulkarni97
@sushantkulkarni97 Год назад
Is it possible to retain a session-like memory of previously asked questions with the API like we can do on the ChatGPT web interface? For instance if I ask "Where is the oldest tree located"? and follow it by "How tall is it?", can we make API responses retain the context?
@Ai_Austin
@Ai_Austin Год назад
Using 3.5-turbo it is possible to have contextual memory. It would definitely add some complexity and would potentially want to create a command to refresh memory if you did so.
@Gotiplex
@Gotiplex Год назад
it s very easy, you just have to save your questions and the answers in a .txt file , then you give for each new request the content of the file !
@95doyaku
@95doyaku Год назад
I followed your youtube tutorial, but I can't hear anything. how can I find this section => filename = "input.wav" thank you.
@albertomenendezrodriguez6439
Hello! Actually im experiencing some problems with the pyttsx3 just want to make sure its just me or also you are having some issues with it.
@codylawson7175
@codylawson7175 Год назад
I need help when I run and say "genius" it says An error ocurred : module 'speech_recognition' has no attribute 'recognize'
@ipigtaiwan
@ipigtaiwan Год назад
change it to sr.Recognizer()
@yuguiro
@yuguiro Год назад
hello, very nice tuto but i have a problem with the microphone. the script doesnt want to hear my voice
@nick334t9
@nick334t9 Год назад
I'm having the same problem have you found a solution yet?
@arjund1173
@arjund1173 Год назад
I am having the same issue please help.
@nick334t9
@nick334t9 Год назад
@@arjund1173 ngl so I just copy and pasted my code into ChatGPT and told it that it wasn't hearing my voice and ChatGPT fixed it for me lol. Also make sure you have the packages installed too
@RichestMen
@RichestMen Год назад
Is it functional like google? I wanna use it to command a moving equipment or device. Does it support different languages?
@ganugaumeshbabu6144
@ganugaumeshbabu6144 Год назад
Can we add the basic functionalies like, open a website, open a file, launh/terminate a program ...etc?
@Ai_Austin
@Ai_Austin Год назад
All are actually pretty simple tasks with Python. Just a matter of adding a wake word for the new task and adding the few lines of code needed for each desired task you mentioned. ChatGPT could probably even do it for you!
@rishavmishra5071
@rishavmishra5071 Год назад
Thanks brother 👍👍
@kvvlu
@kvvlu Год назад
could you redo this for the new GTP 3.5 turbo API?
@dogame5
@dogame5 11 месяцев назад
Hey, that's a tutorial. But i got an error and no idear how to solve it. Can you help me? I got this message over and over: *An error occurred: cannot access local variable 'recognizer' where it is not associated with a value* *Say 'Jarvis' to start recording your question*
@sentientpotato9342
@sentientpotato9342 Год назад
Is it possible to do this using an existing voice like from a movie character?
@learningMachine1.0
@learningMachine1.0 Год назад
Simply, GENIIUS🤣
@milindkulkarni7101
@milindkulkarni7101 Год назад
Do we already have an app on Google or apple Play Store that works as a gpt voice assistant and speaks using out of choice Avatar of choice?
@f4r3l0
@f4r3l0 Год назад
amazing! congrats
@Vabistru
@Vabistru 9 месяцев назад
i got it to say "say "Genius" to start recording" but it still wont work due to gTTS not working, it says on line 66, col 35 that theres a problem and it cannot be defined
@bipinpandit6000
@bipinpandit6000 Месяц назад
I put the API key in the function but the code is not running, please help thanks
Далее
Creating Jarvis powered by OpenAI and Python | ChatGPT
18:40
Create Your Own AI Animated Character (step by step)
15:21
iPhone Flip станет ХИТОМ!
00:40
Просмотров 321 тыс.
Create a Podcast with OpenAI Realtime API
18:57
Python AI Voice Assistant & Agent - Full Tutorial
33:10
ChatGPT Prompt Engineering Course
30:36
Просмотров 1 млн