I Built an A.I. Voice Assistant using PyTorch - part 1, Wake Word Detection

The AI Hacker

Подписаться 56 тыс.

Просмотров 434 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

5 окт 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 357

@robinranabhat3125 3 года назад

While internet is full of AI guru's teaching basics with some slides and a jupyter notebook, this guy actually teaches ML with a production level code. Why are you underrated !!

@p._7555 3 года назад

True. On the learning curve, we need basic and high level AI teaching too

@ItachiUchiha-nx2sw 4 года назад

First few minutes : Alright, this sounds so cool Middle part: Da fuq Last few minutes : Alright, this sounded cool

@theaihacker777 4 года назад

😂

@christopherdimitrov1652 3 года назад

Hahahaha

@vinayak354 3 года назад

Hahaha

@AfafPrinceOSH 3 года назад

@@theaihacker777 is there any way u could change the voice of Google assistant to any random voice?

@user-fj4ih3lo9f 4 года назад

Could you make a more detailed tutorial? I couldn't find any other videos on how to make an AI Voice Assistant, i really liked the vid altough it was sometimes hard to follow. Would really enjoy a full detailed series on this :D

@bryanfeliciano4102 3 года назад

The best way to learn is to mess with it my dude. Go into the GitHub and read the code, and start writing your own following his example but alter it to suit your tastes .

@gamecraftczjaajenomja1057 2 года назад

Amazing video! Finally someone not just showing some random jupyter notebook. I love how you show the real problems: not enough wake word samples, voice streaming, long training times, etc. Continue if possible, I would greatly appreciate it!

@GeometryDashEndermaster 4 года назад

You're like an alternate reality Michael Reeves

@soon3794 4 года назад

Basically a less suicidal version lol

@dareokoski8158 3 года назад

@@soon3794 he is still yung, give him time XD

@TeamVexVideos 3 года назад

The other Micheal isn't as good with Man's AI. He just uses like google API or existing software. And Micheal Reeves just does comedy not education.

@sgodsell2 3 года назад

At 10:00 you said two words that sound similar, like MOLLY, and FOLLY, FALLEY. If you were to include those 2 or 3 words in your model data under a different category. Then they will never trigger when you say any of those words, except when you say hey wally.

@mohamedfasil4932 2 года назад

The weather is M####F#### hot I thought that and in deep thinking this doesn't sounds right

@johnhandley1870 4 года назад

I’m watching this video on my iPad and when you said “Hey Siri”, Siri woke up... By the way, I’d be really interesting in seeing a video in which you explain how to set up a computer to carry out Machine Learning tasks. :)

@theaihacker777 4 года назад

Would love to do that!

@theaihacker777 4 года назад

thanks for the support!

@leninbabu5797 4 года назад

Its been cool to see U Can we make this model using raspberry Pi zero !!

@maritzadelascasas2727 4 года назад

the mot ovios wake word is wake up a.i.

@rangefreewords 2 года назад

Awesome! I was looking for a totally offline LAN based smart A.I. like what you just presented Objective: To control everything on my sailboat, take helm, drop anchor, play music or movies from my pi based server and work alongside my autopilot systems and chart plotter. I know often during my voyages I won't be accessing any internet but, I want to have all the same ubiquitous control as a smart home, etc. I am trying to source as much as I can from RU-vid to build a decent system. Keep up the awesome work!

@subhajitkundu7546 2 года назад

Hey, good day mate, The project that you talked about sounds awesome. I am just checking in to know how the project is coming along and where are you headed with this project currently.

@danielogunlolu 2 года назад

I am working on something similar on channel. Kindly check it out

@mathewpatterson2187 Год назад

Ahoy there! I'm also checking in, how's it going with the project sounds awesome and very reminiscent of what I want to do! Have you had much success?

@daanzap Год назад

I have the exact same idea! It's going to feel a bit like being on the enterprise.

@elliotmarks06 Год назад

This project looks super cool! I'm a little late to the party, but I think this would be awesome to revisit with the new AI chat tools! Especially something like GPT-Neo or the other open-source implementations.

@andreashon 4 года назад

This video is exactly what I was looking for. All other voice assistant youtube guides use shitty Google services and other proprietary sources. Thank you. Looking forward for next vids on this topic. Also that'd be interesting if you reveal how much time have your machine spent on all that learning.

@theaihacker777 4 года назад

Next video coming up soon! For wakeword, training was really fast. only spent like 30 minutes training.

@leif1075 Год назад

@@theaihacker777 Was this process mostly fun and enjoyable? If not how did you not give up when it got hard and not get bored and frustrated? Thanks for sharing.

@leisana4097 3 года назад

Extremely extremely intelligent AI. You asked what next to try - Can you try - Self driving car with Raspberry Pi and Pytorch. A small rover

@ramoncaceres4399 3 года назад

Pretty interesting. For about 2 years I’ve been obsessed with turning my house into an AI assistant. Yet have the voice Overlay of ( BT from titanfall ). Yet pretty hard to do that.

@JohnSmith-ox3gy 2 года назад

Why mess with perfection?

@justsomeguywithtattoos6267 3 года назад

This could be easily applied for translations, so that you can have an earplugs that instantly translates what someone is saying to you

@sonu-d6e 2 года назад

Been watcNice tutorialng your vids for a good few weeks now, learning new sNice tutorialt each day. my worksoftow has improved so much since watcNice tutorialng

@JorgeHernandez-iw2fd 3 года назад

How does someone even get to this level? I'm assuming years of practice and stuff but what/how did you practice?

@shubhamthapa7586 3 года назад

no one year is enough

@dillonridder8737 4 года назад

"Apple's okay" lol

@schwarzarbyter 3 года назад

thats exactly when this video got its 72nd dislike.

@mileshall5795 Год назад

DUDE!!!!....You are a total BOSS!!! Thanx man. you are WAAYY better at this instructional thing than established channels/RU-vidrs that (for some reason) have more subscribers, etc. Keep that stuff UP!! You are just the dude I was looking for.

@adarshvinayak 4 года назад

Your videos are amazing man. You've just earned a fan. By the way, I'd love to see you make the speech recognition model next.

@theaihacker777 4 года назад

It seems that speech recognition is popular!

@sreeram9220 2 года назад

I'm not gonna lie when I hear your one minute speech I find some hope

@anshpatel8083 3 года назад

“ Apple’s okay. “ Im subscribing this channel

@RichardBaileyrichoncode 4 года назад

Looking forward to next episodes.

@DrSmart20 Год назад

haha i looked the idea up like a year ago thinking it would be cool and everything i found was just "attatch spare phone to a speaker" lol this just showing up in my feed now is so exciting

@தமிழோன் 3 года назад

I wonder why your channel is still not famous. 🤔You deserve millions of subscribers!!!

@cloudsystem3740 2 года назад

honesty i never used anything like that but somehow you inspire me to start thinking how i can learn that stuff very nice video and source code thanks

@Rottingflare 3 года назад

This looks like an awesome project, can't wait to see more development!

@aayushbajaj2260 3 года назад

this is insane. thank you for building this.

@RomansapienMVision 2 года назад

Thanks for sharing. I was looking for a video that would get my head spinning then eventually put me to sleep.

@harjunmnath 3 года назад

common man, give us the next video of this series we have been waiting too long

@MrHumbleOne Год назад

It’s a shame you didn’t keep this repo up I just got a raspberry pi and this is my intentions but I don’t know enough about ML or engineering to pull it off 😅 thanks for the content!

@myrthestruver5262 Год назад

you have no idea how proud I am I even understood half of this haha

@bradc6056 4 года назад

I haven’t seen the rest yet, but have you thought of creating a blacklist of all the words that sounds like Wally, and that should increase overall accuracy.

@akashdhage 3 года назад

Great learning thanks a lot. I have gone through the video completely,if the audio signal is split with a equal diffrence for eg:2 sec then it may result in loss of information as a the split may occur at the middle of word

@ali-g 4 года назад

Oh man, you are amazing! Just inspired me, thanks for the great content.

@400DaysUnicorn 3 года назад

Couldn't find part 2 and so on. Would love to do this project for myself. Thank you!

@Elian- 4 года назад

Great video! High qualiy, entertaining and inspiring

@benni5541 2 года назад

Im Currentlx using rasphy. Although its pretty neat and does EVERYTHING, it does not leave much room for the very tech savy users. I guess i will tinker a bit with your code and incorperate parts. But i need to replicate my current satellite nodes with printed pcb first. So much todo :D

@ShivamVerma-gq2sm 4 года назад

Why not go in the sequence you already told? Incredible video ! I read a blog of yours on medium about LSTM, quite good explaination. Thanks man for such an awesome stuff .

@theaihacker777 4 года назад

Always a logical choice! but just wanted to know what the viewers are most interested in seeing next

@dr.mikeybee 2 года назад

Nicely done. You need to add a bunch of Molly Dolly ground truth to the training set. That should fix it. It's great that you have it running on a pi. Nevertheless, you are going to need speech recognition, and I'm not sure vosk or deep speech will run on a pi, and if it does, you really don't need a wake word detector model. If the speech recognition model can understand the wake word, that can be your detector model too. The only reason for a wake word detector is to avoid going out to the cloud.

@hs4lhp828 3 года назад

Interesting video. Looks like fun. You're smart af. In other news, I've always suspected that I'm dumb af. This is now confirmed.

@alicomando1195 2 года назад

Those codes make me breath deeper

@TheAcujlGamer 3 года назад

Did I just found a small creator that makes great & fun pyTorch content?

@PritishMishra 3 года назад

You are a genius.. Subscribed !!

@NathanaelNewton 2 года назад

"Ok, Now that we got the code.." Wow.. that was so information dense.. SUBBED AND BELL WOW.. I'm going to learn a lot here I think :)

@PiyushSharma-od2el 3 года назад

Nice making Terminator A.i

@romicasimiaisialtele366 2 года назад

Thank you Mike! I'm just starting out and tNice tutorials video really helped get the basics down!

@fahdciwan8709 4 года назад

Keep it coming Michael !!! Thanks a ton!!

@thom2503 4 года назад

Great video, liking the format

@fteoOpty64 4 года назад

Ultra Cool Dude!. Fantastic instruction process and very concise. Very Good. 101% grading!.

@lukewhatley8043 4 года назад

Awesome video! Do you know if the dataset you decide to train it on has to be in .wav format? So close to getting your example working! Let me know if theres somewhere I can ask some questions regarding your code. Again great video man!

@theaihacker777 4 года назад

Hey Luke! If you have discord, join the discord server and I can help you there. The link is in the description.

@CrazyGamerDude17 2 года назад

Why was this abandoned? This is the best tutorial series what happened to finishing the AI?

@microgamawave 2 года назад

You can make a video about gait recognition biometrics in python recognized you from your walk model ????

@halimaujunwa9533 4 года назад

Oh yhhh......I got lost in that mathematical modelling part tho but still cool.....really cool

@szczurekk1155 2 года назад

I also really like softEX, it has a very nice effect to it

@scollyb 4 года назад

Great video. Have you tried the simple solution of adding examples of you saying the close phrases to the training set. Simplest way would be just to add many copies of them to the set and retrain. Possible more robust way would be to add a second stage to the process trained on only your voice.

@benniegant 3 года назад

Wow, My AI Assistant is working now Thanks!!!

@lionellow105 2 года назад

Hey sorry for bothering but would it be okay if I ask you some questions on how you got it to work? Especially the raspberry Pi part

@etienneekpo348 4 года назад

Cool Mike. Thanks for sharing !

@manuelherrerahipnotista8586 2 года назад

Very clear explanation. Thanks a lot

@mauricioandrestiznadoroman3460 2 года назад

production. Thanks again!

@tuananhlam90 4 года назад

Hey great tutorial! Looking forward to your written guide! But question/request for you: if I don't have access to expensive deep learning hardware setup like you have here, can you do another separate tutorial series on how to build/train model on say AWS/GCP?

@theaihacker777 4 года назад

Totally can consider that. But I think there are quite a few tutorials like that out there. Also i would recommend using google colab since it’s free and good enough for small projects

@egs-zs8-127 4 года назад

Great video! Thank you so much!

@user-mr8cw4ud8n 2 года назад

He had when he "pitched down the Nice tutorialgh hats at the end of the phrase. "

@randallnorwood6803 Год назад

Well I must say that I'm very interested

@devbella5223 Год назад

Why aren’t you more popular??

@r7rahuls 3 года назад

Finally.....I got what I was looking for. ❤️

@provakar5496 3 года назад

Thanks a lot, I just made google assistant!

@lionellow105 2 года назад

Hey, what wakeword did you use

@xYASMINNN 4 года назад

Love this! 🤍

@mcudgir1291 2 года назад

This helped a lot thank you

@HappiFix 3 года назад

daaaaayum good stuff homie g gangsta

@getalife6654 3 года назад

I really like your content and wanted to try this myself :)

@sodapopcowboy8620 3 года назад

Good news I understand this. Bad News I am overly perfectionistic.

@johnsnow5510 2 года назад

Very nice ideea! It may be a dumb question, but here it goes: can the recorded voice be in other language than english, and by using the same principle get similar results? I'd like to create an assistant that recognizes speech input in real-time and returns information like weather, youtube videos etc.

@melaniedavis6379 2 года назад

Love this, but you see what had happened was I'm in confusion the words you say are too technical for my brain 😂

@mingtang9823 3 года назад

This is very helpful! Thanks a lot. I subscribed!

@DJWangDJ Год назад

dude, you are awesome!

@antoniomeraz520 4 года назад

Dude you´re awesome!!! make more of this

@Wallee580 2 года назад

I don't understand much of this but I think a good way to fix your wake word issue and make it more simple is to just use a speech to text algorithm. Though as I am a noob to AI that may be exacly what you just did :p

@ObitoUchiha-be1jo 3 года назад

I was thinking of building too using raspberry xD. I'm surprised u didn't include cortana

@amitg2k 2 года назад

Awesome...way to go

@prakashupadhyay9529 4 года назад

Loved your explanation and flow!

@matthewfelgate 4 года назад

Wouldn't a trigger word with an X or K make it easier to detect? (Like Alexa or Ok Google)

@theaihacker777 4 года назад

Probably...

@baskett98 4 года назад

Install an Anaconda and you're set to start. Get more modules and libraries as and when required.

@aaronchantrill7338 4 года назад

I use "MagicVoice" which seems to work well. Does that count as using a 'K'? How are you choosing 'X' and 'K'?

@cheenamaejafar 3 года назад

" and I had to sell my left kidney for this" - Hilarious!

@Giuseppe._.gioffre 3 года назад

Yo i gotta say pretty good vid he only thing that is pretty noticinle is that you watch a lot of Michael reeves and you are inspired by him a lot because a lot of your jokes and very similar setup to his and the video editing style to either way good job just a quick tip that might get you more audience is find you own style which I know you are in the hunt for good luck

@aviralgandharva3157 3 года назад

please make a separate video on using the github stuff 👏👏👏👏👏👏👏

@seesah 4 года назад

this is so amazing!

@CreativeNames101 3 года назад

Bootleg Michael Reeves you'd find at a hair salon, a combo i didnt know i needed

@luvsec5469 3 года назад

The Michael Reeves we always wanted but never got. Until now.

@kalyanibhandari3031 4 года назад

Please make a video on how to make hotword detection in computer itself

@MrBobWareham 3 года назад

How did you learn so much and know what to type? It just looks so complicated you are awsome

@2Taps420 2 года назад

do a full how to video plz!

@OpenAITutor Год назад

Hi Micheal! I greatly appreciate your insightful presentation and the fantastic animations you've created. I'm writing to request your permission to utilize some of your content to supplement my endeavor in spreading the word about transformers. My intention is solely to use some of the animations you've designed, and I assure you that appropriate credit will be given, recognizing you as the source of these materials. Thank you for considering my request.

@kalyanstock8058 Год назад

Wow...can you do a video on text to speech for any voice?

@karanrathod8555 4 года назад

sir please complete full course of this AI soon...

@jeremyuzan1169 3 года назад

Hi ! congrats for your work. How do you install Pytorch properly in a Raspberry Pi bro ? Thanks a lot :) Jeremy

@dirtydan69 3 года назад

Where can I talk to you about help on building my own AI? I’m still learning how to code and would mean a lot to me if you gave me a helping hand

@RudraPratapSingh-nh7lw 3 года назад

Part 2 fast please

@shallinkumar9042 3 года назад

I'm getting this error in dataset.py (extension.py", line 14 warnings.warn('torchaudio C++ extension is not available.') )

@harryrushin7166 3 года назад

Although the wake word is an important step to producing this AI assistant. Wouldn't it have been beneficial to begin with speech recognition as it would have helped recognise the wake word?

@getalife6654 3 года назад

Is it possible if you post how you did all and what commands you used ? ;)

@rupertbowen-jones858 3 года назад

Have you considered the NVIDIA Jetson Nano or the Jetson Xavier NX? Could be a better and more powerful solution than a humble pi? Looking forward to working through this project though... great videos and git. Happy New Year!