Fastest speech to text transcription, 100% offline - Whisper.cpp | Zero latency

CodewithBro

Подписаться 1,4 тыс.

Просмотров 14 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

28 окт 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 64

@codewithbro95 5 месяцев назад

If you have any questions please feel free to drop them below! Please don't forget to like and subscribe for more interesting content like this🔥

@maxxflyer 2 месяца назад

hey bro, does it offer italian language?

@codewithbro95 2 месяца назад

@@maxxflyer I belive it does, you can check the repo

@hjoseph777 Месяц назад

I am using a combination of "faster-whisper" and "whisper. cpp" offline. I will use "faster-whisper" for fast machines or servers with GPU and whisper in my project.cpp will be on a regular laptop on CPU. Thanks for sharing; your demo was crystal clear; keep it up. New subscriber

@codewithbro95 Месяц назад

My pleasure, glad it helped;)

@TiTanos168 8 дней назад

Thanks for the info. But the screen is completely blacked out.

@endresbielefeldt2050 5 месяцев назад

thank you for the amazing content!

@codewithbro95 5 месяцев назад

Always a pleasure🎉

@edmondgoddy 3 месяца назад

1K Subs. Congrats bro

@codewithbro95 3 месяца назад

@@edmondgoddy thanks man, really appreciate the support 🙌🏾🙌🏾

@mentalview8703 4 месяца назад

Great video bro. Keep it up 👍

@codewithbro95 4 месяца назад

Thanks, really appreciate 🙌🏾

@dazdazfzf 28 дней назад

thanks for you content for West Indies in the carribbeans. Guadeloupe :-) I am curious to know on what kind of machine you are working on ? Is there a big GPU ? I saw metal, normal apple laptop ?

@codewithbro95 28 дней назад

apple silicon m1, with 8 core gpu I think

@teclascelestiais9328 12 дней назад

incredible! Do you know if it only transcribes wave files? Can I also get mp3?

@codewithbro95 11 дней назад

not sure but i believe you can convert to wav and transcribe from there!

@Jeka476 2 месяца назад

Why is there black screen in middle of the video?

@codewithbro95 2 месяца назад

Hey man, apologies for this, that should have been spotted before publishing. Sorry!

@gomgom330 Месяц назад

What different with speech recognition library?? As i know speech recognition support engine like whisper,watsonx,and google speech, but for offline it use vosk by default

@codewithbro95 Месяц назад

This is more accurate in terms of recognition

@mbegangsylvain1076 4 месяца назад

love it !!!

@codewithbro95 4 месяца назад

Glad you love it... Please, don't forget to like and subscribe for more interesting content like this one🔥😎

@Plash14 Месяц назад

Hey umm, can faster whisper detect sounds like that too or is it only Whisper.cpp?

@codewithbro95 Месяц назад

@@Plash14 not sure what you mean

@Plash14 Месяц назад

@codewithbro95 basically it can detect your keyboard typing sounds etc right? Was wondering if it can be done on faster_whisper as well

@codewithbro95 Месяц назад

@@Plash14 I see, not so sure about that(haven’t tried it) however, if it’s based off of whisper then I believe it should be able to do that

@Plash14 Месяц назад

@@codewithbro95 I see... thanks for the reply!

@RoarStaze 4 месяца назад

How do you get the make command to work on windows?, i got the make command but i just get error saying cc not found and someone said gcc=cc but i dont know how to do anything from there

@codewithbro95 3 месяца назад

@@RoarStaze not tried it yet on windows but from the error you got, I believe you have to install gcc on your windows machine

@RoarStaze 3 месяца назад

@@codewithbro95 i do have gcc someone said i need to make it gcc=cc but ive no idea how to do that

@QHawk7 Месяц назад

It picks up sounds? weird... Doesn't it phone home?

@codewithbro95 Месяц назад

haha, not yet!

@theMonkeyMonkey 5 месяцев назад

Your english is excellent. may i make a suggestion - python is not pronounced pie-ton but pie-thon - with the 'th' being the same as the 'th' in 'this'

@codewithbro95 5 месяцев назад

Appreciate the correction!

@GodFearingPookie 3 месяца назад

Are you serious?

@DenzilSheldon 3 месяца назад

Wow amazing! Question: how much faster is it estimated working faster then Python? Thanks a lot!

@codewithbro95 3 месяца назад

No specific data on that but after trying both I’d say it’s just about 5x faster in transcription

@aryanbamane1281 Месяц назад

How do I implement this on website?? Please help.

@aryanbamane1281 Месяц назад

Anybody knows??

@codewithbro95 Месяц назад

There’s a section for that in the repo

@contactmebaba 2 месяца назад

The guide to install and make it working was not clearly captured in this video. In between it was only voice and no screen record visible to us. I appreciate your effort, but you need to cover the content for wide audience from beginner to Advance in step by step procedure. The command ''make" still doesn't work. The problem with all these AI youtubers are not providing solution to an issue and keep moving to other AI tools with new content. Try to follow-up and provide solutions to your audience in order to get more followers.

@codewithbro95 2 месяца назад

@@contactmebaba I recon the screen went black at a point, sincere apologies for that. That was an editing error. Will try my best to do a better job at double checking before publishing.

@JackieUUU 5 месяцев назад

amazing! what gpu are you running? or it’s on cpu?

@codewithbro95 5 месяцев назад

Running on macOS M1 chip with 8 core GPU, I believe whisper.cpp makes use of metal on mac

@gnosisdg8497 5 месяцев назад

can you put this offline whisper with a local llm model lets say phi3 to get reply based on whisper? i mean lets see how fast it can actually put out what the llm model will reply, this way you can make an offline ai assistant with no latency in responses and local 100 %

@codewithbro95 5 месяцев назад

i am actually working on something like this, check out my recent videos on Jarvis. I am building Jarvis so you don't have to

@gnosisdg8497 5 месяцев назад

@@codewithbro95 cool nice job keep it up, can you also add a way to use phi3 llm with phidata as well for Local RAG and also options for reading csv , pdf ,word documents as well ? this will give you a lot of views also, we are talking for an actual use of an ai assistant with this abilities !!!

@codewithbro95 4 месяца назад

@@gnosisdg8497 definitely something i am looking to work on, stay tuned!!!

@siddharthchadha3930 4 месяца назад

Thanks your video goes blank in the middle for a little bit

@codewithbro95 4 месяца назад

@@siddharthchadha3930 really? Didn’t notice that. Apologies nonetheless

@HimanshuChanda 3 месяца назад

@@codewithbro95@ 06:13 onwards

@hjoseph777 Месяц назад

Your screen went black at 6:10

@codewithbro95 Месяц назад

yeah, editing mistake, my appologies

@snatvb 4 месяца назад

I wait same speed TTS(text to speech), it would be great to have

@codewithbro95 4 месяца назад

Not sure i understand what you mean!

@snatvb 4 месяца назад

@@codewithbro95 we have option recognize speech to text in realtime, but text to speech is really slow now

@codewithbro95 4 месяца назад

@@snatvb definitely agree with you, inferencing with TTS is very bad at the moment, though I recently stumbled on a really promising project called ChatTTS apparently it’s being built specifically for this purpose, I haven’t tried it though, maybe I will and make a video on it.

@snatvb 4 месяца назад

@@codewithbro95 yep, I've seen recently. I tried "bark" from suno and it work pretty slow (I have rtx 3070) and sometimes it voices llm imagination text instad of I gave :D

@ToMooNoT 4 месяца назад

Hi, noob here.. Trying to figure out how to get the `make` working from VSCode terminal, on windows so far I installed MSYS2 added C:\msys64\usr\bin and C:\msys64\mingw64\bin to PATH env variables but... still says command not recognized..

@RoarStaze 4 месяца назад

same did u find a fix?

@codewithbro95 3 месяца назад

@@ToMooNoT does it work outside of vscode ? That’s the normal terminal

@ToMooNoT 3 месяца назад

@@codewithbro95 I had to install Visual Studio and build the C code from there or something, but it didn't build the microphone one, and I don't know how to add it to the build step, so kinda gave up, also was trying to get my AMD GPU to work with ZLUDA which is a library that should make CUDA code work on AyyMD, but no luck there either even with AI helping with troubleshooting..