Тёмный

Real-time Speech Recognition in 15 minutes with AssemblyAI 

AssemblyAI
Подписаться 137 тыс.
Просмотров 144 тыс.
50% 1

Get your free speech-to-text API token 👇
www.assemblyai.com/?...
Transcribing in real-time is a super skill only court reporters can brag about. But luckily, we don’t need to learn how to type fast to get transcriptions of audio quickly. Thanks to Assembly AI’s Streaming Speech-to-Text model (previously real-time speech recognition), it is very simple to set up a python script that can listen for audio and turn it to text.
In this video, we will see how to create this script on Python with the help of pyaudio, web sockets and asynchronous functions. The app will have the power to listen to audio input through a microphone and display the transcription in real-time. We will integrate this code into a simple Streamlit application to showcase the real-time speech recognition with a touch of interactivity.
If you’d like to follow along, don’t forget to get your own AssemblyAI API token for free at assemblyai.com
You can find the code from this tutorial in this GitHub repository: github.com/misraturp/Real-tim...
Find the written form of this tutorial here: www.assemblyai.com/blog/real-...
AssemblyAI Streaming STT docs: www.assemblyai.com/docs/speec...

Опубликовано:

 

11 ноя 2021

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 72   
@saifullahkhan9837
@saifullahkhan9837 2 года назад
The accuracy and formatting is quite interesting here.
@AssemblyAI
@AssemblyAI 2 года назад
Thank you! - Mısra
@debojitmandal8670
@debojitmandal8670 7 месяцев назад
​@@AssemblyAIhi what if I want the input to be not from microphone and i want it from my speaker or laptop speaker how do I do it then.
@lfmtube
@lfmtube Год назад
Most instructional and useful video. Thank you.
@AssemblyAI
@AssemblyAI Год назад
You're very welcome!
2 месяца назад
Thanks for everything :)
@otomakannioc8213
@otomakannioc8213 11 месяцев назад
Very sympathic and engaging presentation. Maybe the most beautiful side of Artificial Intelligence 😊
@AssemblyAI
@AssemblyAI 11 месяцев назад
Thank you!
@pjayo
@pjayo Год назад
Is there a JavaScript version of this video please? Both service side and front end…
@ashiqashervegar7973
@ashiqashervegar7973 Год назад
How can I use this for transcribing particular chrome tabs for online meetings? Can you help me with that?
@slimyelow
@slimyelow 9 месяцев назад
Very kewl it works. However for the live service a $8 minimum is required. - but totally worth it
@amineelarif7001
@amineelarif7001 2 года назад
that is sick! goodjob
@AssemblyAI
@AssemblyAI 2 года назад
Thank you Amine! - Mısra
@lookersky6145
@lookersky6145 Год назад
I've this installed and worked on windows. My question is that Real-time Speech Recognition only recognize english ? Does it support other languages ? Thank you.
@HomelessRafi
@HomelessRafi Год назад
How can I introduce um, ahs, and other filler words in to the Realtime transcription? I see it is an option for uploading an audio file
@adhikesavan9377
@adhikesavan9377 2 года назад
when i tried to install pyaudio terminal displays this error: "Cannot open include file: 'Python.h': No such file or directory "
@Pinkijhabnp
@Pinkijhabnp 8 месяцев назад
Thank you for this nice tutorial
@AssemblyAI
@AssemblyAI 8 месяцев назад
Glad you liked it
@weebiesoftware6296
@weebiesoftware6296 2 месяца назад
I want to implement a realtime app using voice recognition on python 3 / android 11 on my samsung s22. It's my understanding portaudio is NOT supported on Android 11. Is portaudio your only way to get to the mic?
@Asparuh.Emilov
@Asparuh.Emilov 2 года назад
This is really awesome! I would prefer though to see the final result as a short highlights at the beginning of your videos before you go into the details of how to. But thanks anyway for the effort and the time! Hugs!
@AssemblyAI
@AssemblyAI 2 года назад
Thanks for the feedback! It's definitely a good idea to give an impression of the app that is being built. With the newer videos we do a preview at the beginning of the videos indeed. - Mısra
@Asparuh.Emilov
@Asparuh.Emilov 2 года назад
@@AssemblyAI 🤗🤗♥️♥️
@MrThought2012
@MrThought2012 10 месяцев назад
Very nice and easy setup! Took me ages to achieve the same with whisper. However, are you planning to support other languages, german, french or even a multilinugal model?
@omarsiddiqi5018
@omarsiddiqi5018 8 месяцев назад
Can I ask how you were able to do it?
@1992kshitizyadav
@1992kshitizyadav Месяц назад
As of now, only the English language is supported in the live transcription feature. when can we expect more language support ?
@claudiotassis
@claudiotassis Год назад
Incredible video. Would I be able to use chatGPT, as an intermediate, to correct the sentences based on vocabulary and grammar, and after that, get the response from that chatGPT "reviewed" sentences?
@mohamedshagie3342
@mohamedshagie3342 Год назад
Yup i tried to make it but it worked only text cant use speak 😅
@PoojaVerma-sl6mg
@PoojaVerma-sl6mg 9 месяцев назад
Could you please instruct me on how I can include this in my Angular project?
@ckames22
@ckames22 2 года назад
Awesome 👍
@AssemblyAI
@AssemblyAI 2 года назад
Thank you!
@Miguel-hq1lx
@Miguel-hq1lx 2 месяца назад
is it possible to transcribe in real-time in other languages, such as spanish?
@fahnub
@fahnub Год назад
Does it also offer diarization in real time?
@moncefarajdal4582
@moncefarajdal4582 2 года назад
Can you please let me know how can I integrate this in my JAVA Maven project?
@AssemblyAI
@AssemblyAI 2 года назад
Hey Moncef, unfortunately I also don't have experience on that. -Mısra
@usus8420
@usus8420 2 месяца назад
hi great works but what about smartphone ?
@REALVIBESTV
@REALVIBESTV Год назад
Can this work in Unreal Engine 5
@onintsoavola5698
@onintsoavola5698 6 месяцев назад
Is it possible to make it faster ? The transcription takes a little time
@borr2749
@borr2749 Год назад
Assembly ai real time transcription doesn't have a free trial ?
@user-mx5lv5qp5y
@user-mx5lv5qp5y 11 месяцев назад
can you pls let me know how to save that text
@IntricateMoon
@IntricateMoon Год назад
I'm on windows, When I try to run it it does nothing, just creates a new line on the terminal. when I cloned the github repo, it was working, hmmm
@AssemblyAI
@AssemblyAI Год назад
Have you tried speaking while the code is running? It might be that you don't have a microphone connected to the computer.
@KashyapJadav
@KashyapJadav Год назад
Live transcript is paid version?
@parameswaranesnsce-cse9491
@parameswaranesnsce-cse9491 6 месяцев назад
can we speak any indic languages , will this endpoint will transcribe or not ?
@AssemblyAI
@AssemblyAI 6 месяцев назад
Yes AssemblyAI's API supports Hindi Transcription, check out this tutorial: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-3WlNOCeyyjQ.html
@MDMUHTADEEFAIAZKHANSOUMIK
@MDMUHTADEEFAIAZKHANSOUMIK Год назад
Can we setup Bangla language for this system?
@spinal_cord
@spinal_cord Год назад
I know this is a little old, but I get a 4002 error, what might cause that?
@walker71391
@walker71391 11 месяцев назад
Did you ask ChatGPT?
@GiulianoGolfieri
@GiulianoGolfieri Год назад
Is it possible to use this service in other languages apart from English?
@tiagofyhnesteves74
@tiagofyhnesteves74 Год назад
im also trying to find an answer to this question
@GiulianoGolfieri
@GiulianoGolfieri Год назад
@@tiagofyhnesteves74 they answered to me privately. It's not possible yet. I switched to Azure cognitive services, which is multi-language.
@frizzfrizz3550
@frizzfrizz3550 8 месяцев назад
@@GiulianoGolfieri I had taken it for granted that it was a multilingual service, a fucking morning's work wasted. Grazie della info, Giuliano
@rubibeats
@rubibeats Год назад
how to add custom ui?
@eagold
@eagold 2 года назад
buut.. if i have no money to buy the pro key?😕
@AssemblyAI
@AssemblyAI 2 года назад
You can get started for free!
@siamkamelia87
@siamkamelia87 2 года назад
does this work for song transcription ? in real time ?
@AssemblyAI
@AssemblyAI 2 года назад
Hey Siam, depending on the amount of background music and clarity of pronunciation you'd get varying levels of success with transcribing songs.
@ibrahimimohssine8131
@ibrahimimohssine8131 2 года назад
is assemblyAI support arabic language with vowelization?
@AssemblyAI
@AssemblyAI 2 года назад
We are launching support for Arabic in late January!
@angelfernando8954
@angelfernando8954 2 года назад
Hi. how can i change the lenguage to transcript in spanish?
@AssemblyAI
@AssemblyAI 2 года назад
Hey Angel, here is the documentation on transcribing in languages other than English. docs.assemblyai.com/walkthroughs#specifying-a-language
@dirtydevil81
@dirtydevil81 2 года назад
@@AssemblyAI But do different languages work with realtime transcription on this specific endpoint? The documentation, regarding changing the language, is not clear about this.
@giovanniied
@giovanniied Год назад
@@dirtydevil81 do you find a solution?
@bakhshizade
@bakhshizade 8 месяцев назад
I am here for Freddie.
@marlontuquerres6072
@marlontuquerres6072 Год назад
THIS IS ONLY AVAILABLE ON MAC/LINUX, RIGHT?
@AssemblyAI
@AssemblyAI Год назад
No, it is available independent of the operating system.
@loubino18
@loubino18 3 месяца назад
Should have mentioned cost to go to pro version.... why hide it?
@benyusu8045
@benyusu8045 8 месяцев назад
received 4001 (private use) Not authorized; then sent 4001 (private use) Not authorized
@Homurdan
@Homurdan 2 года назад
Aha Türk !
@egeyay9470
@egeyay9470 Год назад
Ahahahha
@barankaya3333
@barankaya3333 Год назад
Türk müsün?
@valerozanoni952
@valerozanoni952 Год назад
When i added this line if json.loads(result_str)['message_type'] == 'FinalTranscirpt': it wouldnt transcript anything anymore
Далее
5 Lines of Python Code to Create Video Subtitles
4:41
Gặp 2 thánh troll | CHANG DORY | ometv
00:42
Просмотров 23 млн
The Most Impressive Basketball Moments!
00:36
Просмотров 13 млн
Can AI code Flappy Bird? Watch ChatGPT try
7:26
Просмотров 9 млн
Programming Is NOT Enough | Add these 7 skills…
13:19
Best FREE Speech to Text AI - Whisper AI
8:22
Просмотров 910 тыс.
OpenAI Embeddings and Vector Databases Crash Course
18:41