Тёмный

5 Lines of Python Code to Create Video Subtitles 

AssemblyAI
Подписаться 138 тыс.
Просмотров 14 тыс.
50% 1

In this video, we learn how to use the AssemblyAI Python SDK to generate subtitles for any video with timing.
Read more in the AssemblyAI documentation: www.assemblyai.com/docs/guide...
Take a look at the SDK documentation: github.com/AssemblyAI/assembl...
Get your Free AssemblyAI API key: www.assemblyai.com/?...
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: www.assemblyai.com/?...
🐦 Twitter: / assemblyai
🦾 Discord: / discord
▶️ Subscribe: ru-vid.com?...
🔥 We're hiring! Check our open roles: www.assemblyai.com/careers
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#MachineLearning #DeepLearning

Опубликовано:

 

8 окт 2023

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 41   
@kunalsoni7681
@kunalsoni7681 9 месяцев назад
I'll apply this into my project :) for sure
@user-rr2hz3ti6l
@user-rr2hz3ti6l Месяц назад
Definatly going to add this project in my Github profile, Thank you so much❣
@generativeresearch
@generativeresearch 9 месяцев назад
This is great!
@jetakota
@jetakota 4 месяца назад
thanks sister...really works. solved a big problem of mine for a long time. thanks thanks thanks thanks thanks thanks X 1000000
@peacefulsystem
@peacefulsystem 2 месяца назад
Thanks for the informative video.
@mehdismaeili3743
@mehdismaeili3743 9 месяцев назад
Hello, thank you for your great videos. I wrote two short functions to add to this program, which allows you to choose the color of the subtitle, even each line of the subtitle can be a different color.
@skateboardpete8236
@skateboardpete8236 6 месяцев назад
Do you have a git repo ? I would like to see
@synamalhan3243
@synamalhan3243 8 месяцев назад
Is there a way to get the time stamps for each word?
@rangabharath4253
@rangabharath4253 9 месяцев назад
awesome
@amaanmajeed4068
@amaanmajeed4068 9 месяцев назад
Great, love what your company is doing. What are the pring plans. I would love to hop on to this tool.
@alex-stalker
@alex-stalker 2 месяца назад
Great!
@biddutkobir8665
@biddutkobir8665 2 месяца назад
Thank you for this now I can use it to dub my web series.
@teachkhmerbinary
@teachkhmerbinary 8 месяцев назад
i like this
@GeorgeZoto
@GeorgeZoto 9 месяцев назад
Cool feature, I am curious for RU-vid creators like us how does this compare? What does it offer in addition?
@AssemblyAI
@AssemblyAI 9 месяцев назад
Do you mean compared to the automatic YT subtitles?
@renatobrakarz3499
@renatobrakarz3499 9 месяцев назад
First, in 41 seg, from Brazil!
@renatobrakarz3499
@renatobrakarz3499 9 месяцев назад
Primeiro, em 41 segundos, do Brasil.
@davidoludepo
@davidoludepo 9 месяцев назад
Thank you. What do you use for your video recording (this video I mean), Loom?
@AssemblyAI
@AssemblyAI 9 месяцев назад
I use Screenflow. :)
@davidoludepo
@davidoludepo 9 месяцев назад
@@AssemblyAI thank you so much
@nicky_rads
@nicky_rads 9 месяцев назад
Nice! How does this work on the back end ?
@AssemblyAI
@AssemblyAI 9 месяцев назад
We have a team of engineers and AI-researchers working on making our transcription models better every day. :) More on this here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-r-CEc_ZYV9E.html
@kunalsoni7681
@kunalsoni7681 9 месяцев назад
AssemblyAI's API is such a masterpiece (❁´◡`❁)💯🤩
@NasserAlshareefKSA
@NasserAlshareefKSA 21 день назад
its work
@alonhen667
@alonhen667 6 месяцев назад
Does it work on other languages as well?
@killianrak
@killianrak 3 месяца назад
i tried in french, doesnt works
@mehdismaeili3743
@mehdismaeili3743 9 месяцев назад
Excellent. can I get txt file that contian time and text like youtube transcript ?
@AssemblyAI
@AssemblyAI 9 месяцев назад
Yes, that is also easily possible. See the docs here: www.assemblyai.com/docs/getting-started/transcribe-an-audio-file To recap quickly, run "transcript.text" on the returned transcript and you will have a full text version of the transcription (without the timing info.)
@bandhakavidattamoudglya4771
@bandhakavidattamoudglya4771 8 месяцев назад
Hey assembly a.i your development in the a.i technology is awesome I just wanted to be a part of you , let me know poc , we will connect for a while
@InfinitySamurai1111
@InfinitySamurai1111 6 месяцев назад
@Davincy-cd8uf
@Davincy-cd8uf Месяц назад
Does it work for local languages like telugu (india)
@Intellectualmind4
@Intellectualmind4 9 месяцев назад
🎉🎉🎉🎉
@gidmanone
@gidmanone 8 месяцев назад
how much does it cost?
@mandeepsng
@mandeepsng 7 месяцев назад
how to generate text into specific language like I want in Hindi language ??
@conanssam
@conanssam 9 месяцев назад
Is for free? which is prefer whisper than this?
@mshonle
@mshonle 9 месяцев назад
The key feature here is matching the timing of a transcript to the audio track. Getting transcripts to match what-was-spoken-when is an otherwise tedious process.
@joeywang2024
@joeywang2024 4 месяца назад
hi, can translate korea language?
@aladinmovies
@aladinmovies 9 месяцев назад
Beautiful girl in programming. Nice video
@dyablohunter
@dyablohunter 8 месяцев назад
100 beers to the one that solves this: Compare a new 10 second selfie video against a private encrypted database of similar videos and determine some results/outputs from video and audio. Elements: 1. The video selfie in portrait mode - person has to be decently placed within the video frame and decently illuminated. 2. The audio from reading of a random short phrase (must be readable 3-5 seconds) in the native language of the person (language can be selected from user input). Phrase can be randomly generated by AI, must never be the same phrase and the phrase prompted must match what the person reads, so audio must be analyzed. If the subject head is framed properly and illuminated properly, recording will start automatically. Within the 10 second recording, the person making the selfie will be prompted to read the random phrase out loud (in native language). The sound must be analyzed in real time so that the phrase read by the human is converted from speech to text and the output must match the sentence prompted by 90% accuracy or more, or he/she has to start all over again. RESULT: The result of each new comparison initiated when a new selfie video taken is compared against this database has to be an answer to these 2 simple cascading questions. 1. Is the subject in the video a human being? true or false - accuracy must be over 90% - cannot be fooled by manikins or by very obvious recordings played on another screen ^ this is required before saving the video to the encrypted database. 2. Is the subject a different human compared to the subjects from all other videos by analyzing both video image for face ID and sound for vocal timbre? if it's not different, must output all matches by @username value. ^ this is also required before saving the video to the encrypted database. I am open for suggestions to increase accuracy and prevent this system from being fooled/hacked. Also, let's make it open source. I can help with the front-end and hosting.
@mhadnanali
@mhadnanali 5 месяцев назад
Tried: Working like a champ (My review: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-RguZjqOSEWo.html)
Далее
КАК ДУМАЕТЕ КТО ВЫЙГРАЕТ😂
00:29
Run LLMs locally - 5 Must-Know Frameworks!
4:31
Просмотров 16 тыс.
k nearest neighbor in Python
29:10
Просмотров 1,5 тыс.
How to use LangChain for RAG over audio files
10:54
Просмотров 4,2 тыс.
Creating a Speech to Text Program with Python
8:38
Просмотров 51 тыс.