Тёмный

Python Local Text To Speech Coqui TTS | Generate Audio From Text Using Python 

Hussain Mustafa
Подписаться 10 тыс.
Просмотров 7 тыс.
50% 1

💼 Book a meeting: cutt.ly/Pegxp5rA
In this video we will build a python script that will allow us to generate speech from text locally on our system using the coqui TTS package for python. We will take a look at working with the Coqui TTS package coupled with gradio to create a web interface through which the user can upload there text and generate speech from. The concepts covered will help you understand the fundamentals of working with text to speech systems such as Coqui locally on your system, setting up and configuring a python environment, and using gradio to build a web interface to interact with your Python scripts. This is an excellent guide for beginner Python/ML developers, or anyone looking to learn about text to speech (TTS) systems and build them using Python.
Resources:
Source Code: cutt.ly/Ner6ffaE
Gradio: www.gradio.app...
Coqui TTS: github.com/coq...
Socials:
Website: hussainmustafa...
Github: github.com/hus...
LinkedIn: / hussain-mustafa-960920184
Twitter: / hussain34274892
Buy Me A Coffee: www.buymeacoff...
#python #learnpython #tts #machinelearning #artificialintelligence

Опубликовано:

 

16 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 37   
@moneyman-ne9lw
@moneyman-ne9lw 4 месяца назад
Coqui TTS setup was a breeze thanks to your step-by-step guide. 😊
@m_hussain_mustafa
@m_hussain_mustafa 4 месяца назад
Glad it helped!
@rlt_app
@rlt_app 4 месяца назад
You always manage to make complex topics easy to understand.
@m_hussain_mustafa
@m_hussain_mustafa 4 месяца назад
Thats the goal haha :)
@RaezekenOG
@RaezekenOG Месяц назад
Nice tutorial man! Great job!
@m_hussain_mustafa
@m_hussain_mustafa Месяц назад
Thank you. :)
@RonyHassan47
@RonyHassan47 3 месяца назад
Great one. I will forget about eleven labs
@m_hussain_mustafa
@m_hussain_mustafa 3 месяца назад
Thank you :)
@user-lm2qw9ph9z
@user-lm2qw9ph9z 24 дня назад
Awesome tutorial. I wish I could create multiple audio files from a longer text (from a text file), with each audio file corresponding to a separate paragraph.
@m_hussain_mustafa
@m_hussain_mustafa 24 дня назад
That would be cool!
@MHM-jy4uj
@MHM-jy4uj 4 месяца назад
How does Coqui TTS compare to other TTS libraries you've used?
@mmajr
@mmajr 20 дней назад
Good job! How do you tune the speech speed?
@preneure
@preneure 4 месяца назад
Can you show how to integrate this with a web application? That would be super helpful!
@ridabrahim7604
@ridabrahim7604 3 месяца назад
That shouldn't be a problem, you will do the same thing by sending the text from the front end and process it in the backend and deliver it again(as an audio) to the user, use flask for python to do this
@ridabrahim7604
@ridabrahim7604 3 месяца назад
Great one as usual
@m_hussain_mustafa
@m_hussain_mustafa 3 месяца назад
Thank you 😊
@shubhampadekar2590
@shubhampadekar2590 3 месяца назад
Hi loved the content May I know how to pass speaker index while using multilingual model while using TTS method
@JoeMamaJunk
@JoeMamaJunk Месяц назад
Great video!
@m_hussain_mustafa
@m_hussain_mustafa Месяц назад
Glad you enjoyed it
@MrIMacro
@MrIMacro 3 месяца назад
Amazing
@m_hussain_mustafa
@m_hussain_mustafa 3 месяца назад
Thank you! Cheers!
@mohsenghafari7652
@mohsenghafari7652 3 месяца назад
hi coquiAI library support Persian language ? thanks
@m_hussain_mustafa
@m_hussain_mustafa 2 месяца назад
Hi, I'd recommend checking the documentation.
@StormixDZN
@StormixDZN 3 месяца назад
Does it work on cpu only if I don’t use model training but just tts?
@m_hussain_mustafa
@m_hussain_mustafa 3 месяца назад
Yes it does.
@StormixDZN
@StormixDZN 3 месяца назад
@@m_hussain_mustafa thx bc I have an amd gpu and I can’t use training sadly
@edgarl.mardal8256
@edgarl.mardal8256 3 месяца назад
Very bad voice output, could you show how to train the modell so it actually sounds like a human?
@m_hussain_mustafa
@m_hussain_mustafa 3 месяца назад
Hi, soon I'll be releasing a tutorial featuring another model that will allow to create much more human like audio, in the mean time you can play around with using other models than the one I have shown in the video, training a model will be quite resource intensive.
@edgarl.mardal8256
@edgarl.mardal8256 3 месяца назад
@@m_hussain_mustafa cool, i suggest using appolio,
@Insidestoryland
@Insidestoryland Месяц назад
yes thanks for sharing. i need also taring video of modell.
@sandeeps3108
@sandeeps3108 3 месяца назад
Bro can you make a project for voice cloning
@m_hussain_mustafa
@m_hussain_mustafa 3 месяца назад
Hi, will try to make a tutorial on that.
@DigitalGus75
@DigitalGus75 2 месяца назад
Except is sound like last decades speech synthesis.
@m_hussain_mustafa
@m_hussain_mustafa 2 месяца назад
Yes this is definitely a draw back. However, I'm planning on releasing another video where thr speech synthesis sounds much better.
@DigitalGus75
@DigitalGus75 2 месяца назад
@@m_hussain_mustafa bark is pretty good sounding offline transcription. Not sure it is still supported, but it is still available
Далее
Как мы играем в игры 😂
00:20
Просмотров 156 тыс.
😂😂
00:16
Просмотров 950 тыс.
Essential AI prompts for developers
8:31
Просмотров 65 тыс.
The most important Python script I ever wrote
19:58
Просмотров 195 тыс.
Streaming real-time text to speech with XTTS V2
5:43
Просмотров 4,7 тыс.
World’s Fastest Talking AI: Deepgram + Groq
11:45
Просмотров 48 тыс.
Как мы играем в игры 😂
00:20
Просмотров 156 тыс.