How to Clone Any Voice With AI | Tortoise-TTS Tutorial 

Prompt Engineering
Subscribers: 163K
Views: 123K

If you've ever wondered how to clone any voice with AI, look no further than this Tortoise-TTS tutorial. In this step-by-step tutorial, you'll learn how to create high-quality voiceovers using AI. Whether you're an aspiring voice actor or just want to impress your friends, this tutorial covers what you need to know to get started. Join us as we explore the world of AI voice cloning. You can use this tool to build audio tools similar to ElevenLabs.
Link to the Notebook: colab.research.google.com/dri...
Link to Audacity: www.audacityteam.org/
☕ Buy me a Coffee: ko-fi.com/promptengineering
In this YouTube video, we will explore the technology behind deepfake speech, which involves generating speech from text using a text-to-speech model. This process typically involves three main components: a voice encoder, a synthesizer, and a vocoder. The voice encoder learns to create a fixed-dimensional embedding, or vector, that captures various features of a specific human voice. The synthesizer then uses this embedding to create a mel-spectrogram from a given text transcript, and the vocoder turns that mel-spectrogram into an audio waveform. Additionally, we will provide you with a list of relevant keywords related to this topic.
#elevenlabs #voicecloning #TortoiseTTS #AIvoicecloning #voiceover #voiceacting #voiceactor #voiceimitation #voiceimpersonation #voicechanger #aitechnology

Category: Science

Published: 11 Jul 2024

Comments: 213
@engineerprompt
@engineerprompt Год назад
If you liked the video, you should check out the video on how to create your own AI Avatars here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-V2efVSXSlqc.html
@pacovazquez665
@pacovazquez665 11 месяцев назад
Where can I find help installing it in my Anaconda Prompt window? It's displaying errors.
@ROM_2OO3
@ROM_2OO3 Год назад
It is incredible. I saw some comments where they say that the accent is totally lost, but I tried it and the accent is the same!!!! I thank you very much for this, it is what I was looking for a long time. Its just perfect ❤
@engineerprompt
@engineerprompt Год назад
Thank you and glad you found it useful.
@SexyRolex
@SexyRolex Год назад
Thank you, i like how you are explaining everything so that even a person that doesnt know how to code can understand it.
@engineerprompt
@engineerprompt Год назад
That's the goal :)
@saifshaikh8828
@saifshaikh8828 Год назад
Hey brother, is it safe to install in my pc ? As my windows showing harmful and dangerous risk?
@curtbert2121
@curtbert2121 Год назад
​@saifshaikh8828 That's just because it's a .exe that you downloaded from the internet. Pretty much any file like that will get flagged but this one should be safe.
@postwhateverwhenever
@postwhateverwhenever Год назад
Thank youuu~~ i'm gonna use this for my favorite video game characters 😎
@MrTalhakamran2006
@MrTalhakamran2006 Год назад
Always love your videos...no nonsense ... straight to the topic
@engineerprompt
@engineerprompt Год назад
Thank you!
@starcitycreations
@starcitycreations 10 месяцев назад
This tutorial is awesome! Thank you SO much!
@basictutorial88
@basictutorial88 Год назад
I love it bro, Thank's for sharing 👍
@ozzy1987mr
@ozzy1987mr Год назад
Thank you very much, I was looking for something like this... excellent material and channel content. Even though I don't speak English very well and the translator is bad, your videos are understandable and very clear.
@engineerprompt
@engineerprompt Год назад
Thank you, I am glad you found it helpful. Consider subscribing to the channel, have something big planned for Spanish audience in near future 😀
@user-op3yd5gi9p
@user-op3yd5gi9p 7 месяцев назад
Good job! It sounds like you. We hear our own voices differently from audio recordings because we hear ourselves through our own bodies and bones.
@prakash.pathak
@prakash.pathak Год назад
I have seen this problem with many YouTubers who say the AI clone of their voice does not match their original voice. The output created above sounds "exactly" like you. But you can't tell, because we hear our own voice differently from how others hear it!
@ChristianIce
@ChristianIce Год назад
The timbre is ok, the inflections and the accents got totally lost. Last year, this technique would have been incredibly good. Today there are much better options. AI is evolving at the speed of light.
@MyTobirama
@MyTobirama Год назад
@@ChristianIce Can you link some of them?
@engineerprompt
@engineerprompt Год назад
Interesting point, that could actually be the case.
@engineerprompt
@engineerprompt Год назад
@Mutual induction Its absolutely free. Watch the video for number of audios :)
@ProVideoScribe
@ProVideoScribe Год назад
@@engineerprompt what if we want to render long paragraph? should we cut it into sentences and render it one by one, or is there any way to render all of it at once?
@senate_shakya_
@senate_shakya_ Год назад
You sir have my respect!
@engineerprompt
@engineerprompt Год назад
Thank you!
@Nethrex
@Nethrex 10 месяцев назад
I haven't experimented with this yet, but great video!! Do you think it's possible to run it locally and use it for a personal/local assistant on a PC? Also, is there a way to get it running and working even without internet (so completely local)?
@JukeBoxDestroyer
@JukeBoxDestroyer Год назад
the voice sound just like your voice, minus the accent, sounds very good
@TylerThomas
@TylerThomas 3 месяца назад
Def gonna try this out
@wnrandom98
@wnrandom98 Год назад
great tutorial thank you
@engineerprompt
@engineerprompt Год назад
Thank you.
@xs6819
@xs6819 Год назад
Thanks for sharing this. Is there anyway to make it read 1,000 words at a time?
@kevinehsani3358
@kevinehsani3358 Год назад
thanks for the video. Have you tried bark? Looking for voice cloning model that I can train longer locally for better results. Thanks again
@georgezlei
@georgezlei 7 месяцев назад
LOL. The generated voice really sounds like you. I thought it was yourself talking then I noticed you already clicked the button.
@MyOtherworldlyLove
@MyOtherworldlyLove 11 месяцев назад
Could you please make a video with fully detailed instructions on how to install Tortoise and get it working? Like, instructions for total beginners? 😅You skipped the part of the process between finding it on Github and adding a new voice, and that's the part that's the biggest mystery for me. I'd love to use voice cloning, but I've never used Python and basically all I know about it is the fact that it exists. So detailed step-by-step instructions for those of us who know nothing about coding would be very appreciated! 😅
@cybergigafactory
@cybergigafactory Год назад
Great video, thanks. Is there a limit in how much it can generate at once?
@engineerprompt
@engineerprompt Год назад
If you use it locally, then I think it will be limited by your RAM.
@planetgamecommunity817
@planetgamecommunity817 Год назад
no ...its limited by the price of GPU spexs..hheheh
@weluvtech
@weluvtech Год назад
Awesome tutorial thanks. I was looking for a good Google Colab of Tortoise-TTS. By the way, I found your generated samples sounded just like you. It was hard to pick when you were playing them.
@WhyHelloReader2Me2You-wc2br
@WhyHelloReader2Me2You-wc2br 5 месяцев назад
Can you download that model to run it locally on your machine? Is the resulting file a .pth?
@DamienRourke
@DamienRourke Год назад
Great walk-through, thanks! BTW, the HQ sample did sound like you. It lost a bit of your accent, but overall it did sound like you.
@engineerprompt
@engineerprompt Год назад
Some others have pointed out the same. I guess, I am not used to hearing myself like that :)
@MrDaydreamer1584
@MrDaydreamer1584 Год назад
"It lost a bit of your accent, but overall it did sound like you." It lost all of the accent, not just 'a bit'.
@harsh2624
@harsh2624 Год назад
this is so COOOOL
@engineerprompt
@engineerprompt Год назад
Thanks :)
@usagiracha
@usagiracha Год назад
"ModuleNotFoundError: No module named 'einops'" any idea how to solve this?
@ulisesjorge
@ulisesjorge Год назад
Thanks, this is extremely useful; I suppose that one can feed this algorithm a file so that it can read it and output a recording?
@engineerprompt
@engineerprompt Год назад
Yes, there is a text variable in the notebook. Assign the text to it and it will do the rest.
@akashshesh
@akashshesh Год назад
@@engineerprompt Can you elaborate on this? I am trying to have many sentences read, but it says it's too long.
@ajitkumar15
@ajitkumar15 Год назад
Can we use this for other languages too or is it limited to English Language, thanks in advance.
@adrianaagresta
@adrianaagresta Год назад
Same question I was about to ask
@Nabuuug
@Nabuuug Год назад
It does sound EXACTLY like you, it's crazy. I guess it's the "hearing our own voice is weird" phenomenon that is at play here
@engineerprompt
@engineerprompt Год назад
that seems to be the case.
@Spozinbro
@Spozinbro Год назад
Not exactly, the replicated voice still has some missing accent.
@robbieweld7928
@robbieweld7928 Год назад
@@engineerprompt I disagree it sounds as if it gave you an american accent
@tiagolourenco7158
@tiagolourenco7158 Год назад
Hi, when I run the second code block I don't have the option to upload my files, and it shows "fileexistserror", I believe is something basic but I don't know what to do. Thank you
@RichardBonn
@RichardBonn Год назад
cool!
@st.magnic8592
@st.magnic8592 Год назад
can the segments be less or more than 10 seconds? or does it have to be exactly 10?
@engineerprompt
@engineerprompt Год назад
No, it can be much longer. I have tested it on up to 30 seconds. It depends on the hardware you are using.
@imashark241
@imashark241 Год назад
niceeeeeee now do not have to use effort on reading using my woice but just using woice clone
@didiervandendaele4036
@didiervandendaele4036 Год назад
Great found feature ! But is it possible to use this clone voice to speak in another language with the same accent ? 😮
@engineerprompt
@engineerprompt Год назад
Check out the latest video on the topic
@Seii-FPV
@Seii-FPV 2 месяца назад
I don't have GPU option and have pretty powerful nVidia card installed. I only get GPU T4 and for some reason it won't accept that as an option. Has anything changed in how this works now? Does it need paid subscription?
@csomi35
@csomi35 Год назад
Is it possible to add emotions to the generated audio? I mean after successfully cloning some voice I would like to fill up it with some emotions (exclamation, fear, sad, fading etc... like a voice actor).
@mattlegge8538
@mattlegge8538 Год назад
I don't know if that's possible with any software yet. Maybe bark?
@BigDaz
@BigDaz 11 месяцев назад
The documentation says ---> you can evoke emotion by including things like "I am really sad," before your text. I've built an automated redaction system that you can use to take advantage of this. It works by attempting to redact any text in the prompt surrounded by brackets. For example, the prompt "[I am really sad,] Please feed me." will only speak the words "Please feed me" (with a sad tonality).
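The bracket convention quoted above can be illustrated at the text level. The sketch below is only an illustration of which words are meant to be heard: `spoken_text` is a hypothetical helper, not part of Tortoise, and it does not reproduce how the library performs redaction internally.

```python
import re

def spoken_text(prompt: str) -> str:
    """Drop [bracketed] spans and tidy whitespace, leaving the words meant to be spoken."""
    # Remove every span enclosed in square brackets (the emotion cue).
    without_brackets = re.sub(r"\[[^\]]*\]", "", prompt)
    # Collapse any leftover runs of whitespace.
    return " ".join(without_brackets.split())

print(spoken_text("[I am really sad,] Please feed me."))
```

The full prompt, brackets included, is still what would be passed to the model; the helper only shows what the listener should end up hearing.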
@AMDSTT
@AMDSTT Год назад
Thank you i will try then tell you
@saifshaikh8828
@saifshaikh8828 Год назад
Hey brother, is it safe to install in my pc ? As my windows showing harmful and dangerous risk?
@AMDSTT
@AMDSTT Год назад
@@saifshaikh8828 I didn't test it
@WaseemAhmad-mf3wh
@WaseemAhmad-mf3wh Год назад
Nyc video
@jorgemarz
@jorgemarz Год назад
thank you! Do you think this could work if i use other language? or you can upload other language models or somethig?
@engineerprompt
@engineerprompt Год назад
This specific one only supports english but check out github.com/coqui-ai/TTS for multilanguage support. Hope this helps
@jorgemarz
@jorgemarz Год назад
@@engineerprompt Thanks!!!
@InquisitorGeneral
@InquisitorGeneral Год назад
Is it still necessary to chop audio up into 10 second segments at 22khz sample rate? I have many audio samples from 10 minutes to 45 minutes all at 48khz. Would these not work at all or would they cause some problem?
@engineerprompt
@engineerprompt Год назад
They will work, but you will need good hardware to run it though!
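For long recordings like the ones described above, one way to produce short clips is sketched here with Python's standard `wave` module. It assumes uncompressed PCM WAV input; `split_wav` and `out_prefix` are illustrative names, not part of the notebook. It does not resample, so a 48 kHz file stays at 48 kHz, and changing the sample rate would need a separate tool such as Audacity.

```python
import wave

def split_wav(path, out_prefix, seconds=10):
    """Write consecutive clips of at most `seconds` each; return the output paths."""
    outputs = []
    with wave.open(path, "rb") as src:
        params = src.getparams()
        frames_per_clip = src.getframerate() * seconds
        index = 0
        while True:
            frames = src.readframes(frames_per_clip)
            if not frames:
                break  # reached the end of the recording
            out_path = f"{out_prefix}_{index:03d}.wav"
            with wave.open(out_path, "wb") as dst:
                # Copy channels, sample width and rate; the frame count
                # in the header is corrected when the file is closed.
                dst.setparams(params)
                dst.writeframes(frames)
            outputs.append(out_path)
            index += 1
    return outputs

# Example call (hypothetical file name):
# split_wav("my_voice.wav", "my_voice_clip", seconds=10)
```

The last clip is simply shorter when the recording length is not a multiple of the clip length.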
@DailyStoicisme
@DailyStoicisme Год назад
Hey why do I always get a message "maximum stack size exceeded?"
@naze8793
@naze8793 11 месяцев назад
hello. i tried mine but it doesn’t play my text but the default text that comes in the colab. please any fix?
@Touristt
@Touristt 11 месяцев назад
Do i have to reupload audio everyime i use it?
@MACKINGPIN
@MACKINGPIN Год назад
Thanks for the video I found it really helpful. I played with it and created a synthetic voice using a Spanish speaking person as the voice model but my results were not as good as yours... is it me or the model works best with english language mainly?
@engineerprompt
@engineerprompt Год назад
This specific model is tailored towards English language.
@MACKINGPIN
@MACKINGPIN Год назад
@@engineerprompt small question. Can you tune fine the model with the files you give it in different runs?
@cybergigafactory
@cybergigafactory Год назад
Is there a way to use it on a iPad Pro?
@boulimermoz9111
@boulimermoz9111 Год назад
Great Great Ai thank you very much, do you know if it works for other languages ? french ?
@engineerprompt
@engineerprompt Год назад
I think currently it works for English only but if you can collect data, you can retrain the model on other languages.
@zyrazuric1499
@zyrazuric1499 Год назад
Can it run in a low end pc?
@gisonnisylvio818
@gisonnisylvio818 11 месяцев назад
Does it work only in english ? If I want to make it better in a certain language, do i need to only add more and more samples ?
@engineerprompt
@engineerprompt 11 месяцев назад
This one is limited to English
@muoity4418
@muoity4418 Год назад
Does this model work well with languages other than English, such as Japanese, Chinese, or Vietnamese?
@engineerprompt
@engineerprompt Год назад
This model can only generate English but you can retrain the whole model for any other language. Here is the link with steps: github.com/neonbjb/tortoise-tts/issues/5#issuecomment-1112705908
@UnderratedKitchen
@UnderratedKitchen Год назад
does this work with other languages ?
@NewImperivm
@NewImperivm Год назад
no
@vyqh
@vyqh Год назад
Great product showcase, can I use this to generate a 1000 word text to speech in 1 go?
@engineerprompt
@engineerprompt Год назад
With local installation, probably yes.
@grim789
@grim789 Год назад
​@Prompt Engineering Do you have a video on installing this locally? I'm struggling to get it setup.
@ss-np9gx
@ss-np9gx 10 месяцев назад
how do i fix this unterminated string literal (detected at line 4) when i write my text
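A likely cause, assuming the text was pasted across several lines inside ordinary quotes: Python raises "unterminated string literal" when a single- or double-quoted string contains a line break. A minimal sketch of one workaround, using the notebook's `text` variable mentioned elsewhere in this thread:

```python
# A single- or double-quoted string cannot span several lines,
# so pasted multi-line text is wrapped in triple quotes instead.
text = """Thanks for reading this article.
I hope you learned something."""

# Collapse the line breaks into single spaces before synthesis.
text = " ".join(text.split())
print(text)
```

An unescaped quote character inside the text can trigger the same error; triple quotes also avoid that in most cases.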
@flowsolo
@flowsolo Год назад
Mine made a crazy demon noise in the middle of a sentence.... SWEET. XD
@engineerprompt
@engineerprompt Год назад
haha, it can be unpredictable some time. Hope you had fun with it.
@NowOrNeverAI
@NowOrNeverAI Год назад
Is there a limit on how many words can be spoken?
@r_pydatascience
@r_pydatascience Год назад
I wanted to try this. Alas, it is taking forever to upload the audio files. Then I upgraded my Colab to the Pro version. No success.
@hoovy1163
@hoovy1163 Год назад
here's a few issues that i have: it doesn't have the same exact voice, it's lower pitched and sounds more older? it also has british accent for some odd reason. and when i try to form long sentences it starts babbling and making weird robot and inhuman noises lol
@shledoncooper
@shledoncooper Год назад
Can we use different languages?
@jayr7741
@jayr7741 Год назад
How long passage it will take to cloned the voice? I wanna create the big passage in my voice, is it possible with it? Passage of At least 5000 words
@engineerprompt
@engineerprompt Год назад
You will probably have to divide the passage into different parts and then feed that into the model. That's the best way to do it.
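The splitting suggested above could look roughly like this. It is a sketch under assumptions: `split_into_chunks` and the 300-character limit are illustrative choices, not values taken from the notebook or from Tortoise, and sentence detection is a simple punctuation rule.

```python
import re

def split_into_chunks(text, max_chars=300):
    """Group whole sentences into chunks, aiming for at most max_chars each."""
    # Split after ., ! or ? when followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow: close the current chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks

for i, chunk in enumerate(split_into_chunks("One. Two. Three.", max_chars=9)):
    print(i, chunk)
```

Each chunk would then be assigned to the notebook's `text` variable and generated in turn, with the resulting audio files joined afterwards. A single sentence longer than the limit is kept whole rather than cut mid-sentence.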
@ryusuikagaku
@ryusuikagaku Год назад
Can it clone other than english voice?
@engineerprompt
@engineerprompt Год назад
Not with this version. There are other packages that can do it.
@shailendrarathore445
@shailendrarathore445 Год назад
Make a video for specific person voice cloning for hindi language using google colab..
@edwardecl
@edwardecl Год назад
7:33 - Best bit
@user-xk2zs6vy9t
@user-xk2zs6vy9t Год назад
Are other languages supported?
@elizalapteva
@elizalapteva Год назад
Hey guys. I'm just wondering: if you were able to download your voice there and sing any note like Whitney Houston, would you use it? I mean, isn't it cool to record a love song for somebody, but in your own voice?
@planetgamecommunity817
@planetgamecommunity817 Год назад
easy
@Mox53
@Mox53 5 месяцев назад
is it possible to make this text to speech work in another language?
@engineerprompt
@engineerprompt 5 месяцев назад
Not with Tortoise, I think coqui supports that.
@pacovazquez665
@pacovazquez665 11 месяцев назад
im having trouble installing the tortoise, can anyone here point me to a place where i can find help
@Araujo_gabbriel
@Araujo_gabbriel Год назад
Hi. Is it possible to do the cloning in Portuguese in the area you showed in the tutorial? Or would I need to use a Brazilian's area to be able to do that?
@engineerprompt
@engineerprompt Год назад
This specific model works only with English. There are some other models that I can explore.
@SokratesStudios
@SokratesStudios Год назад
@@engineerprompt That would be fantastic. I just wonder, is there any voice model generator that works with tones and regardless to the language that's spoken??
@gaetanomegna4436
@gaetanomegna4436 Год назад
NameError: name 'load_voice' is not defined How can I fix it?
@tendaimurevanhema1166
@tendaimurevanhema1166 Год назад
I’m getting “Maximum call stack size exceeded.” On the second last cell
@lkbanztheman
@lkbanztheman Год назад
bro it doesn work cus my ram is too high and it doesnt allow me to do it again after my first try on all my google accounts :(
@nanigh2913
@nanigh2913 Год назад
It's support any language like kannada, or only English?
@engineerprompt
@engineerprompt Год назад
This specific one, only supports English.
@redlinrangerstudio5331
@redlinrangerstudio5331 11 месяцев назад
i keep getting load_voice' is not defined
@NeonDarius7843
@NeonDarius7843 Год назад
Great tutorial! I'm happy! Tho, I tried to generate a voice but I got the results where a female sounds British. Is there anyway to change the accent?
@azab14
@azab14 Год назад
Is it work with other languages such as Arabic
@RamonValdez2014
@RamonValdez2014 Год назад
I get this error after running the "generate speech" cell: "NameError: name 'text' is not defined". Anyone???
@ecstasycheese7390
@ecstasycheese7390 Год назад
Run the "# This is the text that will be spoken." cell first, before going down to the "# Generate speech with the custom voice." cell.
@RamonValdez2014
@RamonValdez2014 Год назад
@@ecstasycheese7390 Spot on thanks a lot! I tried to install Bark on my PC last weekend but I got stuck in some dependency that just won't work. Gotta stick to Collab for the time being!
@Paulinhox88
@Paulinhox88 Год назад
I get this when i try to upload my audio files "MessageError: RangeError: Maximum call stack size exceeded." Any ideas how to solve this?
@engineerprompt
@engineerprompt Год назад
I haven't faced this, but make sure you have enough space on your Google Drive and a stable internet connection.
@shanesteven4578
@shanesteven4578 Год назад
Worked great until I used up the days GPU allocation!. Nice work, thanks for the effort and video.
@engineerprompt
@engineerprompt Год назад
Glad it was useful :)
@karonwhitehead2383
@karonwhitehead2383 Год назад
Can you do this on phone
@mrGapMan1
@mrGapMan1 Год назад
The clone is spot on, but a bit cleaner english accent.
@manoelvitor-dev
@manoelvitor-dev 4 месяца назад
How make translate from portuguese in this?
@UnderratedKitchen
@UnderratedKitchen Год назад
bro i am getting error
@user-by3fn8fo1f
@user-by3fn8fo1f Год назад
Bro I find a problem to # Imports used through the rest of the notebook.
@chesper_miguel
@chesper_miguel 5 месяцев назад
There was a Brazilian who translated this video with the tool, LOL
@engineerprompt
@engineerprompt 5 месяцев назад
😀
@asterinycht5438
@asterinycht5438 Год назад
How to train on other language ?
@oussamael7304
@oussamael7304 Год назад
can i clone another language and make someone speak it
@amanisebele4073
@amanisebele4073 Год назад
i keep getting an error every time I try run it
@jamrhxh
@jamrhxh Год назад
How can improve spanish?
@desstuctorr5263
@desstuctorr5263 Год назад
Hi guys , do someone know if an AI like 11labs or anything else exist but in French ? Im french and im really looking for that but it seems impossible to find
@engineerprompt
@engineerprompt Год назад
check the next video :)
@Aaliyashi
@Aaliyashi Год назад
@@desstuctorr5263 11Labs do have a model now that supports a few other languages than English. French is one of them, so it should be pretty straight forward :) Just switch the model from "Eleven Monolingual v1" to "Eleven Multilingual v1" when generating your voice lines.
@desstuctorr5263
@desstuctorr5263 Год назад
@@Aaliyashi Yeah I already tried it. The voice cloning is ok but it make a canadian accent that is pretty annoying
@Aaliyashi
@Aaliyashi Год назад
@@desstuctorr5263 Oh I see, that's a shame. At least it seems like it's something they're working on.
@desstuctorr5263
@desstuctorr5263 Год назад
@@Aaliyashi Yep . And its only an experimental version after all !
@astralislux305
@astralislux305 Год назад
Sounds exactly like you.
@engineerprompt
@engineerprompt Год назад
Thanks, others have pointed out the same. Seems like I don't recognize my own voice 😉
@tharakamalli4366
@tharakamalli4366 Год назад
Use text to speech Can videos be monetized?
@engineerprompt
@engineerprompt Год назад
why not? watch this video to learn about Google's policy: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-VjphDyQhlW8.html
@privatesoft5006
@privatesoft5006 Год назад
It Support Arabic Voices?
@spectrecular9721
@spectrecular9721 Год назад
It seems the AI struggles with non-American accents
@fkxfkx
@fkxfkx Год назад
So do Americans.
@crypticutopia7228
@crypticutopia7228 Год назад
​@@fkxfkxas an Australian who went to the US for 5 weeks I can confirm this is very true🤣
@KeytoChannel
@KeytoChannel Год назад
It's a racist AI
@geese5170
@geese5170 Год назад
It’s mostly American companies and American voices being used as examples mostly cuz it’s US English 90% of the time
@funginimp
@funginimp Год назад
I cannot actually hear the difference between the you and the first sample. Remember that you probably sound slightly different to yourself because you're hearing it through your body.
@user-oo2uz4ru7y
@user-oo2uz4ru7y Год назад
Please what is the python version, I keep scipy installation error
@engineerprompt
@engineerprompt Год назад
Python 3.9.16 (in google colab)
@user-oo2uz4ru7y
@user-oo2uz4ru7y Год назад
@@engineerprompt thank you very much
@AntoniusTertius
@AntoniusTertius Год назад
@@engineerprompt How do I install that version if there's no installer for it???? I can't install the latest Python because Tortoise doesn't work with it, right?
@ValicsLehel
@ValicsLehel Год назад
Can be trained other then EN language?
@engineerprompt
@engineerprompt Год назад
This model can only generate English but you can retrain the whole model for any other language. Here is the link with steps: github.com/neonbjb/tortoise-tts/issues/5#issuecomment-1112705908
@ValicsLehel
@ValicsLehel Год назад
@@engineerprompt That is not easy at all :-)
@engineerprompt
@engineerprompt Год назад
@@ValicsLehel That's true. Check out this repo, seems to have multilingual support, I haven't really looked into closely but probably something worth checking out: github.com/coqui-ai/TTS
@ValicsLehel
@ValicsLehel Год назад
@@engineerprompt I will take a look. I want to find a solution to generate my voice (or actor voices) but not the default one. Elevenlabs is ok, but cannot learn Romaniann for example. Just EN.
@mustafak.farouk1071
@mustafak.farouk1071 Год назад
Why does it have to be 10 second segments?
@engineerprompt
@engineerprompt Год назад
You can provide it longer segments as well but its just about the compute resources
@SyntheticVoices
@SyntheticVoices Год назад
Tortoise-tts also has a fine-tuning via a fork
@engineerprompt
@engineerprompt Год назад
Would love to have a look at it, any resources you recommend?
@SyntheticVoices
@SyntheticVoices Год назад
@@engineerprompt I have put a link in the description of my lastest vids to MRQs repo
@engineerprompt
@engineerprompt Год назад
@@SyntheticVoices thanks, I will check it out!
@Syn3rgy-DMS-HANZ
@Syn3rgy-DMS-HANZ Год назад
👍😇
@giooooo3522
@giooooo3522 Год назад
NameError: name 'text' is not defined. I followed you in every step. :(
@engineerprompt
@engineerprompt Год назад
Make sure you run the block containing this code: # This is the text that will be spoken. text = "Thanks for reading this article. I hope you learned something." Seems like it didn't run that part.
@GreenHatAnimation
@GreenHatAnimation Год назад
@@engineerprompt was having same problem and this is the solution
@oxanaivanova8007
@oxanaivanova8007 Год назад
google colab now sucks because you have to pay it will only let you generate 1-4 voices then ur done this is so frustating i will just do story reading without it
@TheBenyos
@TheBenyos Год назад
"Copy of copy of copy of copy of copy" ... 😀
@user-yi4pt9rq9s
@user-yi4pt9rq9s Год назад
The only problem is that the software doesn't pick up your accent.