Thank you for such a comprehensive tutorial. Would also love to see the improved version you mentioned @13:10. And is there any other tool that transfers facial expressions from one talent to another, rather than just generating those facial expressions from audio?
Great video. Do you have a video on running it on your local machine? Also, do you have a video on using an API like you mentioned @9:53?
Interesting! Thanks. I had tried the Colab several times and failed. I wonder if that's because I didn't copy it to my Google Drive? It wouldn't notice the images or audio I added.
Can we use this generated video on social media or a personal YouTube channel? Is this an alternative to yoyo-nb/thin-plate-spline-motion-model, which you showed before?
Thanks for the video. Have subbed and liked. I have a question: on the Google Colab free plan, what's the maximum amount of audio it can process? For example, if I have a 1-hour audio file, can I still use the free plan?
The licence says open source and free for commercial use too. Why should we have to read through all of the code searching for a disclaimer that says the opposite? In my opinion, when I use code, my only responsibility is to read the licence and not violate it. It is not my responsibility to read the entire contents of the code. Why do you think a disclaimer buried somewhere inside the code package has any legal value?
Can the SadTalker software work in real time? That is to say, if SadTalker were to receive a string of text, could it manipulate the face in real time, or does the text string need to be processed online by a remote server? Please advise.
Does anyone know approximately how much time it takes to generate that kind of video (assuming a decent CPU and GPU, like an i7 and an RTX 2070, for example)? I can't find this anywhere, and as I'm testing several solutions, I'd rather find the information than try all of them 😅
Can we make it generate the video faster so it would be suitable for live applications? Is there any alternative model that can do the same thing in real time?
@@engineerprompt I ran it on Colab for a short audio clip, but it took around 38 seconds to produce the first video (low quality) and around 4 minutes to make it high quality. Can you please refer me to any resource on how to make it faster (maybe running the model on multiple GPUs)?
@@msalama I mentioned in the video that it's going to be slow if you are running it on FREE Colab. To run it faster, you would need to run it locally on a high-end GPU.
How do you move the shoulders? Only moving the face is very unnatural; it often throws me off when I see an avatar that only talks with its face, without moving the body.
I created a copy in Drive and the first video I made was successful, about a minute and a half long. However, even after repeating all the steps, I was no longer able to generate even a 30-second video. The error appears when executing the fifth cell, when I try to preview and download the result. It simply says: "IndexError: list index out of range". I don't know what else to do; if you can help me, I would appreciate it.
Cool, BUT the only runtime option I got was Python 3, and after I selected the image I wanted to use, the animation script errored out with "/bin/bash: line 1: python3.8: command not found".
Just saw this tutorial is 2 months old... in AI terms that is 2 years old... Sadly, most AI tutorials become outdated if you blink, haha... It sure worked great a month or so ago... currently all the local running methods are non-functional...
@@ajmalkhan8013 I was finally able to get it running locally, directly via their Gradio interface... I was hoping to get the automatic1111 extension working, but currently it is inoperable... I'm sure there is a workaround, but after a few tests with SadTalker I realized it wasn't what I was looking for... I will wait another month to see where it goes, hahah... but as a FREE option it is pretty decent...