These are the mad scientists I remember, and no wonder SD felt like you were doing something interesting; these guys were running the show. I guess after a bit they left, other people took over, and made SD boring.
@@allyourtechai I got in on beta wave 2, and it was like we were doing something special and it was fun. Then they started taking away the uniqueness of the experience. Yeah you're right.
Cool, the author explains it really well. Following the author's procedure, I also ran Flux online on MimicPC, and the results are not much different!
Can this model be used in the same way as other SD models, i.e. LoRAs, ControlNet, etc.? In other words, is this the same architecture, or is it completely different? And thanks for this nice video.
I believe those will all be possible very soon. The architecture is different enough that you can’t just drop this in as a replacement, but I think this model is impressive enough that development will happen quickly.
Would like to see an apples to apples comparison between Flux & SD3 on similarly specified prompts, to see whether it in fact "completely destroys" SD3, or whether they're reasonably comparable...
Love how people are hyping up Flux so much because it's what SD3 was supposed to be, lol. I think SDXL and Pony Diffusion have developed enough that they are already incredible by themselves, especially because of the community, to the point where I'm not really impressed by new AI image models. I love that it adheres to prompts better, and being able to generate text is also awesome, but those are things you could already add in tools like Canva anyway. What I'm excited to see more of is workflows for fully generated/animated high-quality scenes.
Can this be locally installed on a MacBook Pro M3 Max with 96 GB? What is the pricing/usage/credits etc. of Pixel Dojo? Looks pretty cool. I’d love to support your product if I can swing it.
You should be able to run the schnell model locally. It’s still very impressive. The pro model is not open source, but that’s what’s running on Pixel Dojo. It’s $25 per month and includes over 20 image tools.
Great video, but you didn’t do the most important test: hands and limbs 😅. Also want to ask: can Flux do anime, and can people create their own checkpoints or LoRA equivalents with it?
Haha, good point. It’s actually quite good at hands and feet, which is also impressive. I don’t think you can do a LoRA or equivalent just yet, but soon!
There is a resemblance slider in the options. Turn that up and turn creativity down, and you get the original image upscaled with very subtle changes. I’ll make that a preset (subtle upscale).
But how well can it do the following things: 1. Different art styles, e.g. paintings of various eras. 2. Novel concepts (e.g. a "stirrup-shaped bottle"). 3. Selective details (e.g. a group photo of 50 people, or intricate latticework). 4. Photos that look like they were taken casually. 5. Stylistically bad drawings (a four-fingered scribbled hand that looks like it was drawn by a child).
I've been messing with it this evening locally in ComfyUI with the schnell version. For realism it's definitely top notch, and if that's your thing then this is really good, especially at combining various elements in an image. Because of your question I went ahead and tried your #2, as I hadn't tried anything like that, and it blew it out of the water. It gave me //exactly// this: "extreme close up of a beer can on a kitchen countertop, the beer can has an anthropomorphic image of a half penguin half fish smiling holding a can of the same beer he is printed on drinking it. extreme depth of field with focus on the beer can, volumetric lighting, masterpiece, ultra realistic"

However, in regards to your other questions: I've been experimenting with specific anime styles and it really struggles to hit them, even when I've been super descriptive with references to the specific artist or art style, and even after trying to improve the prompt by running it through a prompt helper. It also struggles with characters, as I'm assuming it has no idea who popular characters are. I tried very common anime characters like Rei Ayanami, where it completely failed on her hairstyle, and others like Spike Spiegel, where it just completely failed. I didn't try Goku, but if it failed at those I wouldn't be surprised. Outside of anime, it also struggled with other painting styles: I asked it to do a remix of Van Gogh's Starry Night and got nothing even remotely close to it.

So to answer your questions: 1 - 5/10, 2 - 9/10, 3 - 8/10.

The main beef I have with it is how slow it is. A 3070 Ti isn't enough unless you want to queue up a huge batch overnight. SDXL is about 15-20 times faster at generating images, and that's even including latent upscaling (I haven't tried a latent upscale with this model yet). I'm doing 1024x1024 images and it's roughly 680 seconds of wait time per image, whereas on SDXL I can spit those out in about 30-45 seconds if the models are already loaded in memory before generating.
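A quick back-of-the-envelope check of the speed gap described above, using only the timings quoted in the comment (one user's numbers on a 3070 Ti; real results depend heavily on hardware, resolution, and precision):

```python
# User-reported timings for 1024x1024 generations (from the comment above).
flux_seconds = 680            # Flux schnell, local ComfyUI, per image
sdxl_seconds = (30 + 45) / 2  # SDXL average, models preloaded in memory

# Ratio of wait times: Flux is roughly 18x slower here,
# consistent with the "about 15-20 times faster" estimate for SDXL.
ratio = flux_seconds / sdxl_seconds
print(f"Flux is ~{ratio:.0f}x slower than SDXL on this setup")
```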
@@aylictal I've heard others say similar things about the speed. I'm running it on Nvidia H100 GPUs (roughly $25,000 each) on Pixel Dojo, so there is a clear advantage in terms of speed. I'm betting the model will improve over time though.
UI/UX: Figma. Frontend: HTML, CSS, JavaScript. Backend: JavaScript (NodeJS).
Starting point:
1. Draw a red rectangle in the browser; when you click on it, it turns blue.
2. Draw a red rectangle in the browser; when you click on it, it displays a picture.
3. Draw a red rectangle in the browser; when you click on it, it displays multiple pictures.
4. Write a script that takes text as input and outputs a picture (use local AI, e.g. ComfyUI).
5. Write a script that takes multiple texts as input and outputs pictures (use local AI, e.g. ComfyUI).
6. Carefully stitch 1-3 and 4-5 together.
That's how I would do it. Maybe it helps ^^
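Step 1 above can be sketched in a few lines of plain HTML/JS (the `box` id and inline styling are placeholders I chose for illustration, not anything from the original comment):

```html
<!-- Minimal sketch of step 1: a red rectangle that turns blue when clicked. -->
<div id="box" style="width:200px; height:120px; background:red;"></div>
<script>
  const box = document.getElementById("box");
  box.addEventListener("click", () => {
    box.style.background = "blue";  // step 2/3 would swap in an <img> here
  });
</script>
```

Each later step then just replaces the click handler's body, which is why the numbered progression works well as a learning path.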
So, AMD doesn't work out of the box on Windows? And to make an AMD GPU work, it must be in a Linux environment? Please make a detailed tutorial on using ComfyUI specifically with an AMD GPU. I have an MSI Armor Radeon RX 580 8GB on Win 10. Thanks. Looking forward to your video.
@@allyourtechai The schnell and dev model are the same size. Dev is distilled for quality and prompt adherence, schnell is distilled for 4 step generation. Pro is not local.
If you don't want the node interface of ComfyUI, it works in SwarmUI, which is still Comfy under the hood but with a UI similar to A1111 and the others. Both dev and schnell work.
@@allyourtechai Only the smallest model, or the dev model too? It seems those models are about the same size (with dev being more capable, judging by the graph in your video), so I guess the same VRAM requirements too? Thanks.
@@fossil98 Thanks. I have 64GB of RAM installed, so FP16 works OK (when nothing is running in the background = more free RAM). The question was more about VRAM requirements. Meanwhile, I hear people with only 8GB of VRAM can run it, so good to see that. The dev model should indeed be more capable, but it takes 28s on my 4080 to generate an image (20 steps), while schnell takes only 8 seconds (4 steps) without obvious image quality loss. So I think I will use that one with FP8, because I need the RAM for other stuff on my PC.
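Interestingly, the per-step cost implied by those two timings is in the same ballpark; schnell's wall-clock advantage comes mostly from needing far fewer steps. A rough calculation from the 28s/20-step and 8s/4-step figures quoted above (one user's 4080; actual step time varies with resolution and precision):

```python
# Per-step time implied by the user-reported timings above.
dev_total, dev_steps = 28, 20          # dev model: 28 s for 20 steps
schnell_total, schnell_steps = 8, 4    # schnell model: 8 s for 4 steps

dev_per_step = dev_total / dev_steps              # 1.4 s per step
schnell_per_step = schnell_total / schnell_steps  # 2.0 s per step

# schnell is ~3.5x faster overall despite a slightly slower step,
# because it is distilled to need only 4 sampling steps.
print(dev_per_step, schnell_per_step, dev_total / schnell_total)
```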