Тёмный

FLUX Fine Tuning with LoRA | Unleash FLUX's Potential 

AINxtGen
Подписаться 544
Просмотров 23 тыс.
50% 1

Опубликовано:

 

27 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 57   
@TheColonelJJ
@TheColonelJJ Месяц назад
Thank you for adding how much VRAM you have!!! That was helpful! I also have 12.
@AINxtGen8
@AINxtGen8 2 месяца назад
Fal.ai fal.ai/models/fal-ai/flux-lora-general-training You can also train LoRa on civitai and replicate.com: civitai.com/models/train replicate.com/ostris/flux-dev-lora-trainer/train If your computer has a powerful GPU, you can train locally, script to traning on local machine: github.com/ostris/ai-toolkit/tree/main
@부정선거4.15
@부정선거4.15 2 месяца назад
@@AINxtGen8 thanks bro
@steve-g3j6b
@steve-g3j6b Месяц назад
what if I want my generations to be 16:9 should I use that size of pics to train? or 1:1 is best?
@AINxtGen8
@AINxtGen8 Месяц назад
@@steve-g3j6b Hello, thank you for your question: In fact, you don't need to crop your images to a specific size because I recently learned that fal.ai also uses ai-toolkit script from Ostris for training LoRA. This script supports a technique called 'bucketing', which is an automatic method that groups images of similar aspect ratios together during training. This means you don't need to manually crop your images to a specific size anymore. Bucketing is a technique that allows the model to train on images of various sizes and aspect ratios efficiently. It works by grouping similar-sized images into 'buckets' and processing them together, which helps maintain image quality and reduces the need for excessive resizing or cropping. This approach is particularly useful when working with datasets that contain images of different dimensions, as it preserves the original aspect ratios while still allowing for efficient batch processing during training.
@steve-g3j6b
@steve-g3j6b Месяц назад
@@AINxtGen8 I would imagine it will make much better backgrounds too (assuming the ai will also learn some of the BG)
@steve-g3j6b
@steve-g3j6b Месяц назад
@@AINxtGen8 would be a cool vid to have a comprehensive look at this workflow.
@steve-g3j6b
@steve-g3j6b Месяц назад
would love a followup video where you learned whats the best way to use those sliders on the fal web.
@Ittiz
@Ittiz Месяц назад
you want better results? hand write the captions for each training image in the same way you like to write your own prompts!
@AINxtGen8
@AINxtGen8 Месяц назад
I agree, writing captions manually will usually yield better results.
@pedrohenriquespl1038
@pedrohenriquespl1038 8 дней назад
Hey buddy, how u doing? This is by far the best video I’ve seen so far a out LoRA training! Tks a lot!! When u say that if u were going to retrain this LoRA you’d need to prepar le better quality data, what do tou mean by that? More pictures? Better pictures? Different settings when training? Tks bro 👊
@ahtoshkaa
@ahtoshkaa 2 месяца назад
Great guide. thank you!
@Reddkomet
@Reddkomet Месяц назад
Can you make a tutorial for creating style Loras?
@AINxtGen8
@AINxtGen8 Месяц назад
Yes, I am planning to make a video about style LoRA training
@ee89199
@ee89199 2 месяца назад
thank you can i use this to train my dog?
@AINxtGen8
@AINxtGen8 2 месяца назад
yes, of course you can
@charagga
@charagga 2 месяца назад
@@AINxtGen8 I think ee89199 is trying to be funny 🤔
@sirishkumar-m5z
@sirishkumar-m5z 2 месяца назад
Machine Learning: SmythOS’s pre-configured support for machine learning frameworks accelerates model development and deployment, streamlining the machine learning lifecycle.
@quangminhnguyen7834
@quangminhnguyen7834 Месяц назад
Can I use the trained lora to generate images on any free website that has flux?
@geekyprogrammer4831
@geekyprogrammer4831 Месяц назад
gpus dont come for free
@fahimabdulaziz4255
@fahimabdulaziz4255 Месяц назад
can I train lora for a consistent streetwear t-shirt design style?
@AINxtGen8
@AINxtGen8 Месяц назад
Certainly, you can train a LoRA for a consistent streetwear t-shirt design style. Training for a specific style is generally more challenging than training for a character, but it's definitely achievable. Here are some tips to help you succeed: Data preparation: Gather a larger dataset of high-quality images (at least 50 good quality images). There's no need to crop these images due to the bucketing technique which is fal also used Training steps: I recommend increasing the number of training steps to at least 2000. This allows the model more time to learn the nuances of the style. Learning rate: Start with a learning rate of 0.0002. You can adjust this later if needed. Checkpoints: Make use of the new feature on fal called 'Experimental Multi Checkpoints Count'. Set this to save 4 checkpoints during the training process. This is crucial because it allows you to test different stages of the model after training and choose the one that produces the best results. Remember, training for a style requires more attention to detail and experimentation. Don't be discouraged if your first attempt isn't perfect - it often takes some fine-tuning to get the desired results.
@fahimabdulaziz4255
@fahimabdulaziz4255 Месяц назад
@@AINxtGen8 thank you soo much, Ma Sha Allah
@mehmetalirende
@mehmetalirende 2 месяца назад
what about combining 2 loras in 1 picture for couples?
@aknownj
@aknownj 2 месяца назад
A whole romantic getaway to any fictional destination of your imagination
@AINxtGen8
@AINxtGen8 2 месяца назад
yes you can, use Lora Stack node in ComfyUI, refer to this workflow link: openart.ai/workflows/macaque_keen_26/flux-with-multi-lora-loader-workflow/DfB4A8yL27WCwgEGi3YA or try running on replicate: replicate.com/lucataco/flux-dev-multi-lora
@ronnydaca
@ronnydaca 2 месяца назад
​@@AINxtGen8 It's possibile with forge?
@AINxtGen8
@AINxtGen8 Месяц назад
@@ronnydaca in forge you can also load multiple lora, and adjust the weights for each lora, but I haven't actually tested the results for lora used for Flux on Forge imgur.com/HYCFTrq
@chrisgg
@chrisgg 2 месяца назад
I think, taking a celebrity creates out of the box good results without training a model?
@AINxtGen8
@AINxtGen8 2 месяца назад
As I mentioned in this part of the video: 00:00:20 I chose Scarlett Johansson for testing purposes. The reason for this choice is that when I used her name as a keyword, Flux generated images that didn't resemble Johansson. This suggests that her name was likely removed from Flux's training data. I selected Scarlett Johansson for this test because she is a well-known celebrity, which makes it easier to compare the results before and after training.
@hellfire3278
@hellfire3278 2 месяца назад
Can I train a LoRA model to control the measurements of a mannequin? The idea is to use trigger words for the waist, chest, and hip measurements, for example: (chest: 94cm; waist: 72cm; hips: 98cm). However, I'm unsure if all of these can be incorporated into a single LoRA model, as it might become complicated. In short, do you know how the trigger words interact with the training dataset?
@AINxtGen8
@AINxtGen8 2 месяца назад
Thank you for your interesting question about controlling mannequin measurements using AI. While training a LoRA model for this purpose is creative, it might be complex and challenging to achieve the desired results. I haven't seen anyone create a LoRA specifically for controlling measurements (possibly due to the difficulty in achieving the desired results). Training such a model to accurately control multiple body measurements simultaneously (chest, waist, hips) would require an extensive and precisely labeled dataset, which could be difficult to create and maintain. Instead, I suggest using ControlNet, a simpler and potentially more effective approach. ControlNet allows for detailed control during image generation using sketches or guide images to control the mannequin's shape and measurements. This method offers several advantages: Precise control: Create a basic sketch with desired measurements. Flexibility: Easily adjust body shape by modifying the input sketch. Consistency: Generate multiple images with the same measurements. Intuitive workflow: Drawing or modifying a sketch is often easier than fine-tuning complex prompts. ControlNet can provide more accurate and consistent results in controlling mannequin measurements compared to the LoRA approach.
@sankyuubigan
@sankyuubigan Месяц назад
How do you think when will appear models without censorship, in which will be at once all the celebrities already trained ? I mean communities where publish these models, of course only for introductory viewing, because nsfw content can not be done because it is very bad from the point of view of morality.
@sebastianpodesta
@sebastianpodesta 2 месяца назад
Hi, if I want to make a Lora to give people baby faces or Asian faces, should I make a Lora with many different Asian or baby faces? What would make a good data set?
@AINxtGen8
@AINxtGen8 2 месяца назад
Hi, as I understand, you want to create a baby cute, kawaii style. If you're just creating a general image in this style, Flux can do it. Try some of the prompts below to see. If you want to create this style for a specific face, you'll need to create a LoRA for that face, then combine it with style keywords like those below. Another method that doesn't require LoRA is using IPAdapter Face, but it only works well on SDXL versions. Currently, FLUX doesn't have a well-functioning IPAdapter, although Xlabs has just released an IPAdapter model for FLUX, it's not very good. Reference prompts: "Asian with baby face, cute chibi style, big eyes" "Kawaii Asian portrait, childlike expression" "Cartoon Asian character, baby face, adorable" "Chibi Asian, oversized head, tiny body, playful smile" "Cute Asian portrait, youthful features, cartoon-like eyes" Images created from prompts: imgur.com/a/SQP9Ln5
@rtberbary0101
@rtberbary0101 2 месяца назад
for some reason, it keeps failing for me. doesn't start the training eventhough i changed nothing. only uplaod my photos and trigger word same as you did. anyone else having this issue?
@AINxtGen8
@AINxtGen8 2 месяца назад
Have you tried clicking the "see log" button in the left hand window after clicking the "start" button? Does the log show anything?
@rtberbary0101
@rtberbary0101 2 месяца назад
@@AINxtGen8 i figured it out! apparently there is a limit on photos. you can add a maximum of 99 images for the training. anything beyond that results in an error
@부정선거4.15
@부정선거4.15 2 месяца назад
Hi thanks. Where could I get the images I need to use?
@AINxtGen8
@AINxtGen8 2 месяца назад
Hi ! Thank you for your question. Depending on what type of LoRA you want to train - whether it's for a character, object, or style - one of the most commonly used image sources is Google (filtered for large images): images.google.com/advanced_image_search Alternatively, you can also use AI image generators to create a dataset for training. One example of this approach is using ComfyUI. You can refer to this workflow: openart.ai/workflows/serval_quirky_69/one-click-dataset/QoOqXTelqSjMwZ0fvxQ9
@frizzfrizz3550
@frizzfrizz3550 Месяц назад
great video, I want to contact you for a chat or a call, how can I do?
@paulfranco9673
@paulfranco9673 2 месяца назад
how did you get it to generate the thumbnail? i'm trying to use Flux to generate multiple views of characters but I'm struggling to do so, if you could give me some guidance pls!
@AINxtGen8
@AINxtGen8 2 месяца назад
The prompt will generally be like below, with the keyword here being "character design sheet". Below is the prompt that I used ChatGPT to create (I input a similar sample image and then asked ChatGPT to generate this prompt): " Character design sheet for Scarlett Johansson as Black Widow in modern 2D animation style. Horizontal layout. Left side: full body front and side views in signature black catsuit with front zipper. Right side: two close-up face views (3/4 and profile) showing detailed features. Add third full body view in dynamic fighting pose. Short wavy red hair, large green eyes with highlights, bold red lips. Exaggerated body proportions for visual appeal. Clean, sharp lines with minimal shading. Flat colors with subtle highlights. Include varied facial expressions: neutral, smiling, serious. Add rear view and close-ups of iconic accessories (e.g. wrist gauntlets, belt). White background with soft shadows. Professional, polished illustration style reminiscent of high-end animated series. "
@omegablast2002
@omegablast2002 Месяц назад
to reply to the title: literally no one said it was hard, its just extremely painfully long.
@charagga
@charagga 2 месяца назад
Thanks for the video! I wonder if one could use it for replacing fashion shoots. I would 1) train on a certain character/person/model (photo realistic ofc) 2) then train a let’s say skirt or fashion piece, maybe a couple of images of the piece 3) then somehow combine it How would you do this, would you also use controlNet for this?
@AINxtGen8
@AINxtGen8 2 месяца назад
Yes, you can, here's a simplified approach: 1. Train a LoRA for the Flux to create your specific character/model. Use ControlNet Pose to control the model's posing accurately. 2. Use ComfyUI's CatVTON node to change dress the AI-generated model in different outfits. This method combines character-specific LoRA models with virtual try-on technology. You can refer to the node below: github.com/chflame163/ComfyUI_CatVTON_Wrapper openart.ai/workflows/HaxcrNaVvjae9pdkut64
@charagga
@charagga 2 месяца назад
@@AINxtGen8thanks a lot!
@debdutbhadurishorts
@debdutbhadurishorts 2 месяца назад
Can I use multiple people lora in same pic ? For example lora of scarlet and Donald Trump , together dancing. And if yes then how
@AINxtGen8
@AINxtGen8 2 месяца назад
Yes, you can train separate LoRAs and then load them together. If you're using ComfyUI, there's a node called 'LoRA Loader Stack' in the rgthree extension (which can be installed via Comfy Manager). You can use that node to load multiple LoRAs, and adjust the strength of each LoRA to achieve good results. imgur.com/a/GldHkqE I understand that Donald Trump was just an example, but if you want to quickly test whether Flux has been trained on a specific keyword, there's a recently launched website called fastflux.ai that can do this. This site uses the Flux Schnell model and generates images at a very high speed. imgur.com/PWOiPMM imgur.com/gubtT0v
@agnosticatheist4093
@agnosticatheist4093 2 месяца назад
You mean lora lora lora lora.....?
@shirleywang9584
@shirleywang9584 Месяц назад
Hi, I'm Tess from Digiarty Software. Interested in a collab?
@zorayanuthar9289
@zorayanuthar9289 2 месяца назад
Great guide but poor choices relating to models... Cameltoe come-on 😂
@sdprompts
@sdprompts 2 месяца назад
AI images 👍 AI voice 👎
@AINxtGen8
@AINxtGen8 2 месяца назад
Thanks for your feedback! I totally get it about the AI voice. My English isn't good, and when I tried recording myself, it sounded pretty rough. I worried viewers might struggle to understand me. While AI voices can't match a fluent speaker's emotion, I think it's better for tutorials than my voice right now. I'm always trying to improve, though! Any suggestions on making the videos better? I'm all ears!
@hasstv9393
@hasstv9393 Месяц назад
Replica is best cause it cost 2$
Далее
новое испытание
00:40
Просмотров 351 тыс.
Unlock Realistic & Film-Like Images in Flux AI
9:27
Просмотров 28 тыс.
"I need AI photos that look like me" - Here's how
22:18
Flux LoRA using FluxGym Tutorial: AI Image Training
5:57
FLUX INPAINTING IMAGE TO IMAGE COMFYUI WORKFLOW
9:50
AI images just got WAY too real. FLUX 1.1 deep dive
33:15
Fine Tune Flux Diffusion Models with Your Photos
51:57
Просмотров 2,1 тыс.