ComfyUI With Florence 2 Vision LLM - This Is Not Just A Segmentation Model 

Future Thinker @Benji

ComfyUI With Florence 2 Vision LLM
In this video, I delve into a new LLM - Florence 2, an extraordinary vision foundation model developed by Microsoft. Join me as I discuss its features, demonstrate its capabilities, and guide you through the installation process.
Florence 2: An Image-to-Text Prompt Large Language Model
Florence 2 is trained on the FLD-5B dataset (about 5.4 billion annotations on 126 million images, according to the paper), which helps it produce accurate and detailed text for images. In this video, I'll showcase two sets of custom nodes that connect to Florence 2: the KJ version and the Spacepxl version. These custom nodes enable segmentation, image captioning, and object detection.
Explainer About Florence-2 : • Florence-2 And Deepsee...
Workflows In This Tutorial : / 106792381
ComfyUI-Florence-2
huggingface.co/microsoft/Flor...
arxiv.org/abs/2311.06242
github.com/spacepxl/ComfyUI-F...
github.com/kijai/ComfyUI-Flor...
Installing Florence 2 Custom Nodes in ComfyUI
Before we dive into the demonstrations, we need to install the Florence 2 custom nodes. Don't worry, I'll guide you through the process step by step. Open ComfyUI Manager, search for the custom nodes, click Install, and wait for the downloads to complete. Once they are installed, the Florence 2 custom nodes are available in ComfyUI.
Exploring the KJ Version of Custom Nodes
Let's start by testing the KJ custom nodes. With just two simple custom nodes, you'll be able to perform segmentation, captioning, and bounding-box detection. I'll demonstrate their usage on an example image, so you can see clearly how these custom nodes fit into your workflow.
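To make the difference between a bounding box and a mask concrete, here is a minimal sketch that turns box coordinates into a rectangular binary mask. It assumes a detection result shaped like a dictionary with `bboxes` (each `[x1, y1, x2, y2]` in pixels) and `labels`; that shape, the function name `boxes_to_mask`, and the example values are assumptions for illustration, not the KJ nodes' actual code.

```python
from typing import Dict, List

Mask = List[List[int]]


def boxes_to_mask(detections: Dict[str, list], width: int, height: int) -> Mask:
    """Rasterise [x1, y1, x2, y2] boxes into a binary mask (1 = inside a box)."""
    mask = [[0] * width for _ in range(height)]
    for x1, y1, x2, y2 in detections["bboxes"]:
        # Clamp to the image so out-of-range coordinates cannot raise IndexError.
        left, right = max(0, int(x1)), min(width, int(x2))
        top, bottom = max(0, int(y1)), min(height, int(y2))
        for y in range(top, bottom):
            for x in range(left, right):
                mask[y][x] = 1
    return mask


# Hypothetical detection result for a 6x4 image (values invented for illustration).
example = {"bboxes": [[1.0, 1.0, 4.0, 3.0]], "labels": ["car"]}
mask = boxes_to_mask(example, width=6, height=4)
for row in mask:
    print("".join(str(v) for v in row))
```

A mask built this way is always rectangular. A segmentation output follows the object's real contour instead, which is why the two are separate capabilities.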
Unleash the Power of the Spacepxl Version
Next, we'll explore the Spacepxl version of the Florence 2 custom nodes. These custom nodes offer even more features and functions, allowing you to create diverse and versatile workflows. With separate custom nodes for each capability, you'll have the flexibility to incorporate Florence 2 seamlessly into your ComfyUI projects.
The Time is Now: Experience the Future of AI Image Generation
Florence 2 harnesses the potential of large language models, providing accurate and detailed text descriptions for any element in an AI image. Witness the magic of caption-to-phrase grounding, region captions, and object detection as Florence 2 effortlessly combines text and visuals. Prepare to be amazed by the future of AI image and video generation!
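As a rough illustration of how one model covers these different capabilities: Florence 2 picks its task from a special token at the start of the text prompt. The sketch below maps a few capabilities to the task tokens as I understand them from the Hugging Face model card. The dictionary keys and the helper `build_prompt` are hypothetical names, and this is not necessarily how either set of custom nodes builds its prompts.

```python
from typing import Optional

# Task tokens as listed on the Florence-2 model card (to the best of my knowledge).
# The dictionary keys are hypothetical names chosen for this example.
TASK_TOKENS = {
    "caption": "<CAPTION>",
    "detailed_caption": "<DETAILED_CAPTION>",
    "object_detection": "<OD>",
    "dense_region_caption": "<DENSE_REGION_CAPTION>",
    "caption_to_phrase_grounding": "<CAPTION_TO_PHRASE_GROUNDING>",
    "referring_expression_segmentation": "<REFERRING_EXPRESSION_SEGMENTATION>",
}

# Tasks that also need a text input (a caption or phrase to locate in the image).
TASKS_NEEDING_TEXT = {
    "caption_to_phrase_grounding",
    "referring_expression_segmentation",
}


def build_prompt(task: str, text: Optional[str] = None) -> str:
    """Return the prompt string for a task: the task token plus optional text."""
    if task not in TASK_TOKENS:
        raise KeyError(f"unknown task: {task}")
    if task in TASKS_NEEDING_TEXT and not text:
        raise ValueError(f"task '{task}' needs a text input")
    token = TASK_TOKENS[task]
    return token if text is None else token + text


print(build_prompt("object_detection"))
print(build_prompt("caption_to_phrase_grounding", "a green car"))
```

Caption-to-phrase grounding takes a caption as extra input and returns boxes for the phrases in it, while object detection needs only the image and the task token.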
If you like tutorials like this, you can support our work on Patreon:
/ aifuturetech
Discord : / discord

Category: Science

Published: 23 Jun 2024
Comments: 39
@reaperhammer 6 days ago
It will be interesting to see how you integrate this into other workflows, as you suggested.
@TheFutureThinker 6 days ago
Yes, I will make another video about that. It should be interesting.
@Ai-dl2ut 6 days ago
@TheFutureThinker Can't wait :)
@TheCinefotografiando 6 days ago
I have found myself watching your videos more and more.
@swannschilling474 6 days ago
Great tutorial! Thanks!!😊
@TheFutureThinker 6 days ago
Glad it was helpful!
@crazyleafdesignweb 6 days ago
That is great stuff! We are moving beyond Stable Diffusion, with more alternatives.
@TheFutureThinker 6 days ago
There's more 😉 Well, I think many users are going to keep SD1.5 and SDXL and throw the others from Stupidity AI in the bin. 🤭
@Rico-nj3vl 6 days ago
Very nice!
@TheFutureThinker 6 days ago
Thank you! Cheers!
@x3gxu 6 days ago
Hi. How did you make the talking head in the corner? Looks pretty good. It's not wav2lip. V-express? Hallo? I wasn't able to achieve results like this, so I'm very interested. Can you point me in the right direction?
@pabloapiolazza4353 5 days ago
Also interested!
@vitalis 5 days ago
So cool
@TheFutureThinker 2 days ago
Yup
@context_eidolon_music 6 days ago
Holy crap!
@kalakala4803 6 days ago
Nice! I just checked the Florence-2 LLM video you did. This AI model looks promising. Can you integrate this with AnimateDiff V2V?
@TheFutureThinker 6 days ago
😉👍 You got it
@promptaganda 23 hours ago
Using the spacepxl node, I am getting strange polygons for all images I try to run region-to-segmentation on. The captioning is working correctly. Any ideas?
@jairuskersey8311 6 days ago
Nice vid. Can you also make a tutorial on how you made the talking avatar in this video? Thanks~
@TheFutureThinker 6 days ago
Just use Hedra. It's a very easy website, no tutorial needed :) I believe you can do it.
@SageGoatKing 5 days ago
I don't miss the right-click menu bar at all since getting the sidebar where I can pin my favorite nodes, etc.
@TheFutureThinker 5 days ago
Normally, I use search. I don't have favourite nodes, because I use too many.
@triojakeson116 4 days ago
Hey bro, I wanted to ask about the Kling AI video. Can you tell me how long it takes to get accepted after joining the waitlist? I've already been on the waitlist for one day and just want to make sure. If you know, please reply, thanks.
@TheFutureThinker 4 days ago
It depends; some got it after a few days of waiting.
@triojakeson116 4 days ago
@TheFutureThinker Also, bro, I'm from India, and all Chinese apps are banned here, so I had to use an American VPN and a fake Chinese number. Do you think they will accept my waitlist request if they see all this? Will they check all this info? 😢
@TheFutureThinker 4 days ago
@triojakeson116 Sorry to hear that. This is more about the company's policy, so I have no comment.
@triojakeson116 4 days ago
@TheFutureThinker OK bro, I will update if something happens 😭
@TheFutureThinker 4 days ago
@triojakeson116 But I wish you good luck. I see other people are getting access now, so hopefully it will be okay for you.
@patagonia4kvideodrone91 4 days ago
There are other nodes, I don't remember the name now, where you tell it to detect such-and-such a thing and it generates the mask automatically, not as a square but with the object's real contour.
@TheFutureThinker 4 days ago
Segment Anything
@promptaganda 1 day ago
@TheFutureThinker Every time I've tried to add a prompt to a Segment Anything mode, it makes zero masks... Any suggestions?
@TheFutureThinker 1 day ago
@promptaganda What are your settings?
@nkofr 6 days ago
What's the use of the 'finetuned' versions?
@TheFutureThinker 6 days ago
They are models fine-tuned on a collection of downstream tasks.
@nkofr 6 days ago
@TheFutureThinker Excuse me, but what does that mean?
@riggitywrckd4325 3 hours ago
It means that if you have a dataset of things you want it to learn, you can train it on those ideas and it will learn to spot them, like it does in this one.
@bilalalam1 5 days ago
Hi, I would like to use Florence 2 with LM Studio.
@TheFutureThinker 2 days ago
Not sure, but I use Open WebUI with Ollama.