Qwen2-VL: The Best Open Source Vision Model for OCR & VQA 

AI Anytime
32K subscribers
6K views

Published: Oct 30, 2024

Comments: 23
@atultiwari88 1 month ago
Awesome video, as always. I request you to kindly make a fine-tuning tutorial on Qwen2-VL 2B with any VQA. That would help us learn more. Thank you so much.
@ROKKor-hs8tg 10 days ago
Thanks for the video. A question: can it run on a T4?
@IsmailIfakir 1 month ago
Is there a multimodal LLM that can be fine-tuned on image, text, audio, and video?
@ucduyvo4552 1 month ago
How do I fine-tune Qwen2-VL with custom image data? Thanks for your video.
@ganeshchowdhary7479 1 month ago
I'm looking for the same thing too. Did you find a way? Can you please guide me?
@AliAlias 1 month ago
Nice, very beautiful ✌️🌹
@brknmed 2 days ago
Please, I need to create a script or anything that helps me OCR bulk data for a free app for students. Who can help?
@QorQar 1 month ago
How do I run it locally on images with llama-cpp-python and a GGUF model?
@jeremiahagware4289 1 month ago
Can you please explain what the instruct version of an LLM means? Like mistral3 instruct and so on.
@vinitlondhe584 1 month ago
Instruct means they are fine-tuned versions of the base models, trained specifically to follow instructions in users' prompts and respond in a more guided and useful manner.
@jeremiahagware4289 1 month ago
@vinitlondhe584 Thank you
@BLACKSHADOW-ok5fv 1 month ago
Can we use it for analysis of UI designs?
@AIAnytime 1 month ago
Absolutely
@SonGoku-pc7jl 7 days ago
1:47 What is Lamini? Talk about this ;)
@mohsenghafari7652 1 month ago
How can I use it for OCR of Persian-language text?
@IlllIlllIlllIlll 1 month ago
Can you add chapters?
@ThePedronie 1 month ago
Can you try with video?
@AIAnytime 1 month ago
Watch the latest video