This guy just makes a video of things that are already available on the internet and doesn't even cite the source. You're taking someone's code with zero revisions and making a video just to farm views?
Setting `enable_limit` will solve your problem of the retriever returning incorrect responses:

```python
retriever = SelfQueryRetriever.from_llm(
    llm,
    vectorstore,
    document_content_description,
    metadata_field_info,
    enable_limit=True,
    verbose=True,
)
```
Really good question. I've been scripting it for Phi-3.5 using swift and am having a hard time with inference myself. I've been able to export the model (there's an export script in swift), but I'm struggling to run inference on my Colab instance. Have you tried exporting it and saving it to the HF Hub?
@@moslehmahamud They hid this line in those docs I'm trying at the moment:

```shell
CUDA_VISIBLE_DEVICES=0 swift infer \
    --ckpt_dir output/qwen2-vl-7b-instruct/vx-xxx/checkpoint-xxx \
    --load_dataset_config true \
    --merge_lora true
```

I think this creates a merged model that you can then load in a script like this:

```python
import torch
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

# Load the merged model
model_checkpoint = "swift/output/qwen2-vl-7b-instruct/v2-20240919-150643/checkpoint-1200-merged"
model = Qwen2VLForConditionalGeneration.from_pretrained(
    model_checkpoint, torch_dtype=torch.bfloat16, device_map="auto"
)

# Load the processor
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct")
```
Good info, mate! Can you guide me on how to serve this quantized/fine-tuned model as an API endpoint? I tried vLLM and LitServe, but in both cases they didn't support a local model or our own fine-tuned model.
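Until you find a serving framework that accepts your local checkpoint, a bare-bones endpoint can be stood up with just the Python standard library. This is a hedged sketch: `fake_generate` is a stub standing in for your real model call (e.g. a `transformers` pipeline on the merged checkpoint), and the port is arbitrary.

```python
# Minimal sketch of serving a local fine-tuned model as a JSON HTTP endpoint
# using only the Python standard library. Model inference is stubbed out in
# fake_generate -- swap in your own model.generate / pipeline call there.
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def fake_generate(prompt: str) -> str:
    # Placeholder for the actual model inference call.
    return f"echo: {prompt}"

class InferenceHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the JSON request body: {"prompt": "..."}
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        reply = fake_generate(payload.get("prompt", ""))

        # Send back {"response": "..."} as JSON.
        body = json.dumps({"response": reply}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("127.0.0.1", 8000), InferenceHandler).serve_forever()
```

You can then hit it with `curl -X POST localhost:8000 -d '{"prompt": "hi"}'`. For anything beyond a demo, a proper server (LitServe, FastAPI, vLLM's OpenAI-compatible server) is the better route.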
There are numerous other videos on YouTube that are 20+ minutes long. They have their merits, but yours is nice and straight to the point, with examples! Thank you!
Hi Mosleh, great video! 💜 I am one of the maintainers of LitServe and would love to talk to you. Please feel free to reach out (I'm also sending an email).
Hi Mosleh, thanks for sharing this straightforward tutorial! I wonder how to use a custom dataset? I'm struggling to find the path to latex-ocr-print, also on ModelScope.
@@moslehmahamud Follow-up question: how do you create the custom datasets? I heard that creating a synthetic dataset is faster than labelling by hand. I hope there's an easy-to-follow tutorial on it.
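On the synthetic-data point: the usual trick is to have a strong LLM write the answers (and often the questions too), so you skip hand-labelling. A minimal sketch of that loop, with the LLM call stubbed out (`generate_answer` is a placeholder, and the `query`/`response` field names are just one common convention, so check what your fine-tuning framework actually expects):

```python
# Toy sketch of generating a synthetic instruction dataset as JSONL.
# generate_answer is a stub; in practice it would call a strong LLM,
# which is what makes synthetic data faster than manual labelling.
import json

def generate_answer(question: str) -> str:
    # Stand-in for an LLM call that writes the target response.
    return f"Here is an explanation of: {question}"

topics = ["LoRA fine-tuning", "quantization", "vision-language models"]
records = [
    {
        "query": f"Explain {t} in one paragraph.",
        "response": generate_answer(f"Explain {t} in one paragraph."),
    }
    for t in topics
]

# One JSON object per line (JSONL) is the usual training-data layout.
with open("synthetic.jsonl", "w") as f:
    for r in records:
        f.write(json.dumps(r) + "\n")
```

You'd then point the fine-tuning CLI at the JSONL file instead of a built-in dataset name, and ideally filter/deduplicate the generated pairs before training.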
I have custom data that also includes conversation history, and I want to fine-tune Qwen2 with it. Where should I put the history? Should I write it into the input, and if so, should I use a specific format or make up my own? Maybe something like "User: hello / AI: Hello"? I can't find any answers.
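On the history question: rather than inventing a "User: ... / AI: ..." string, most fine-tuning frameworks accept multi-turn history as a structured list of turns and apply the model's chat template for you. Below is a sketch using the OpenAI-style `messages` layout; the field names are one common convention, not necessarily what your specific framework requires, so check its dataset docs:

```python
# Sketch of encoding multi-turn conversation history for fine-tuning.
# The "messages" list (role/content pairs) is the OpenAI-style chat format
# that many fine-tuning stacks accept; field names here are illustrative.
import json

conversation = {
    "messages": [
        {"role": "user", "content": "hello"},
        {"role": "assistant", "content": "Hello! How can I help?"},
        {"role": "user", "content": "what did I just say?"},
        {"role": "assistant", "content": "You said 'hello'."},
    ]
}

# One conversation per line (JSONL), the usual dataset layout.
with open("train.jsonl", "w") as f:
    f.write(json.dumps(conversation) + "\n")
```

The advantage of the structured form is that the training pipeline can insert the model's own special tokens (system/user/assistant markers) consistently, instead of you guessing a separator format.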