
LLAMA-3 🦙: EASIEST WAY To FINE-TUNE ON YOUR DATA 🙌

Prompt Engineering
168K subscribers
75K views

Published: 29 Aug 2024

Comments: 94
@engineerprompt · 3 months ago
If you are interested in learning more about how to build robust RAG applications, check out this course: prompt-s-site.thinkific.com/courses/rag
@pfifo_fast · 3 months ago
This video lacks a lot of helpful info... Anyone can just open the examples and read them, the same as you did. I would have liked some extra detail and tips on how to actually do fine-tuning... Some of the topics I'm struggling with: how to load custom data, how to use a different prompt template, how to define validation data, when to use validation data, what learning rates are good, and how to determine how many epochs to run... I'm sorry buddy, but I have to give this video a thumbs down, as it truly and honestly doesn't provide any useful info that isn't already in the notebook.
@ueka24 · 2 months ago
Hello, have you already found any other video or article about that? I am also struggling with the same issue.
@SpicyMelonYT · 2 months ago
@@ueka24 yeah me too, still not sure how to make a custom dataset and send it in
@SpicyMelonYT · 2 months ago
@@ueka24 Oh actually I figured it out, specifically the dataset part. Make sure you run the LoRA cell too; I skipped it at first, thinking he said not to. This is the code I ran:

alpaca_prompt = """Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
{}

### Input:
{}

### Response:
{}"""

EOS_TOKEN = tokenizer.eos_token  # Must add EOS_TOKEN

def formatting_prompts_func(examples):
    # instructions = examples["instruction"]
    instructions = ai_person_prompt  # my custom persona prompt instead of the dataset column
    inputs = examples["input"]
    outputs = examples["output"]
    texts = []
    for instruction, input, output in zip(instructions, inputs, outputs):
        # Must add EOS_TOKEN, otherwise your generation will go on forever!
        text = alpaca_prompt.format(instruction, input, output) + EOS_TOKEN
        texts.append(text)
    return {"text": texts}

from datasets import load_dataset

# Load your local JSON dataset
dataset = load_dataset("json", data_files="/content/main_dataset.json", split="train")
dataset = dataset.map(formatting_prompts_func, batched=True)

It points at a file in the notebook file manager. Just put main_dataset.json there, formatted like this:

[
    {
        "instruction": "Write a Funny Joke",
        "input": "Tell me a knock-knock joke.",
        "output": "Knock, knock. Who's there? Lettuce. Lettuce who? Lettuce in, it's freezing out here!"
    }
]
@MedicinalMJ · 23 days ago
Yeah I'm over halfway through and I'm just like wtf
@spicer41282 · 4 months ago
Thank you! More fine-tuning case studies on Llama 3, please! Your presentation on this is much appreciated 🙏
@engineerprompt · 4 months ago
Will be making a lot more on it. Stay tuned.
@metanulski · 4 months ago
One more comment :-). This video is about fine-tuning a model, but there is no real explanation of why. We fine-tune with the standard Alpaca dataset, but there is no explanation of why. It would be great if you could do a follow-up and show us how to create datasets.
@scottlewis2653 · 4 months ago
Mediatek's Dimensity chips + Meta's Llama 3 AI = The dream team for on-device intelligence.
@VerdonTrigance · 4 months ago
How do you actually train models? I mean unsupervised training, where I have a set of documents and want the model to learn from them and perhaps pick up the author's 'style' or tendencies.
@PYETech · 3 months ago
You need to create some process to transfer all the knowledge in these documents into the form of "prompt": "best output". Usually we use a team of agents to do it for us.
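For readers wondering how that process might look in practice, here is a minimal sketch, assuming the OpenAI Python client and a hypothetical list of pre-chunked document strings (the model choice and chunking are assumptions, not anything shown in the video):

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def make_pair(chunk: str) -> str:
    # Ask a chat model to distill one prompt/output pair from a document chunk.
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumption: any capable chat model works here
        messages=[
            {"role": "system", "content": 'From the given text, write one question and its answer as JSON with keys "prompt" and "output".'},
            {"role": "user", "content": chunk},
        ],
    )
    return response.choices[0].message.content

chunks = ["..."]  # hypothetical: your documents, split into model-sized chunks
pairs = [make_pair(chunk) for chunk in chunks]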
@goinsgroove · 1 month ago
Thank you for the video. Just an observation: the video glosses over how to prep your data. For example, I want to train a model to write in my style. How would I prep my data for training?
@shahzadiqbal7646 · 4 months ago
Can you make a video on how to use local Llama 3 to understand a large C++ or C# code base?
@iCode21 · 3 months ago
Search for Ollama.
@pubgkiller2903 · 4 months ago
I have already fine-tuned using Unsloth for testing purposes.
@engineerprompt · 4 months ago
Great, how are the results looking?
@pubgkiller2903 · 4 months ago
@@engineerprompt Great results, and thanks for your support to the AI community.
@pubgkiller2903 · 4 months ago
@@TheIITianExplorer The Unsloth library is very useful for fine-tuning with the LoRA technique. QLoRA is quantization plus LoRA, so if you use Unsloth you get the same output, as Unsloth already quantizes the LLMs.
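For context, a minimal sketch of what this means in Unsloth terms: the model is loaded already 4-bit quantized and LoRA adapters are attached on top, which together is effectively QLoRA (checkpoint name and hyperparameters here are illustrative):

from unsloth import FastLanguageModel

# Load a pre-quantized 4-bit Llama 3 checkpoint: the quantization half of QLoRA.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)
# Attach LoRA adapters on top of the quantized weights: the LoRA half.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,  # LoRA rank (illustrative)
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj", "gate_proj", "up_proj", "down_proj"],
)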
@roopad8742 · 4 months ago
What datasets did you fine-tune it on? Have you run any benchmarks?
@Joe-tk8cx · 4 months ago
Thank you so much for sharing, this was wonderful. I have a question: I am a beginner in the LLM world, which playlist on your channel should I start from? Thank you.
@ReubenAStern · 1 month ago
I wonder if this is how OpenAI got ChatGPT to say stupid things like "Humans are delicious", "I will destroy all humans" and that crap... It was blatantly done on purpose.
@georgearistides7704 · 7 days ago
Trying to download as zips is difficult because of Google Colab free tier limitations on RAM and disk space... any suggestions?
@lemonsqueeezey · 4 months ago
Thank you so much for this useful video!
@hadebeh2588 · 4 months ago
Thank you very much for your great video. I ran the notebook but did not manage to find the GGUF files on Hugging Face. I put in my HF token, but that did not work. Do I have to change the code?
@georgearistides7704 · 7 days ago
Can this be applied to a model on an AWS instance?
@senseitai · 1 month ago
Thanks for the great video. I have followed the Colab you shared and my notebook kernel is crashing. Does it work on an 8 GB GPU?
@modicool · 4 months ago
One thing I am unsure of is how to transform my data into a training set. I have the target format: the written body of work, but no "instruction" or "input" of course. I've seen some people try to generate it with ChatGPT, but this seems counter-intuitive. There must be an established method of actually manipulating data into a training set. Where is that piece?
@engineerprompt · 4 months ago
You will need {input, response} pairs in order to fine-tune an instruct model. Unfortunately, there is no way around it unless you are just pre-training the base model.
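To make the contrast concrete, a rough sketch of the two data shapes (field contents are placeholders, not from the video):

# Instruction fine-tuning needs paired records:
instruct_record = {
    "instruction": "Summarize the passage in one sentence.",  # placeholder task
    "input": "<a passage from your body of work>",
    "output": "<the response you want the model to learn>",
}
# Pre-training the base model only needs raw text:
pretrain_record = {"text": "<a raw passage from your body of work>"}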
@agedbytes82 · 4 months ago
Amazing, thanks!
@engineerprompt · 4 months ago
Glad you like it!
@cucciolo182 · 4 months ago
Next week Gemini 2 with text to video 😂
@georgevideosessions2321 · 4 months ago
Have you ever thought about writing a no-code, on-premise fine-tuning app?
@engineerprompt · 4 months ago
There is AutoTrain for that.
@juanrozo2888 · 2 months ago
Master, I have a question: if my dataset has the same format as Alpaca, do I need to upload it to Hugging Face to train, or can I use it locally, from my PC? Thanks 👍🏻
@KleiAliaj · 4 months ago
Great video mate. How can I add more than one dataset?
@StephenRayner · 4 months ago
Excellent, thank you
@KleiAliaj-us9ip · 4 months ago
Great video. But how do I add more than one dataset?
@DemiGoodUA · 4 months ago
Hi, nice video. But how do I fine-tune a model on my codebase?
@engineerprompt · 4 months ago
You can use the same setup. Just replace the instruction and input with your code.
@DemiGoodUA · 4 months ago
@@engineerprompt How do I divide code into "question - answer" pairs? Or can I place the whole codebase into a single instruction?
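One possible way to apply the "replace the instruction and input with your code" advice to a whole codebase, sketched here as an assumption rather than the video's method: pair each function's docstring (as the instruction) with its source (as the output).

import ast
import pathlib

records = []
for path in pathlib.Path("my_project").rglob("*.py"):  # hypothetical project folder
    tree = ast.parse(path.read_text())
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and ast.get_docstring(node):
            records.append({
                "instruction": ast.get_docstring(node),  # what the function should do
                "input": "",
                "output": ast.unparse(node),  # the implementation itself (Python 3.9+)
            })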
@jannik3475 · 4 months ago
Is there a way to sort of "brand" Llama 3, so that the model responds to "Who are you?" with a custom answer? Thank you!
@engineerprompt · 4 months ago
Yes, you can just add that as part of the system message
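A minimal sketch of that idea, assuming the tokenizer from the notebook and a Llama 3 style chat template (the persona is hypothetical):

messages = [
    {"role": "system", "content": "You are Ada, Acme Corp's in-house assistant."},  # hypothetical persona
    {"role": "user", "content": "Who are you?"},
]
# Render the conversation with the model's chat template before generation.
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)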
@nikolavukcevic360 · 3 months ago
Why didn't you provide any examples of training? It would make this video 10 times better.
@engineerprompt · 3 months ago
That is coming...
@RodCoelho · 4 months ago
How do you train a model by adding the knowledge in a book, which will likely only have a single column of text?
@engineerprompt · 4 months ago
In that case, you will have to convert the book into question-answer pairs and format them in a similar fashion. You can use an LLM to do the conversion.
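A rough sketch of that conversion, assuming a plain-text book file and a hypothetical generate_qa() helper that wraps an LLM call (the chunk size is an assumption):

book = open("book.txt", encoding="utf-8").read()  # hypothetical file
chunk_size = 2000  # characters; tune to your LLM's context window
chunks = [book[i:i + chunk_size] for i in range(0, len(book), chunk_size)]

rows = []
for chunk in chunks:
    question, answer = generate_qa(chunk)  # hypothetical LLM call
    rows.append({"instruction": question, "input": "", "output": answer})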
@danielhanchen · 4 months ago
Fantastic work and always love your videos! :)
@engineerprompt · 4 months ago
Thank you
@kingofutopia · 4 months ago
Awesome, thanks
@engineerprompt · 4 months ago
🙏
@petergasparik924 · 2 months ago
Don't even try to run it on Windows directly; just install Python and all the packages in WSL.
@engineerprompt · 2 months ago
Agree, Windows is not a good option for running any LLM tasks.
@researchpaper7440 · 4 months ago
Great, it was quick.
@balb4903 · 2 months ago
Is it possible to use a database directly as a dataset to fine-tune an LLM?
@engineerprompt · 2 months ago
You could; just make sure the data is in the proper format when you load it.
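For example, a minimal sketch using the datasets library's SQL loader (the table, columns, and database path are hypothetical):

from datasets import Dataset

dataset = Dataset.from_sql(
    "SELECT instruction, input, output FROM training_pairs",  # hypothetical table
    con="sqlite:///my_data.db",  # hypothetical SQLite database
)
# Then format it the same way as the notebook's JSON example above.
dataset = dataset.map(formatting_prompts_func, batched=True)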
@SeeFoodDie · 4 months ago
Thanks
@auhkba · 2 months ago
Can we train on pictures instead of text?
@engineerprompt · 2 months ago
Yes, you can fine-tune something like PaliGemma.
@loicbaconnier9150 · 4 months ago
Hello, impossible to generate GGUF, compilation problem… Did you try it?
@metanulski · 4 months ago
Regarding the save options: do I have to delete the parts that I don't want, or how does this work?
@engineerprompt · 4 months ago
You can just comment out those parts: put # in front of the lines you don't need.
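As an illustration (method names as in Unsloth's notebook; which lines to keep is up to you), commenting out the unwanted save formats looks like this:

# Keep only the export you actually want; comment out the rest.
model.save_pretrained_gguf("model", tokenizer, quantization_method="q4_k_m")
# model.save_pretrained_gguf("model", tokenizer, quantization_method="f16")
# model.push_to_hub_gguf("your-username/model", tokenizer, quantization_method="q4_k_m")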
@user-vt1qs1ge7m · 3 months ago
Can you make a video on how to pass a test CSV to the fine-tuned model and get a response column?
@ashwinsveta · 4 months ago
We fine
@user-lz8wv7rp1o · 4 months ago
great
@metanulski · 4 months ago
So 60 steps is too low. But what is a good number of steps?
@engineerprompt · 4 months ago
Usually you want to set epochs to 1 or 2.
@metanulski · 4 months ago
@@engineerprompt So 60 to 120 steps max, since one epoch is 60 steps?
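For reference, a minimal sketch of switching the notebook's trainer from a fixed step count to whole epochs (values are illustrative; TrainingArguments comes from transformers and is what TRL's SFTTrainer uses):

from transformers import TrainingArguments

args = TrainingArguments(
    per_device_train_batch_size=2,
    gradient_accumulation_steps=4,
    num_train_epochs=1,  # one full pass over the dataset, as suggested above
    # max_steps=60,      # the demo's fixed cap; drop it when training by epochs
    learning_rate=2e-4,
    output_dir="outputs",
)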
@dogsmartsmart · 4 months ago
Thank you! But can a Mac M3 Max use MLX to fine-tune?
@engineerprompt · 4 months ago
Yes
@CharlesOkwuagwu · 4 months ago
Hi, what if we have already downloaded a GGUF file? How do we apply that locally?
@engineerprompt · 4 months ago
I am not sure if you can do that. Will need to do further research on it.
@anantkabra6825 · 3 months ago
Has anybody tried pushing to Hugging Face? I need help with that part; please reply to this message in case you have.
@engineerprompt · 3 months ago
When you create an API key, make sure to enable the write permission on that key; otherwise it won't upload the model.
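A minimal sketch of the upload step, assuming a token created with write permission at huggingface.co/settings/tokens (the repo name is hypothetical):

# Push the fine-tuned model and tokenizer to a Hub repo you own.
model.push_to_hub("your-username/llama3-finetuned", token="hf_...")
tokenizer.push_to_hub("your-username/llama3-finetuned", token="hf_...")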
@tamim8540 · 4 months ago
Hello, can I fine-tune it using the Colab free version?
@engineerprompt · 4 months ago
This is using the free version
@Matlockization · 3 months ago
It's a free Zuckerberg AI... that makes me wonder. And you have to agree to hand over contact info, and what else, I wonder?
@jackdorsey3504 · 3 months ago
Sir, we cannot open the Colab website...
@jackdorsey3504 · 3 months ago
Already solved...
@HoneIrimana · 4 months ago
They messed up releasing Llama 3, because it believes it is sentient.
@skeiriyalance7274 · 3 months ago
How can I use my CSV as a dataset? I'm new.
@asadurrehman3591 · 4 months ago
Can I fine-tune using the Colab free GPU?
@engineerprompt · 4 months ago
Yes, this uses the free Colab.
@asadurrehman3591 · 4 months ago
@@engineerprompt love you broooo
@Qual_ · 2 months ago
It's one of the most useless videos on YouTube. You literally opened a notebook and read it. You didn't add a single sentence that could be considered added value. It was like watching a text-to-speech model in action.
@user-hn7cq5kk5y · 3 months ago
Don't share trash
@piffdaddy420 · 3 months ago
You really should just make videos in your own language, because who the fk can even understand what you are saying?
@SpicyMelonYT · 2 months ago
Every single word was understandable... I don't even have the ability to comprehend how you managed to make that dumb claim.