Тёмный

Create Training Data for Finetuning LLMs 

APC Mastery Path
Подписаться 194
Просмотров 586
50% 1

🚀 Mastering LLM Fine-Tuning: From PDFs to JSONL Files🚀
Welcome to APC Mastery Path! In this comprehensive tutorial, we dive deep into the process of creating training data for fine-tuning Large Language Models (LLMs). We'll guide you through extracting text data from PDFs using the powerful `marker-pdf` Python library, cleansing the resulting markdown, and converting it into a JSONL format ready for LLMs.
🔔Agenda:
00:08 Intro
00:59 Part 1: Main concept of the solution
01:50 Part 2: Marker PDF Package Overview & Installation
05:46 Part 2-1: Single File Conversion
08:29 Part 2-2: Multiple File Conversion & Conversion to JSONL format
15:03 Part 3: Finetuning LLMs using extracted data
22:01 Outro
At APC Mastery Path, we offer bespoke mentoring and teaching packages to RICS APC candidates. Enhance your APC journey with our expert guidance and tailored support.
Don’t forget to subscribe, like, and share! Let’s embark on this LLM fine-tuning journey together! 🚀✨
🔗 General Links & Resources:
⚫Our Website: www.apcmasterypath.co.uk
⚫All APC Mastery Path Blogposts: www.apcmasterypath.co.uk/blog...
⚫Personal Linkedin Page: / mohamed-ashour-0727
⚫APC Mastery Path Linkedin Page: / apc-mastery-path
📽️Useful videos:
⚫Finetune your LLMs on custom datasets using Unsloth: • Finetune Your LLM on C...
⚫Deploy Open WebUI with Zero Coding Skills : • Unlocking Local AI: De...
📝Prerequisites & Dependencies:
⚫Nvidia Cuda Toolkit v 12.1: developer.nvidia.com/cuda-12-...
⚫Windows subsystem for Linux : learn.microsoft.com/en-us/win...
⚫Anaconda for Linux: repo.anaconda.com/archive/Ana...
⚫ Pytorch: pytorch.org/
⚫Ollama : www.ollama.com/download
⚫Docker: desktop.docker.com/win/main/a...
⚫Open WebUI on Github: github.com/open-webui/open-webui
📚Github & Huggingface repositories:
⚫Unsloth available LLMs: huggingface.co/unsloth
⚫Marker PDF on GitHub: github.com/VikParuchuri/marker
⚫Unsloth GitHub Repository: github.com/unslothai/unsloth?...
#LLM #MachineLearning #DataScience #AI #Python #PDFConversion #JSONL #MarkerPDF #FineTuning #APCMasteryPath #RICSAPC #Mentoring #Education #TechTutorials

Наука

Опубликовано:

 

1 июл 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 8   
@mufeedco
@mufeedco 7 дней назад
Thank you. Very informative.
@APCMasteryPath
@APCMasteryPath 6 дней назад
@@mufeedco Glad you liked it.
@anasrachmadi9603
@anasrachmadi9603 Месяц назад
This What i need, cant wait your next video
@APCMasteryPath
@APCMasteryPath 29 дней назад
Thanks for your comment. Working on other stuff as well. Hoping to share them soon. Stay tuned.
@anasrachmadi9603
@anasrachmadi9603 27 дней назад
@@APCMasteryPath hey can i get the source code to convert marker into question answer json?
@raoufkamal5748
@raoufkamal5748 Месяц назад
👍
@APCMasteryPath
@APCMasteryPath Месяц назад
A million thanks for your outrageous support.
@Blooper1980
@Blooper1980 17 дней назад
Awesome, but wow.. move away from you mic!!!
Далее
Python RAG Tutorial (with Local LLMs): AI For Your PDFs
21:33
СМОТРИМ YOUTUBE В МАЙНКРАФТЕ
00:34
Просмотров 720 тыс.
ТЫ С ДРУГОМ В ДЕТСТВЕ😂#shorts
01:00
Why Fine Tuning is Dead w/Emmanuel Ameisen
50:07
Просмотров 29 тыс.
The moment we stopped understanding AI [AlexNet]
17:38
Просмотров 852 тыс.
Three Best AI tools for Data Analysis
15:39
Просмотров 34 тыс.