RAG from Scratch without any Frameworks 

Prompt Engineering
170K subscribers
21K views

Published: 17 Sep 2024

Comments: 60
@nmstoker 2 months ago
Brilliantly explained with clarity and insight, thank you! Also really pleased you point out that RAG emerged from IR ideas and wasn't brand new: when I saw it I was like, haven't people seen Facebook's DrQA from 2017?!? And even that wasn't out of the blue; there's a long-established history with IR 👍
@engineerprompt 2 months ago
Thank you. I agree: in most cases we are reinventing the wheel and giving old approaches new names. Interestingly enough, a simple keyword-based search (BM25) will still outperform dense embeddings in most cases!
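For readers who have not met BM25, the keyword ranking mentioned in the reply above can be sketched in plain Python. This is a minimal illustration and not code from the video: the function name `bm25_scores`, the naive whitespace tokenization, and the defaults `k1=1.5` and `b=0.75` are assumptions chosen for the example.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document in `docs` against `query` with Okapi BM25.

    Tokenization is a lowercase whitespace split (an assumption made
    for brevity; real systems use a proper tokenizer).
    """
    tokenized = [d.lower().split() for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    # Document frequency: how many documents contain each term.
    df = Counter(term for t in tokenized for term in set(t))

    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            # Smoothed IDF, which stays positive for every term.
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1)
            # Term frequency saturates (k1) and is length-normalized (b).
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

docs = [
    "retrieval augmented generation combines search with a language model",
    "the cat sat on the mat",
    "keyword search with bm25 is a strong retrieval baseline",
]
scores = bm25_scores("keyword retrieval", docs)
best = max(range(len(docs)), key=scores.__getitem__)
print(best, [round(s, 3) for s in scores])
```

The third document ranks first because it matches both query terms, while the second matches none and scores zero. No embedding model is involved, which is why this kind of baseline is cheap to try before reaching for dense vectors.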
@michaelponce5965 3 months ago
This is exactly what I've been trying to find for the last couple of days. Simple instructions on how to do this with pure Python and a local LLM. Thank you!
@antonioalvarez3246 2 months ago
x2! thanks @prompt engineering!
@vitalis 3 months ago
Problem with RAG solutions is they don’t hold up with bigger amounts of unstructured data. I wish there was a solution that includes long term memory for chat agents so that they get smarter about your context as you chat with them
@engineerprompt 3 months ago
Google released context caching for their long context models. This could be a solution
@Kishorekkube 2 months ago
@engineerprompt Is there a way to save and load the vector store that you made here, sir?
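One simple way to persist such a store is sketched below. It assumes the store is just a list of text chunks plus one embedding vector per chunk; the video's exact data structure is not shown in this thread, so `save_store`, `load_store`, and the file name are hypothetical.

```python
import json
import os
import tempfile

def save_store(path, chunks, embeddings):
    """Write chunks and their embedding vectors to a JSON file."""
    store = {
        "chunks": list(chunks),
        # Convert each vector to plain floats so NumPy arrays also serialize.
        "embeddings": [[float(x) for x in vec] for vec in embeddings],
    }
    with open(path, "w", encoding="utf-8") as f:
        json.dump(store, f)

def load_store(path):
    """Return (chunks, embeddings) previously written by save_store."""
    with open(path, encoding="utf-8") as f:
        store = json.load(f)
    return store["chunks"], store["embeddings"]

# Toy data standing in for real chunks and embeddings.
chunks = ["RAG retrieves relevant text.", "The LLM then answers from it."]
embeddings = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]

path = os.path.join(tempfile.mkdtemp(), "vector_store.json")
save_store(path, chunks, embeddings)
loaded_chunks, loaded_embeddings = load_store(path)
print(len(loaded_chunks), len(loaded_embeddings[0]))
```

JSON keeps the example dependency-free and human-readable. For a large corpus, a binary format would likely be more compact and faster to load.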
@tollington9414 1 month ago
The graph RAG solution may work better for large amounts of unstructured data.
@madbike71 2 months ago
Excellent and concise description. Thank you.
@CreativeEngineering_ 2 months ago
I just got done implementing an almost identical setup. Used SQLite and fastBart, all in C#. It's amazing.
@austinzobel4613 13 days ago
Nice, I've been wanting to start using C# for RAG... Any tips or guidance for a newbie? I was using KoboldCPP's web UI for LLM generation... but have NO idea where to go. None of these videos even hint at anything with C#... let alone Kobold.
@LEANSCH96 2 months ago
Can this also be implemented with a local model through Ollama?
@nguyentran7068 13 days ago
Of course, there is no restriction.
@RūtenisRaila 12 days ago
great work! very well explained
@gkhan753 1 month ago
As a newbie, I'm hooked on this channel. I'm about to take your RAG course. The issue I have is that every time I've tried to use LangChain, I get crazy errors about upgrades and incompatibilities with Python versions. How do you address this issue? It's frustrating to resolve, if it can be resolved at all.
@engineerprompt 1 month ago
My recommendation is to stick to one version of LangChain and not use the latest version. You can pin it in requirements.txt. You don't need the latest version in most cases. For Python, use 3.10. Hope this helps.
@nshettys 2 months ago
Brilliant! Thanks for this one
@Francotujk 3 months ago
Hello! I have a question. Is the similarity search a way to reduce the number of tokens sent to the OpenAI API? So basically, when you query the LLM, you are not sending the entire text of the Wikipedia page? I ask because of token costs, to know exactly what OpenAI will charge us. Your content is probably the best on YouTube! I really appreciate all your videos.
@luizemanoel2588 3 months ago
Probably. He used a Wiki page, but you may have a 1,000-page PDF that would cost a lot to process, and most of it may be irrelevant to what you want. When you break the text into chunks and then take the 'n' most relevant chunks, you get what you want faster and cheaper.
@luizemanoel2588 3 months ago
And if you run an AI locally, the more info you use, the slower it will be. So it can make a not-so-powerful PC do the job too.
@engineerprompt 3 months ago
Yes, there are two parts, as mentioned by @luizemanoel. First, the document can contain a lot of irrelevant info. You only want to give the LLM what is relevant to the query. This will improve the responses. The added benefit is fewer tokens, which means lower cost as well.
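The idea in the replies above (embed the chunks, keep only the most similar ones, send just those to the LLM) can be sketched as follows. This is an illustration rather than the notebook's code: the toy 3-dimensional vectors stand in for real embeddings, and `cosine`, `top_k_chunks`, and `build_prompt` are hypothetical helper names.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def top_k_chunks(query_vec, chunk_vecs, chunks, k=2):
    """Return the k chunks whose vectors are most similar to the query."""
    ranked = sorted(
        range(len(chunks)),
        key=lambda i: cosine(query_vec, chunk_vecs[i]),
        reverse=True,
    )
    return [chunks[i] for i in ranked[:k]]

def build_prompt(question, context_chunks):
    """Only the selected chunks reach the LLM, not the whole document."""
    context = "\n".join(context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

# Toy vectors standing in for real embeddings (assumption).
chunks = [
    "Paris is the capital of France.",
    "Bananas are rich in potassium.",
    "France borders Spain and Italy.",
]
chunk_vecs = [[0.9, 0.1, 0.0], [0.0, 0.2, 0.9], [0.8, 0.3, 0.1]]
query_vec = [1.0, 0.0, 0.0]

selected = top_k_chunks(query_vec, chunk_vecs, chunks, k=2)
prompt = build_prompt("What is the capital of France?", selected)
print(prompt)
```

With `k=2`, the unrelated chunk about bananas never enters the prompt, so the token count sent to the API depends on `k` and the chunk size rather than on the length of the source document.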
@Francotujk 3 months ago
@engineerprompt @luizemanoel2588 OK, thanks to both!
@bastabey2652 3 months ago
I never liked RAG frameworks... Thanks for the useful content.
@ujjwalsrivastava6248 2 months ago
Hello sir! I want to build a question-answering chatbot in Python that answers from a provided knowledge base in PDF or text format. I've been working on this for the last 10 days but haven't managed it so far! Can you please guide me through this project, sir?
@vaishnokmr 3 months ago
Yes! I did the same a year ago during my research. It works.
@ignaciopincheira23 2 months ago
Hi, could you convert complex PDF documents (with graphics and tables) into an easily readable text format, such as Markdown? The input file would be a PDF and the output file would be a text file (.txt).
@engineerprompt 2 months ago
Yes, check out this video: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-mdLBr9IMmgI.html
@MoFields 3 months ago
What are the best ways of importing documents into the RAG system from corporate systems, such as Google Docs, Confluence, or Notion, without asking your IT department? I have done a few things by hand, for example with scraping tools and Chrome extensions, but that is very labour-intensive. Is there something a bit more streamlined?
@MoFields 3 months ago
Also, how do you add indexing, link-backs, and more nuanced chunking mechanisms (aware of context and type of info)?
@engineerprompt 3 months ago
You are looking for data connectors in this case. Each of these services will have its own API, or you can use the data loaders from LangChain (python.langchain.com/v0.2/docs/integrations/document_loaders/). This is one aspect where I would recommend using a framework.
@MrJekyllDrHyde1 2 months ago
Great job. I'll try to make this work with free/open-source AI models. I also want to see if this will work with a bigger corpus.
@engineerprompt 2 months ago
It should work with open models. For a bigger corpus, you will need to think about retrieval latency. You might want to look into quantized embeddings in that case.
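One form of embedding quantization is binary quantization: keep only the sign of each dimension and compare vectors by Hamming distance. The reply above does not name a specific scheme, so the sketch below is an illustration under that assumption, with hypothetical function names and toy 4-dimensional vectors.

```python
def binarize(vec):
    """Pack the sign bits of a float vector into a single Python int."""
    bits = 0
    for x in vec:
        bits = (bits << 1) | (1 if x > 0 else 0)
    return bits

def hamming(a, b):
    """Number of bit positions in which two codes differ."""
    return bin(a ^ b).count("1")

def nearest(query_vec, codes):
    """Index of the stored code closest to the query in Hamming distance."""
    q = binarize(query_vec)
    return min(range(len(codes)), key=lambda i: hamming(q, codes[i]))

# Toy vectors standing in for real embeddings (assumption).
vectors = [
    [0.3, -0.2, 0.8, -0.5],
    [-0.7, 0.4, -0.1, 0.9],
    [0.2, -0.6, 0.5, 0.1],
]
codes = [binarize(v) for v in vectors]  # one small int per vector

match = nearest([0.25, -0.1, 0.6, -0.4], codes)
print(match, [format(c, "04b") for c in codes])
```

Each dimension shrinks from a 32-bit float to one bit, and the comparison becomes an XOR plus a bit count, which is where the latency saving comes from. The trade-off is lower precision, so a common pattern is to shortlist candidates with the binary codes and re-rank them with the full-precision vectors.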
@aryandhakal3158 3 months ago
Could you please make a video on a chatbot that can interact with PDF files and answer questions, using recent tech? I'm having the most difficulty with outdated tutorials. It would be a great help!
@Connor51440 3 months ago
Great video, nice style and easy to listen to, subscribed 👍🏼
@nekososu 3 months ago
Can you also show how to produce structured output?
@prathameshmandavkar7591 3 months ago
Great work 👍🏻 Thanks
@antonioalvarez3246 2 months ago
great work! thanks!
@leomeza9396 2 months ago
Thank you so much!
@rabeemohammed5351 2 months ago
Is the Arabic language supported or not?
@TheCopernicus1 3 months ago
Legend!
@user-sd3qe7qu9c 2 months ago
500 likes, keep it up!
@drp111 2 months ago
Thanks for the video! However, RAG never convinced me. I'm looking for fine-tuning in 10 lines of code.
@Salionca 3 months ago
Great! Thanks!
@themax2go 2 months ago
... yes, you can do it that way - but, you lose functionality in terms of accuracy of relevance between topics
@geekyprogrammer4831 1 month ago
Your course is too expensive
@MeinDeutschkurs 3 months ago
No frameworks, but please install RAGatouille? WTF!
@Yocoda24 3 months ago
Are you also mad he used NumPy? Hahahahah wtf
Framework: a collection of libraries to build applications
Library: a tool to leverage functionality
@MeinDeutschkurs 3 months ago
@Yocoda24, well: if the claim is pure Python, no frameworks, yes. WTF.
@Yocoda24 3 months ago
@MeinDeutschkurs Not sure where you're pulling "pure Python" from? Can you give me a timestamp for when it is said in the video?
@MeinDeutschkurs 3 months ago
@Yocoda24 Read the video title: "RAG from Scratch in 10 lines Python - No Frameworks Needed!"
@Yocoda24 3 months ago
@MeinDeutschkurs Oh okay, so it doesn't say pure Python, and he doesn't use any frameworks. Glad we could come to an understanding.
@crazytrain86 3 months ago
"10 lines" 🤣
@uwegenosdude 2 months ago
Thanks for this great video. I tried to run your Jupyter notebook. When calling the line "from google.colab import userdata", I get the error: ModuleNotFoundError: No module named 'google'. And somewhere I see "pkg_resources is deprecated as an API". Is Python 3.12.3 too new? OK, I replaced the Google part. There are other ways to create an OpenAI client! Now it works!
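The comment does not show the replacement that was used. One possible approach, sketched here as an assumption, is to try Colab's secret store first and fall back to an environment variable when the notebook runs outside Colab. The helper name `get_openai_api_key` and the variable name `OPENAI_API_KEY` are choices made for this example.

```python
import os

def get_openai_api_key(name="OPENAI_API_KEY"):
    """Return the key from Colab secrets if available, else from the environment."""
    try:
        # This module only exists inside Google Colab.
        from google.colab import userdata
    except ImportError:  # also covers ModuleNotFoundError
        return os.environ.get(name)
    return userdata.get(name)

api_key = get_openai_api_key()
print("key found" if api_key else "key missing")

# With the openai package (version 1.x) installed, the client could then
# be created like this:
# from openai import OpenAI
# client = OpenAI(api_key=api_key)
```

Outside Colab this returns `None` when the variable is unset, so the caller can fail with a clear message instead of a `ModuleNotFoundError` at import time.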
@lesteroliver911 3 months ago
Thank you