Тёмный

Retrieval Augmented Generation with Python+Ollama+Phi3+ChromaDB | How to RAG with a local model 

DevXplaining
Подписаться 4,2 тыс.
Просмотров 848
50% 1

Hi and welcome to DevXplaining channel! Todays I've got a long-form video of a Retrieval Augmented Generation (RAG) using Ollama, ChromaDB, and a little bit of Python. We'll be using RSS feed as our source, and create a program that's able to query that source and use generative AI model to answer questions. Cool thing in todays video is that it's all open source, it's all local model, so this is something you can run on your own computer.
I wanted to take my time and write the code line by line, explaining as we go, so there was no way to do a short video - this is going to be rather long one. But I think this topic and knowledge is a must-have for any developer in 2024 - and I hope this will be clear and useful for you too.
As always, code and references in the video can be found from links below. Happy coding!
And remember to like, subscribe, comment and feedback as you see fit, you know the drill by now :)
Timecodes:
0:00 - Blah blah blah
1:35 - Explaining the setup
4:00 - Prerequisites & How to Hello World with a local model
5:13 - Add necessary libraries
7:02 - Grab the NYT RSS feed with SpeedParser
14:00 - Put the data in ChromaDB Vector database
20:22 - Similarity search into vector database
25:50 - Use Ollama and Phi3 local model to ask questions about the data
36:40 - The Good Stuff
Links:
- github.com/crystoll/ollama-rag
- github.com/ollama/ollama
- github.com/chroma-core/chroma
- github.com/ronikeuru/local-ra...

Опубликовано:

 

1 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 10   
@JoanApita
@JoanApita 28 дней назад
Thank you for sharing this video. I learned a lot today. Please keep them coming. Thank you
@DevXplaining
@DevXplaining 27 дней назад
Thank you for your feedback! Much appreciated!
@sugaith
@sugaith 16 дней назад
yes the best coding videos are very long and we are able to see the person suffering when coding as it is in the reality.
@DevXplaining
@DevXplaining 16 дней назад
Hahaa yeah, I can relate to that :)
@leonardosouzaconradodesant6213
@leonardosouzaconradodesant6213 27 дней назад
Great thank you. And by the way, I'd like that video using chromaDB as an application node running in background. See you!
@DevXplaining
@DevXplaining 27 дней назад
Thank you!
@rasmustoivanen2709
@rasmustoivanen2709 Месяц назад
Great video! Ideas for future videos: Try out different embedding models (I think the ChromaDB default is all-MiniLM-L6-v2) Set up a separate embedding database. Maybe something more familiar with developers like Postgres + pgvector + hnsw indexing (Important to trial out different index building parameters)
@DevXplaining
@DevXplaining Месяц назад
Thank you! Very good ideas!
@cesarhernandezcinfo
@cesarhernandezcinfo Месяц назад
Great video, very practical information. Thanks
@DevXplaining
@DevXplaining Месяц назад
Thank you! Much appreciated! More to follow on the topic. Stay tuned. :)
Далее
Intro to RAG for AI (Retrieval Augmented Generation)
14:31
Вопрос Ребром - Субо
49:41
Просмотров 1,6 млн
Ne jamais regarder une fille à la plage 😂
00:10
Просмотров 766 тыс.
OpenAI Embeddings and Vector Databases Crash Course
18:41
Graph RAG with Ollama - Save $$$ with Local LLMs
12:09
RAG from the Ground Up with Python and Ollama
15:32
Просмотров 27 тыс.
Python RAG Tutorial (with Local LLMs): AI For Your PDFs
21:33
Why Does Diffusion Work Better than Auto-Regression?
20:18