Тёмный
No video :(

Choosing the right Chunk Size for RAG 

Matt Williams
Подписаться 30 тыс.
Просмотров 6 тыс.
50% 1

Retrieval Augmented Generation is the technique used to ask your documents questions. There are a lot of variables to consider with RAG and chunk size is just one of them. Learn more about it here.
The code for this is on github.com/tec...
Be sure to sign up to my monthly newsletter at technovangelis...
And if interested in supporting me, sign up for my patreon at / technovangelist

Опубликовано:

 

29 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 22   
@mshonle
@mshonle 5 месяцев назад
I’d be interested in hearing more about free and local vector stores. I’d also like to hear more about different similarity measures.
@FunwithBlender
@FunwithBlender 5 месяцев назад
qdrant is a good free on disk solution compared to in memory
@iham1313
@iham1313 5 месяцев назад
About vectorstores: an overview of those floating around would nice for starting this topic. Chroma, redis, postgre, … what are the main differences and benefits of choosing one over the other. I really like, that you want to stay with local oss setups!
@philippechassany7279
@philippechassany7279 5 месяцев назад
Big issues when it comes to unstructured info i.e. a pdf with boxes, tables and so on. Then chunk strategy is not adapted.
@gaboceron100
@gaboceron100 3 месяца назад
You will have to extract first the text from those unstructured data, with OCR for example.
@ErikTaraldsen
@ErikTaraldsen 5 месяцев назад
My primary use case for vector or RAG would be to get better coding assistance. Use case would be the closed source code libraries at work. Questions like "how do I fetch customer data from X system?", "show a example of batch job on Y customer type". The legacy codebase spans decades, has different levels of documentation, and often the original author is no longer working at my company any more.
@carlosmosquera8246
@carlosmosquera8246 3 месяца назад
Will be nice try with vector stores and different versions of llama 3
@business24_ai
@business24_ai 5 месяцев назад
Great Video. Maybe Semantic or Agentic Splitting can improve the RAG results.
@VastCNC
@VastCNC 5 месяцев назад
PgVector vote here.
@fearnworks
@fearnworks 5 месяцев назад
echoing this
@CaptZenPetabyte
@CaptZenPetabyte 2 месяца назад
So do you use agents, or do you generate RAG content; or can you just feed the documents into the LLM and instruct it to only reference the provided documents. Ive gotten very good responses from an LLM with the last mode and I didnt need to learn python to do it
@technovangelist
@technovangelist 2 месяца назад
Um, yes.
@solidUntilLiquidBeforeGas
@solidUntilLiquidBeforeGas 5 месяцев назад
Very interesting to watch and a lot to learn! Thanks, Matt. Comment: Is it not possible to make the evaluation of the outputs a bit more quantitative than qualitative? For e.g., can I spot the five things in the output, or mention of the 3 critical facts, etc. Of course this will mean we'd need to have a better set of RAG input as well as expected output. What are your thoughts?
@technovangelist
@technovangelist 5 месяцев назад
Hmmm tell me more. Not sure I understand
@AndrewPeebles
@AndrewPeebles 4 месяца назад
This "top answers" script you reference ... I'd be interested in that script, if it is open source. I would like to evaluate some variables like chunk size, model, re-rank top-k, etc using this "top answers" technique.
@user-he8qc4mr4i
@user-he8qc4mr4i 4 месяца назад
I found it quite tricky to setup a reliable RAG! Pluse dealing with PDFs is a whole different animal :-/
@technovangelist
@technovangelist 4 месяца назад
this stuff is all so early.... hopefully soon this will all just be part of apps and we don't have to even know what RAG means.
@endo9000
@endo9000 5 месяцев назад
🍸
@emmanuelgoldstein3682
@emmanuelgoldstein3682 5 месяцев назад
That thumbnail kinda makes it look like an "I" instead of a "U". Just a warning 😅
@florentflote
@florentflote 5 месяцев назад
Далее
Whats the best Chunk Size for LLM Embeddings
10:46
Просмотров 11 тыс.
Optimize Your AI Models
11:43
Просмотров 8 тыс.
Fine Tune a model with MLX for Ollama
8:40
Просмотров 1,2 тыс.
Ollama 0.1.26 Makes Embedding 100x Better
8:17
Просмотров 45 тыс.
How AI 'Understands' Images (CLIP) - Computerphile
18:05
Semantic Chunking for RAG
29:56
Просмотров 22 тыс.
GraphRAG: The Most Incredible RAG Strategy Revealed
10:38
Easiest Tutorial to Learn DSPy with LLM Example
11:22
Просмотров 4,3 тыс.
LangGraph Simplified: Master Custom AI Agent Creation
43:51