No video :(

Choosing the right Chunk Size for RAG

Подписаться 30 тыс.

Просмотров 6 тыс.

50% 1

Retrieval Augmented Generation is the technique used to ask your documents questions. There are a lot of variables to consider with RAG and chunk size is just one of them. Learn more about it here.
The code for this is on github.com/tec...
Be sure to sign up to my monthly newsletter at technovangelis...
And if interested in supporting me, sign up for my patreon at / technovangelist

Опубликовано:

29 авг 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 22

@mshonle 5 месяцев назад

I’d be interested in hearing more about free and local vector stores. I’d also like to hear more about different similarity measures.

@FunwithBlender 5 месяцев назад

qdrant is a good free on disk solution compared to in memory

@iham1313 5 месяцев назад

About vectorstores: an overview of those floating around would nice for starting this topic. Chroma, redis, postgre, … what are the main differences and benefits of choosing one over the other. I really like, that you want to stay with local oss setups!

@philippechassany7279 5 месяцев назад

Big issues when it comes to unstructured info i.e. a pdf with boxes, tables and so on. Then chunk strategy is not adapted.

@gaboceron100 3 месяца назад

You will have to extract first the text from those unstructured data, with OCR for example.

@ErikTaraldsen 5 месяцев назад

My primary use case for vector or RAG would be to get better coding assistance. Use case would be the closed source code libraries at work. Questions like "how do I fetch customer data from X system?", "show a example of batch job on Y customer type". The legacy codebase spans decades, has different levels of documentation, and often the original author is no longer working at my company any more.

@carlosmosquera8246 3 месяца назад

Will be nice try with vector stores and different versions of llama 3

@business24_ai 5 месяцев назад

Great Video. Maybe Semantic or Agentic Splitting can improve the RAG results.

@VastCNC 5 месяцев назад

PgVector vote here.

@fearnworks 5 месяцев назад

echoing this

@CaptZenPetabyte 2 месяца назад

So do you use agents, or do you generate RAG content; or can you just feed the documents into the LLM and instruct it to only reference the provided documents. Ive gotten very good responses from an LLM with the last mode and I didnt need to learn python to do it

@technovangelist 2 месяца назад

Um, yes.

@solidUntilLiquidBeforeGas 5 месяцев назад

Very interesting to watch and a lot to learn! Thanks, Matt. Comment: Is it not possible to make the evaluation of the outputs a bit more quantitative than qualitative? For e.g., can I spot the five things in the output, or mention of the 3 critical facts, etc. Of course this will mean we'd need to have a better set of RAG input as well as expected output. What are your thoughts?

@technovangelist 5 месяцев назад

Hmmm tell me more. Not sure I understand

@AndrewPeebles 4 месяца назад

This "top answers" script you reference ... I'd be interested in that script, if it is open source. I would like to evaluate some variables like chunk size, model, re-rank top-k, etc using this "top answers" technique.