
Chunk large complex PDFs to summarize using LLM 

Rajib Deb

In this video, I talk about a technique for context-aware chunking of large PDFs and then summarizing the content with a map-reduce framework (implemented through LangChain).
References:
arxiv.org/pdf/2307.03172.pdf
developer.adobe.com/document-...
python.langchain.com/docs/use...
smith.langchain.com/hub/
Code: github.com/rajib76/langchain_...
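
The two ideas named in the description, context-aware chunking and map-reduce summarization, can be sketched roughly as follows. This is a minimal illustration in plain Python, not the code from the video: the `(kind, text)` element shape, the function names `chunk_by_heading` and `map_reduce_summarize`, and the `max_chars` limit are all assumptions made for this example, and `summarize` is a placeholder for whatever LLM call you use.

```python
from typing import Callable, Iterable, List, Tuple

# Hypothetical input shape: (kind, text) pairs in reading order, where kind is
# "heading" or "paragraph". A real PDF extractor's output will look different.
Element = Tuple[str, str]


def chunk_by_heading(elements: Iterable[Element], max_chars: int = 2000) -> List[str]:
    """Group paragraphs under their nearest preceding heading.

    A section whose paragraphs exceed max_chars is split into several chunks,
    and the heading is repeated at the top of each one so that every chunk
    carries its own context.
    """
    chunks: List[str] = []
    heading = ""
    body: List[str] = []

    def flush() -> None:
        # Emit the paragraphs collected so far, prefixed with the current heading.
        if body:
            prefix = heading + "\n" if heading else ""
            chunks.append(prefix + "\n".join(body))
            body.clear()

    for kind, text in elements:
        if kind == "heading":
            flush()          # close the previous section before switching heading
            heading = text
        else:
            collected = sum(len(p) + 1 for p in body)
            if body and collected + len(text) > max_chars:
                flush()      # section too long: start a new chunk, same heading
            body.append(text)
    flush()
    return chunks


def map_reduce_summarize(chunks: List[str], summarize: Callable[[str], str]) -> str:
    """Summarize each chunk (map), then summarize the joined summaries (reduce)."""
    partials = [summarize(chunk) for chunk in chunks]
    if not partials:
        return ""
    if len(partials) == 1:
        return partials[0]
    return summarize("\n".join(partials))
```

To try it, pass any function that takes a string and returns a shorter string as `summarize`. In practice that function would wrap an LLM call, and a very long document may need more than one reduce pass if the joined partial summaries still exceed the model's context window.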

Published: 29 Sep 2023

Comments: 17
@fabsync, 4 months ago
oh man.. thanks for your videos! They are precious gold! I love the way you think and teach!
@user-on1qo8un2t, 2 months ago
Man, great explanation. I'm constantly visiting your channel for great new tutorials :)
@ianmatthews8342, 4 months ago
Excellent, nice idea and very well explained! Thanks!
@aarshmehtani5468, 29 days ago
Thanks, Sir. I am working on a project based on this and was finding it difficult to find material that explains the concept practically. After watching this video, I understood it, implemented it successfully, and moved ahead. Thanks again for this video, Sir.
@virunew, 4 months ago
nicely explained!
@user-ll6mn8ky8d, 5 months ago
you'll be famous soon
@timtensor6994, 6 months ago
Maybe for images: how do GPT-4 multimodal models work?
@bilalzahoor5608, 4 months ago
Hi Rajib, thanks for making this video. It has been really helpful as I try to build a RAG system for a B2B use case. However, I tried setting up the Adobe API, and I must say it is not easy: I am getting stuck at various steps and cannot get a 201 response code. Can you please share the steps you followed to set up the API? Regards, Bilal
@elephant742, 8 months ago
Hi Rajib, Really insightful video. Especially the Extract API for the context-aware extraction of text from PDF. Are you aware of any open-source alternatives for the Extract API? Regards, Dev
@rajibdeb4059, 8 months ago
:) I was also looking for one. Tabula and Camelot did not work for me. It looks like enterprise-grade solutions come with a price.
@MadhanAnbalagan-ff5qt, 6 months ago
The closest open-source option I found that works is unstructuredIO.
@rajibdeb4059, 6 months ago
@MadhanAnbalagan-ff5qt That also did not work for me; in fact, I have shared the results with Unstructured.
@user-ll6mn8ky8d, 5 months ago
@rajibdeb4059 If you had to pick an open-source alternative for this, what would you choose?
@loknathbasak1533, 5 months ago
@MadhanAnbalagan-ff5qt Did you find an okay-ish open-source one?
@PrabakaranSPpraba984, 8 months ago
Hi, could you please share your LinkedIn profile? I am doing the same PoC and need some clarification.
@vikasrajpurohit8730, 4 months ago
Thank you. Can you share your LinkedIn handle?