🐱 GitHub repository: github.com/alejandro-ao/chat-with-websites 🔥 Join the LangChain Master Program (early access): link.alejandro-ao.com/langchain-mastery 💬 Ask your questions in our Discord Server (but please leave a comment here too for engagement): link.alejandro-ao.com/981ypA ❤ Buy me a coffee (thanks): link.alejandro-ao.com/YR8Fkw
When fine-tuning a model: it seems that it adjusts the model weights based on the input and expected output? Does chatting with the model during training affect its weights, or can only the trainer affect them? I expect this happens in memory? Based on positive and negative feedback in chat, couldn't we be talking to the model, teaching it, and adjusting its weights on the fly? Also (sorry): when using RAG-type systems, are the documents tokenized using the tokenizer from the model? (I know that locks the database down a bit.) Could we then consider the local RAG store as the working-memory system and the LLM as the long-term memory system? Should there be a bridge between the database and the trainer so it could periodically update the long-term memory and release the local RAG data, i.e. essentially training a LoRA to be applied (or merged)? The LLM would then accumulate a lot of LoRAs from each training interval, or not, if the strategy is a full merge.
can i just say you make this whole thing bearable because your voice delivery is on point. i won't go anywhere else, i'm gonna learn everything here. what a blessing
hey Alejandro, thanks for your video! This project is my first side project! I really appreciate your amazing work! If you are looking for an idea for the next tutorial, text-to-mindmap might be a good one!
hey there! i'm glad you enjoyed the project. that sounds like a fun idea, I will probably be doing something like that in the future. Maybe something using knowledge graphs?
Thank you so much for your amazing videos. This one in particular is outstanding as far as RAG is concerned. You are a real master of gen AI, and I find your videos tremendously helpful. Keep making them! Cheers from Saudi Arabia 😍😍😍💯
heyy, I have a doubt. You said that after we do similarity-based ranking with a vector DB, we get a few chunks (context) to answer our query, and then pass these contexts with the user query along with chat_history to an LLM. But if we pass chat_history, wouldn't that exceed the max token size of the LLM if the conversation went on too long?
hey there. great question. yes, totally. if the conversation is too long, then you can exceed the context window of your LLM. however, keep in mind that modern LLMs, such as GPT-4, Claude and especially Gemini 1.5 have gigantic context windows, so this might not be too much of a concern. also, consider that sending the entire conversation history is only one method for implementing memory in these systems. you can also send a summary of the conversation + the last 10 messages, for example. or produce a NER-based memory. i don't think there is an industry standard yet for implementing memory, though. so feel free to try out several methods.
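to make the "summary + last 10 messages" idea concrete, here's a tiny pure-python sketch (the names and the 4-chars-per-token ratio are just assumptions for illustration, not a real tokenizer):

```python
# Illustrative sketch: keep chat history under a token budget by sending a
# running summary plus the last N messages. The 4-chars-per-token ratio is a
# rough heuristic, not an exact tokenizer.

def build_history_payload(summary, messages, keep_last=10, token_budget=3000):
    """Return the history to send: summary + last `keep_last` messages,
    dropping the oldest kept messages if the rough token estimate is too high."""
    kept = messages[-keep_last:]

    def estimate_tokens(texts):
        return sum(len(t) for t in texts) // 4  # crude heuristic

    while kept and estimate_tokens([summary] + kept) > token_budget:
        kept = kept[1:]  # drop the oldest kept message first
    return [summary] + kept

history = build_history_payload(
    "Summary: user is building a RAG app over a blog.",
    [f"message {i}" for i in range(50)],
    keep_last=10,
)
```

in a real app you would regenerate the summary with an LLM call every few turns; this just shows the trimming logic.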
Thanks for the video, it's great stuff. I wonder if you could do these videos with the Gemini LLM; the model and embeddings are free as far as I can tell. I somehow made it work by watching your video, thanks man.
hey there, that's a great idea. good that you made that work. i was going to use them for a video a few weeks ago, but then realized that i needed to set up a vpn before recording. these models were not available in the eu when i checked a few weeks ago (great)... so i got lazy and went for openai. i will set up this vpn thing (or move out of europe) :P
Great video! Subscribed and liked. Just one part I could not understand: the stuff_documents_chain takes input and context. How is the context being passed from retriever_chain to stuff_documents_chain? Does LangChain just define that create_retrieval_chain can pass context from the first argument to the second?
hey there! welcome to the club! yeah, that's exactly what is happening. we are using a prebuilt chain that calls `.invoke({"context": [...]})` by itself, without us having to pass the variable. if you look at the prebuilt chain's source code, you will see that it calls the invoke method inside the runnable. i will make a future video on creating chains ourselves so that this is easier to understand!
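conceptually, here's a simplified pure-python sketch of what that prebuilt chain does internally (this is NOT LangChain's actual source, and all the names here are made up just to show the mechanism):

```python
# Simplified illustration of how a retrieval chain injects "context":
# the retriever runs first, its output is assigned to the "context" key,
# and the enriched dict is then passed to the documents chain.

def fake_retriever(inputs):
    # pretend we did a vector similarity search on inputs["input"]
    return ["doc chunk 1 about LangChain", "doc chunk 2 about retrieval"]

def fake_stuff_documents_chain(inputs):
    # receives both the user input AND the retrieved context
    context = "\n".join(inputs["context"])
    return f"Answer to '{inputs['input']}' using:\n{context}"

def create_retrieval_chain_sketch(retriever, documents_chain):
    def invoke(inputs):
        docs = retriever(inputs)                # step 1: retrieve
        enriched = {**inputs, "context": docs}  # step 2: inject "context"
        return documents_chain(enriched)        # step 3: generate the answer
    return invoke

chain = create_retrieval_chain_sketch(fake_retriever, fake_stuff_documents_chain)
result = chain({"input": "what is retrieval?"})
```

so the "magic" is just the wrapper chain building the dict for you before invoking the second chain.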
Hey there. This is not necessarily deployable to HuggingFace, as HF is a place to host models. Since we are not creating our own models here, that would not work. We are building an app, however. And you can deploy it to the web! If you want to see how to do that for free, you can watch this video: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-74c3KaAXPvk.html
What a great tutorial!!! Really enjoyed it!! New subscriber here!! Only one question... just for testing purposes, I used the chat to ask if it could answer topics outside the context of the websites I was chatting with, and it did. It even wrote code for me. Is there a way to restrict the app to only answer about the content of the website we are chatting with and not other questions? Thanks for the amazing video
hey there, welcome aboard! yeah, that's one of the main complications of creating RAG applications. i have seen several ways of dealing with this. you might want to try some of these: 1. the main thing to do to control this issue is to add a restrictive instruction to your initial prompt, something like: "Don't justify your answers. Don't give information not mentioned in the CONTEXT INFORMATION" 2. to go even further, you can fine-tune your model on some of these edge cases. 3. lastly, a more complex approach is to add a "policing" AI that reads the main AI's answers and decides whether to accept them or not. this is, of course, more expensive and complex.
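for point 1, here's a small sketch of what that restrictive system prompt could look like (the exact wording is an assumption, tune it for your own app):

```python
# Hedged sketch of a restrictive RAG system prompt: the model is told to
# answer only from the retrieved context and to refuse anything else.
# The wording below is illustrative, not a guaranteed jailbreak-proof prompt.

RESTRICTIVE_SYSTEM_PROMPT = (
    "Answer ONLY from the CONTEXT INFORMATION below. "
    "Don't justify your answers. "
    "Don't give information not mentioned in the CONTEXT INFORMATION. "
    "If the answer is not in the context, reply exactly: "
    "'I cannot answer that from the provided context.'\n\n"
    "CONTEXT INFORMATION:\n{context}"
)

def build_system_prompt(context_chunks):
    # join the retrieved chunks with a separator and fill the template
    return RESTRICTIVE_SYSTEM_PROMPT.format(context="\n---\n".join(context_chunks))

prompt = build_system_prompt(["Paris is the capital of France."])
```

you would pass the resulting string as the system message of your chain; no prompt is bulletproof on its own, which is why options 2 and 3 above exist.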
It looks interesting. 👌 But one question: can I give it any website link and ask for the best keywords used on that website (like using it for digital marketing concepts)?
i suppose you can! although i would think this requires a different approach than using an LLM. you might need other NLP algorithms for this: maybe a pipeline that strips your corpus of text, removes filler and useless words, and keeps the main words. it could then generate a bag of words with your website's keywords. check it out, you can do all of this with python ;)
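here's a minimal pure-python sketch of that pipeline (the stopword list is deliberately tiny; a real project would use a fuller list or a library like scikit-learn or spaCy):

```python
# Minimal keyword-extraction sketch: lowercase the text, drop filler
# (stop) words, and count what remains to get candidate keywords.
from collections import Counter
import re

STOPWORDS = {"the", "a", "an", "and", "or", "of", "to", "in", "is", "for", "with", "on"}

def top_keywords(text, n=5):
    words = re.findall(r"[a-z']+", text.lower())
    filtered = [w for w in words if w not in STOPWORDS]
    return [word for word, _ in Counter(filtered).most_common(n)]

keywords = top_keywords(
    "Digital marketing tips: marketing for the web, web analytics, and marketing automation."
)
```

for a whole website you would first scrape the pages (e.g. with BeautifulSoup) and feed the concatenated text into something like this.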
This is so cool, thank you so much. Can this be applied to a database with many views or tables, so you can ask questions and it's intelligent enough to perform joins to get the answer? It would be interesting to see if it's possible, or to create a video. Thank you so much
Hi, I am not able to load data from the web on Windows. Every time it shows errors like `import pwd` and `from langchain_community.document_loaders.pebblo import PebbloSafeLoader`, and after addressing those it is still not working. What should I do? Please assist me. Thank you, you have done a very good job.
Hey Alejandro. I tried to deploy the chat-with-PDF app to Streamlit, but it's not working. Whenever I upload a PDF and start processing, it starts to download model weights. I am using Hugging Face APIs instead of OpenAI. Does Streamlit have any sort of limitation? If yes, is there a way I can perform this task without downloading weights, just like with the OpenAI APIs?
@@alejandro_ao thanks! Excited for it. I think its a very common use case where ppl wanna have multimodal RAG on several doc types and return the source doc
I did it as shown in the video, but in the end the OpenAI session timed out (due to the free tier) after I put in the URL 😭. Please give me some pointers on how to solve this issue and make it a runnable chatbot
hello there. that's very strange. are you sure you are not accidentally hitting the limit of the context window? you can use a model with a larger context window maybe. try `gpt-4-0125-preview`, it has a 128k token context window. if the issue persists, try opening an issue in our discord forum: link.alejandro-ao.com/discord it's free :)
Not the GPT-4 model itself! The model only has access to its training data. The ChatGPT Plus app uses a feature like the one here to enrich what the model does :) This video shows how that mechanism works, using the latest version of LangChain
@@alejandro_ao Yeah, of course it doesn't use it per se, but rather sends a query, launches a function that fetches the content of the page, then reads/interprets the content and "clicks" to change page and so on until it gets results. The intent of the video went over my head :) Keep it up! Love the content, there's not enough of it on RU-vid!
Hi Alejandro, great clip. I'm encountering this particular error, especially after I replace one website with another. Is there a way to reset the session_state after loading a new website to save memory? BadRequestError: Error code: 400 - {'error': {'message': "This model's maximum context length is 4097 tokens. However, your messages resulted in 15766 tokens. Please reduce the length of the messages.", 'type': 'invalid_request_error', 'param': 'messages', 'code': 'context_length_exceeded'}}
Hey there. That's odd. About the first question: you can deal with this by creating a button that flushes all the variables stored in Streamlit's session state, something like `if st.button("flush"): st.session_state.clear()` (session state is dict-like, so `clear()` is the way to empty it). If you are having this problem without a super long conversation history, you may want to use a shorter chunk size. The `RecursiveCharacterTextSplitter` takes a `chunk_size` parameter, which you can set to the maximum number of characters you want per chunk. You can get more info about this text splitter here, it's actually pretty cool: python.langchain.com/docs/modules/data_connection/document_transformers/recursive_text_splitter
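to make the chunk_size idea concrete, here's a deliberately naive character-based splitter (the real `RecursiveCharacterTextSplitter` is smarter: it tries paragraphs, then sentences, then words before hard-cutting, and supports overlap):

```python
# Naive chunking sketch: cut text into pieces of at most `chunk_size`
# characters, preferring to break at whitespace. Illustration only; not
# LangChain's actual implementation.

def naive_chunk(text, chunk_size=1000):
    chunks = []
    while len(text) > chunk_size:
        cut = text.rfind(" ", 0, chunk_size)  # prefer a whitespace boundary
        if cut <= 0:
            cut = chunk_size                  # no space found: hard cut
        chunks.append(text[:cut])
        text = text[cut:].lstrip()
    if text:
        chunks.append(text)
    return chunks

chunks = naive_chunk("word " * 500, chunk_size=100)
```

smaller chunks mean less text per retrieved document, which keeps the final prompt well under the model's context limit.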
Thank you, Alejandro, for your clear and insightful tutorials. They are much appreciated. I rebuilt your full solution and it's working like a charm. However, when I ask it a question outside the context of the webpage (like "What's the capital of France?", for example), it still answers. How can I ground it to answer only from the context? I tried adding the following to the system prompt of get_conversational_rag_chain: ("system", "Answer the user question based on the below context. If the context does not contain the answer, do not make up an answer and respond with \"I am sorry I cannot answer this out-of-context question!\": Context:{context}") However, it still answers irrelevant questions. What do you think?
Hi, thanks a lot for sharing. I also want to ask about the error message `APIConnectionError: Connection error` while running `Chroma.from_documents(chunks, embeddings)`. I am using AzureOpenAIEmbeddings. How do I solve this error? Thanks for helping.
Awesome video. What extensions do u use in VSCode? Seems to be very helpful. Also, can you please show how to input many data sources as an input. i.e many pages of a website
You are a rare gem. I really appreciate your knowledge sharing. Please release a video that uses natural language to SQL and that we can connect to WhatsApp. To make things more exciting, we could load images from WhatsApp into our database.
I tried your code and it works, but when I input a question I receive a message: TypeError: serializable.__init__() takes 1 positional argument but 2 were given
hey mate, can you log in to our discord server and post your question in the forum? it's free and that way it's easier to help you out. remember to include your code and the full error message so that we can help you! here is the link: link.alejandro-ao.com/discord
Amazing content as always. I have a request: Could you create a tutorial on creating a vector database with PDF files and using LangChain to query on it?
Hi! The chat is showing the scraped text and the message from the prompt template. How do I solve this? (We only want to show the queries and the responses.)
you are right! currently, it is working because it's getting the `user_query` variable from the wider scope. but it should be using the function param instead or it will stop working as soon as you modularize the app. thanks! i will push the corrected code in a moment :)
Hi Alejandro, thanks for your videos; they helped me take my first steps with LLM models. Would it be possible for you to show a GPU version of your last LangChain videos, i.e. how to run a chatbot over your own PDF on a GPU? Thanks a lot.
hey Mathias, thanks mate, it means a lot! sure thing, i'm very glad that you share these video ideas that can be super useful to the community! i'll be working on a video about it!
hey there. this method only loads the web page you pass to it. you would have to code something a bit more sophisticated to crawl the entire website! beautifulsoup, browserless and puppeteer are your friends 😎
@@alejandro_ao Hi. It would be appreciated if you could add one last part showing how you would use BeautifulSoup to extend this app from a web-page chat to a whole-website chat. Thanks.
that's miro.com/ but i have since switched to excalidraw.com/ (example here: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-kBXYFaZ0EN0.html ) both have a free tier
Are you referring to LangChain's document loader documentation? Many document loaders are available; this one is the web page loader. You pass it websites like blogs or articles to retrieve data, then query it using the OpenAI API. Read the LangChain documentation, and also check out the LlamaIndex data framework.
hey there, yeah this is correct! you would have to scrape the whole website first to chat with the whole website. you can use beautifulsoup to do that too, or you can use a no-code tool like octoparse (it's great). sorry about the title, the keyword rating for "chat with website" was like 50x better than "chat with web page" :S
That's what I'm very curious about. I made a PDF web scraper for mining documents to train a model, but I could only get it to pull the ones on the actual page the URL directed to. Wondering if there's a technique for automatically searching through all the pages associated with the original URL. I'm sure there is; I'm just a programming noob, so I'm learning as best I can. ✌️
hey there! there is a way indeed, but it is a bit more complex. you would have to either: 1. have access to the website database. this way is simpler because you would just have to apply a RAG algorithm to a database. 2. scrape the website. this is more complex, as it requires using something like python's beautifulsoup to scrape the contents of the entire website. but beware because some websites don't allow bots (sometimes they can even try to get you in trouble). a no-code tool for scraping that is very good is octoparse, but know that this is on the edge of what is allowed and they have had several lawsuits in the past for making scraping so easy.
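for option 2, here's a stdlib-only sketch of the first step of a crawler: pulling the same-domain links out of a page's HTML so you can follow them. (a real crawler would also fetch pages over the network, respect robots.txt, and de-duplicate visited URLs; the example URL below is made up.)

```python
# Extract same-domain links from HTML using only the standard library.
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

class LinkExtractor(HTMLParser):
    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        absolute = urljoin(self.base_url, href)  # resolve relative links
        # keep only links on the same domain as the page we started from
        if urlparse(absolute).netloc == urlparse(self.base_url).netloc:
            self.links.append(absolute)

html = '<a href="/blog/post-1">Post</a> <a href="https://other.com/x">Other</a>'
parser = LinkExtractor("https://example.com/")
parser.feed(html)
```

you would then fetch each collected link, extract its text, and repeat until you have the whole site; BeautifulSoup makes the parsing part nicer than the raw stdlib parser.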
Question: if the retrieval chain is already finding the most relevant document chunks based on conversation history and the user's input and passing them through {context}, why do we need to integrate the retrieval chain using "create_retrieval_chain(retriever_chain, stuff_documents_chain)"?
you mean all at the same time? you can use streamlit pages to merge some of the other apps we've built in this playlist! Here's their docs on how to do that: docs.streamlit.io/get-started/tutorials/create-a-multipage-app i'll be updating the chat pdf video soon with the latest langchain as well
Great tutorial! I'm a new subscriber and found it really helpful. I'm interested in using the Pinecone vector database for my projects. Could you please provide some guidance on how to get started with it? Any tips or resources would be greatly appreciated. Thank you!
Excellent video. But just for suggestion can you make video on how do we deploy the same code using some microservices like fastapi? As most of your videos are using streamlit ( I actually learned a lot about streamlit 😅) but in case of simple app deployment on even localhost with fastapi or flask will be very helpful.
You can get access to the API by signing up for billing on the OpenAI API platform (not the same as getting ChatGPT Plus). OR you can use Groq for free ;) they have a bunch of open source models and they are free to use. just swap a couple of lines in this code. here are the docs with langchain: python.langchain.com/v0.2/docs/integrations/chat/groq/
hey there. this only goes through the webpage that you load in the URL. if you want to scrape the entire website you would have to use another kind of technique, such as scraping with beautifulsoup, browserless or even puppeteer
@@alejandro_ao I looked at the langchain docs and I was wondering if you could do a video about the differences between all those URL loaders like WebBaseLoader, UnstructuredURLLoader, RecursiveURLLoader and how do we know when to use which
hey, in case you haven't seen it. here's the video on how to put this to production in a simple, free way: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-74c3KaAXPvk.htmlsi=xfL4RZuDTb3H4rgr
could you make a version where you don't delete stuff all the time? it wasn't always clear what was supposed to be deleted and what needed to be kept, so i often missed when you did it, which made it harder to follow what you were doing.
Hi, great video! But after I change the website URL, it keeps answering based on the first URL. I closed the browser and opened it again, inserted a different URL, and asked for the title of the article, and it still gave the title from the first URL. Even after changing the URL in the text input, the program keeps answering based on the first one.
Friend, could you teach me how to receive a data stream from OpenAI with langchain through document analysis? I swear I've put a lot of effort into figuring it out, but I'm struggling a bit with the language and the new documentation 😢
hey my friend, sure thing. but i'm not sure i can help more without more info. why don't you bring that up in our discord server? maybe we can help you out there: link.alejandro-ao.com/discord
I am having trouble with the Chroma library. When I import it, my editor only recognises the chroma class instead of Chroma. When I use chroma, it says it doesn't find from_documents. But when I try Chroma, it says it doesn't exist. Can someone point me in the right direction?
hey there, yeah, i have seen this issue before. try verifying that chromadb is correctly installed in your virtual environment by running `pip freeze | grep chromadb` in your terminal. if it is installed, just import Chroma even though your editor recommends `chroma` for some reason. this worked for me. let me know how it goes. also, feel free to join the discord server for more news about the channel and the community! 👉 link.alejandro-ao.com/discord