I am blown away by how much information is densely packed into this. You've got yourself a new subscriber, sir. It's staggering to think about how these technologies will shape the landscape for data and analytics. This is just the beginning.
Damn, I spent like a week researching this shit on my own, and have been working on almost exactly the same thing. Processing MDX files into embeddings etc. It’s really cool to see somebody doing almost the same exact thing. Makes me think I am really on the right track!
@@automioai In this project no PDF files were used - all the documentation was written directly in MDX. You'll have to do some research on ways to extract text from PDF files. Once you have that, I wouldn't bother with MDX at all - just generate embeddings directly on that content.
Prediction: this is gonna get a million views. I just saw the Fireship video about vector databases and wanted to understand embeddings. Before I could even search, this video was on the page. Though I wasn't interested in a 40 min video (had a feeling I'd just stop after 5 mins like I usually do), I ended up watching it all. The rabbit hole 🐇 🕳️ format is so naturally elegant. Clear end-to-end use case. I secretly don't want to share it with anyone, but I am forced to fulfill my prediction.
The clarity of this video while maintaining detailed granularity of the subject is very impressive and very appreciated. Thank you for making this video.
Found your channel while learning React Three Fiber, subbed with notifications immediately. Today I get a notification for a well-explained ChatGPT tutorial, right as I embark on building a similar thing. Fantastic continued work, thank you very much!
I've been looking for this information for months. Such an excellent tutorial, and I love that Supabase's code is all open source so I can actually clone it and read how it works in detail later. Thank you so much for the walkthrough. Super talented dude too - love the Blender stuff.
@@RabbitHoleSyndrome why is the generate-embeddings file in the video so different than what is in the repo now? I can't find anything you talk about in minutes 10-13.
Hey @@fraternitas5117! Supabase moves pretty quick - the code I referenced has since been refactored to support multiple knowledge sources (i.e. more than just markdown). You can find the markdown-specific code here: github.com/supabase/supabase/blob/1b2361c099c2573afa1fe59d3187343bb8f1bcab/apps/docs/scripts/search/sources/markdown.ts
What a fantastic video and content. I've gone through multiple videos trying to better understand embeddings and how to work with ChatGPT in the best way for querying large amounts of content and producing an analyzed response. I'm not a developer - I have a background in computer science, but I'm a software salesperson who is curious about technology - and I was able to completely understand your video and content. Subscribed, liked, and will be watching more of your videos. Thank you!
Wow. I'm so thrilled to know that you were one of the people behind that great feature. I've been using Supabase for 6 months and have been pretty happy with it, except for the docs and the transition to 2.0. I was blown away when I saw it generate the code for me when I started typing in its documentation.
@@RabbitHoleSyndrome So far so good! I think one challenge is knowing how to check whether a user's email address exists (or other specific user metadata). I couldn't find it in the docs. There was a GitHub issue which said to store the user's data in a separate table, since the auth table is private. I ended up doing that and haven't had any problems. Btw, thanks again for all the awesomeness!
The most amazing thing is that I basically made a chatbot app in less than a week with only the help of GPT-4, with no prior knowledge of AWS services, PostgreSQL, or Python. Everything you described in the video is what GPT-4 told me. All of the servers and the database are set up; it has memory, STT, TTS, and Cognito login/register.
Fireship guy? Either way, it's been pretty useful in terms of learning APIs and how to connect them to my no-code builder. I spent hours trying to get things working, and the assistant basically told me what I was doing wrong and how to fix it. So well done with the implementation.
Dude, that's exactly what I was looking for. No more bullshit articles with clickbait titles - just DIY in its essence. Is there a way to support you through Patreon or something?
Insane, I love how you're able to do so many things. My laptop can't keep up with every wish of mine at the moment (getting into 3D), but I hope I'll be able to soon.
This is a glorious illustration. Thank you very much! I've been trying to find an example of doing this, and yours has put it all together for me! Subscribed!
28:37 Whenever I see examples of decoder (GPT) prompts starting with "You are a helpful finance advisor" or "You are an enthusiastic support rep", I can almost see the AI clearing its throat and sitting up straight and saying "right, ok". Gimme that can-do attitude, GPT!
One thing that might help is if the answer showed links to the documents the information was pulled from. Since you currently fetch which documents to run ChatGPT on based on feature similarity, maybe you could change the prompt so that it also returns the links of the documents that were deemed "similar documents".
First time here. This is so well done. Subscribed. Your view count will explode! I like how you approached the topic in a very calm way without jumping on the "LLMs will take over the world" train :) You don't happen to have the Clippy Blender asset somewhere?
Amazing video! This is exactly what I've been looking for for a long time. You basically explain everything I wanted to know about how to create a search engine using OpenAI. But I have a few questions: how much did you spend on the OpenAI embedding API building this? How much does Supabase spend monthly on searches using the OpenAI API? Is it possible to use an open-source embedding API instead of calling the OpenAI API? Wouldn't it be less expensive than the approach you took?
Glad it was useful! As for costs, you may be surprised how inexpensive OpenAI embeddings are (at least I was). To put it in perspective, for the Supabase guides we currently have around 1500 page sections, which total just over 220,000 tokens. At OpenAI's current embedding price ($0.0004/1k tokens), that brought us to just under $0.10 for the entire guide knowledge base (a roughly one-time pre-processing cost). After that, the average query is likely only a small fraction of a cent.
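If you want to sanity-check that math, it's just tokens ÷ 1000 × price - the numbers below are the ones from this reply:

```ts
// Back-of-envelope embedding cost for the whole knowledge base
const totalTokens = 220_000; // ~1500 page sections
const pricePer1kTokens = 0.0004; // USD, text-embedding-ada-002 at the time
const cost = (totalTokens / 1000) * pricePer1kTokens;
console.log(cost.toFixed(3)); // "0.088" → just under $0.10, one-time
```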
This is a fantastic video! Thank you very much for sharing :D Quick question - currently, if the info is not in the documentation, it responds "Sorry, I don't know how to help with that." But how can we make it respond like this instead: "Sorry, I don't have relevant info in the documentation, but you can do something like this..."? E.g. "I don't have any info about how to make banana pancakes in the documentation, but here is how you can make them..." The idea is to make it act like ChatGPT on top of the information provided. Keen to know more on this, and thank you so much for making this video :D
I have not seen such a great video in a while! How wonderfully you explained the whole process 👍👍! Could you explain a bit more about how you chose 0.78 as the threshold for the embeddings comparison? Did you verify statistically whether the most relevant sections can be found with it?
Glad you liked it! 0.78 was a first-stab threshold that worked best based on a limited sample of test queries. I wouldn't claim that this number is universal - it could almost certainly vary by domain.
"pretty much every single open source project I've seen that has documentation uses either markdown or MDX". I'm pretty sure the embedding process and AI stuff is interesting... The real question is who would do an entire markdown for a specific open source project... For free!! I mean I understand the willingness to give code for free since you not only get exposure, but you get to show off your coding abilities... But documentation... Let's review the lifecycle of a project just to see how difficult is to materialize a piece of **Documentation.** First comes the goal of a project, what problem it aims to solve. Maybe the goal is to compete financially. Then the scouting process, this may be combined with actual development, even if you think you know everything you'll get lost... Finally the testing phase, the project has been "finished" and it is being tested on every possible configuration and all different devices. Finally comes the documentation. Now the best/worst part is that the Documentation is a WHOLE program on it's own, with it's own set of keywords and ruleset. You know what's my guess?? Only the projects actually making money... Or the ones with a wealthy owner are the ones getting documentation. In fact, the fact that this documentation is a 1:1 match with it's super nice web page, tells me that the web page came BEFORE the MDX doc... Which means the entire project was envisioned as a capital backed enterprise with investors and what not. EVEN IF the documentation process is something which is automated, the question is what type of input this automation accepts... No one will code in function of a good automated documentation... So I'm sure you either a) need to clean the automated autogenerated documentation b) a low payed intern does it. c) there is no such thing as being automated, you just need a line of coke and full goblin mode. d) there is actual people doing documentation for free... You want to know the worst thing? I've never found ANY documentation worth my time reading... None. Most documentation hide important information and make assumptions about what users will or will not do with the code. I've seen documentation from million dollars companies being COMPLETELY wrong about what the actual code does. Systems cannot be explained with words since they are 2 dimensional, you cannot reference, cross reference or encapsulate ideas, to abstract them and use them as functions... you actually need to see the source code to REALLY understand. What I see happening, or what I REALLY really hope happens is that the cost of doing menial jobs such as writing documentation will balloon because the market will notice it's importance in the process of AI embedding. Writing the documentation IS the real hard work.
Great content. I wonder if it can generate code as well. Let's say you're doing this for Cloudflare products - KV, Workers, Durable Objects, etc. - with all the docs injected. Then you give a prompt like "generate a worker for x" and it generates specifically against the given docs.
Loved it - came for the embeddings and left with that and a whole lot more! Too bad Clippy didn't make it onto the web at Supabase :). Copyright problems?
Thanks for watching! Clippy did make it - but things move fast at Supabase 😅. The feature has evolved into a unified cmd+k menu and was just renamed "Supabase AI".
What kind of vectors did you generate from ChatGPT? Are they word vectors? You passed a whole section of the MDX, so are they not word vectors but paragraph vectors?
Hey! Context injection does a couple of things:
1. It primes the prompt with specific information we want GPT to reply with.
2. We can always use up-to-date information - anytime we need to add/remove/update information in our knowledge base, it's as simple as a DB update. No need to re-fine-tune the model all over again.
See the sketch below for what this looks like in practice.
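As a rough sketch (not the actual Supabase code - the variable names here are illustrative), context injection just means pasting the matched sections into the prompt:

```ts
// Illustrative only: build a prompt with the matched doc sections injected.
const matchedSections = [
  { content: 'pgvector lets you store embeddings in a Postgres column…' },
];
const userQuery = 'How do I store embeddings?';

const prompt = `Answer the question using only the context below.

Context:
${matchedSections.map((s) => s.content).join('\n---\n')}

Question: ${userQuery}`;
```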
Amazing! How does Clippy update the vector DB and process the text as embeddings when NEW documentation is added? Is it automatic? Maybe I missed it in the video.
Great question! The `generate-embeddings` script was designed to be diff-based, so the next time you run it, it pulls in only the documents that have changed and re-creates embeddings on just those. It currently works using checksums:
1. Generate a checksum for the content and store it in the DB.
2. The next time the script runs, compare the checksums. If they don't match, the content has changed and the embeddings should be re-generated.
The script runs on CI, so anytime documents change, a GitHub Action triggers the script. See this PR for details: github.com/supabase/supabase/pull/13936
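The checksum logic looks roughly like this (just a sketch - the real script lives in the repo, and the DB access here is faked with a Map):

```ts
import { createHash } from 'node:crypto';

// Sketch of the diff-based check: re-embed only when content has changed.
const checksum = (content: string) =>
  createHash('sha256').update(content).digest('base64');

function shouldRegenerate(
  path: string,
  content: string,
  storedChecksums: Map<string, string> // stand-in for the DB table
): boolean {
  const current = checksum(content);
  if (storedChecksums.get(path) === current) return false; // unchanged → skip
  storedChecksums.set(path, current); // record the new checksum
  return true; // changed → re-generate embeddings for this document
}
```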
@@RabbitHoleSyndrome Thank you for making this content for us for free! I very much appreciate it, and you motivate me to share knowledge with my friends :)
Thanks! PDF to MDX will be a tough task. But if your end goal is embeddings, you could consider pulling the content out of the PDF and generating embeddings on it directly without getting MDX involved.
This is amazing - a super clear and short explanation of embeddings, and a great walkthrough of the code touching on the relevant parts. Subbed to the channel 👍 I see the code is still in the Supabase repo - but Clippy is not there? What happened?
Glad it was helpful!

Edit: Just realized you might have been talking about the Clippy graphic on the site, not the code. Search and Clippy have been combined into the same interface - you can find Clippy by clicking search, then switching to "Ask Clippy".

Original: Things move quick at Supabase 😆 The Clippy frontend code got moved when search was upgraded to also use embeddings. After the refactor everything is just under "Search": github.com/supabase/supabase/blob/0ecc238ad6d81202bb2301f7919b166a98929697/apps/docs/components/Search/SearchModal.tsx The backend Clippy logic is still in the same edge function: github.com/supabase/supabase/blob/0ecc238ad6d81202bb2301f7919b166a98929697/supabase/functions/clippy-search/index.ts
@@RabbitHoleSyndrome Cool. You can give Supabase this feedback: a) this video significantly increased my interest in adopting Supabase; b) the Clippy icon is too small - it should be large and obtrusive. Serious prompt engineering question: with the Chat Completion API, do you set a system persona and pack the whole prompt into the user message? Or would you break a prompt like this one into, say, 3 user entries: context, question, "use markdown"? Does it make a difference?
Good feedback 🙌 and great question about prompt engineering in the chat API. We are actually experimenting with this right now, trying to understand what produces the best results. At the moment we do a bit of both (a system message and a user message, with some prompt overlap between the two). OpenAI says the model doesn't pay strong attention to the system message, so it may be better to use the system message strictly to set identity, and provide instructions & context in a user message - roughly like the sketch below.
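Concretely, the split we're experimenting with looks something like this (the values here are illustrative):

```ts
// Identity in the system message; instructions + context in the user message.
const context = '…matched doc sections go here…';
const question = 'How do I create a row level security policy?';

const messages = [
  { role: 'system', content: 'You are an enthusiastic Supabase support rep.' },
  {
    role: 'user',
    content: `Answer using only this context, formatted as markdown:

${context}

Question: ${question}`,
  },
];
```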
Great video! How closely do the .mdx files have to match this structure before they can be processed into embeddings? Do they need to export the meta const, for example?
Are there alternatives to OpenAI for creating these vectors? I don't really feel comfortable building something around a closed-source API that is controlled by one vendor.
Really great question. You'll want to look into sentence embeddings. There has been a lot of work on the OSS side with Sentence-BERT (SBERT) that you can check out. You might also want to look at Universal Sentence Encoder (USE) and InferSent - see the sketch below for one way to run an SBERT-family model yourself.
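For example, you can run an SBERT-family model locally in JS with the open-source Transformers.js library (a sketch - the library and model choice are my suggestion, not something covered in the video):

```ts
import { pipeline } from '@xenova/transformers';

// all-MiniLM-L6-v2 is a small SBERT-family model that runs fully locally.
const embed = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2');
const output = await embed('How do I store embeddings in Postgres?', {
  pooling: 'mean',   // mean-pool token embeddings into one sentence vector
  normalize: true,   // unit-length vectors → dot product = cosine similarity
});
console.log(output.data.length); // 384-dimensional sentence embedding
```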
LlamaIndex actually uses OpenAI (text-embedding-ada-002) by default for embeddings today - it's more of a toolkit layer to assist with the workflow. There are many other alternatives, though (which LlamaIndex supports via LangChain), that are worth checking out: langchain.readthedocs.io/en/latest/reference/modules/embeddings.html
16:10 I'd love an explanation of how embeddings work with document structure - e.g. a query like "summarize chapter 3". Embeddings by themselves don't seem to capture the structure of the chunks contained under the title chunk "chapter 3". All explanations of embeddings I've seen rely only on the text content within a chunk.
Can you make the completion also say which chunks were used and link to them, or provide "read more" links? Or would you do that by just listing the "top 5" matches from the context?
So if I understood correctly: the embeddings were only used to check for similarity between the user's input and the docs' content, in order to provide the prompt with relevant (text) context, right? Is there a way to provide the GPT model with the embeddings themselves instead?
That's correct. You could have used an alternate search method, but embeddings align nicely with LLMs since they are produced by a language model themselves. Unfortunately, no - there is currently no way to inject embeddings directly into GPT. Maybe this will change in the future, or become available in open-source models like LLaMA, the same way we've seen it happen with Stable Diffusion.
It wasn't released yet (things move fast these days 😆). We've actually just updated to use gpt-3.5-turbo - much cheaper, and better suited for multi-message chat-style interactions (though single-prompt responses are still possible). The most difficult challenge with gpt-3.5-turbo has been getting it to work well with a prompt - it doesn't seem quite as good at following the original instructions.
Hey - CSV files are definitely doable. It will mostly come down to how you plan to pre-process them. Perhaps you could import your CSV file into a table and generate embeddings on the content within it - something like the sketch below.
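A hedged sketch of what that could look like (naive CSV handling, and the column layout is an assumption on my part):

```ts
import OpenAI from 'openai';

const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

const csvText = `name,description
pgvector,Store embeddings in Postgres`;

// Naive parsing: embed each data row as one chunk of content.
const rows = csvText.split('\n').slice(1); // skip the header row
for (const row of rows) {
  const res = await openai.embeddings.create({
    model: 'text-embedding-ada-002',
    input: row,
  });
  const embedding = res.data[0].embedding; // store this next to the row content
  console.log(row, embedding.length);
}
```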
Loved the video - it validated a lot of the decisions we're making at work. I have a question, however, about the section on context injection. You mention that you search for relevant information to inject into the prompt. How do you accomplish the search part? Is it using an index, or a SQL query across all columns?
Glad it was helpful! The search is done through embeddings - we perform a similarity search between the embedding generated from the user's query and the pre-generated embeddings of the knowledge base (stored in a column using pgvector). Roughly like the sketch below.
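From the client's perspective it looks something like this (a sketch - the RPC name and parameters are illustrative stand-ins for the real ones in the Supabase repo):

```ts
import { createClient } from '@supabase/supabase-js';

const supabase = createClient(
  process.env.SUPABASE_URL!,
  process.env.SUPABASE_ANON_KEY!
);

// queryEmbedding comes from the OpenAI embeddings API for the user's query.
const queryEmbedding: number[] = []; // fill with the 1536 floats from ada-002

// Call a Postgres function that ranks stored embeddings by similarity.
const { data: sections, error } = await supabase.rpc('match_page_sections', {
  query_embedding: queryEmbedding,
  match_threshold: 0.78, // the first-stab threshold mentioned earlier
  match_count: 10,       // top matches to inject as context
});
```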
@@RabbitHoleSyndrome Ahh, so: 1) call the OpenAI embedding API for the query, 2) use cosine similarity to compare the query embedding against the stored embeddings, 3) inject the top results into a prompt that we compile and send to the OpenAI completion API?
Hey, great content! I was looking for exactly this. Do you have a Discord for asking questions? I was making the same thing, but for ".md" files rather than MDX, and ran into some issues.