Self Learning GPTs: Using Feedback to Improve Your Application

LangChain

Подписаться 62 тыс.

Просмотров 15 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

29 окт 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 22

@RobertoDuransh 7 месяцев назад

Okay, this is exciting! awesome work!!!

@kierkegaardrulez 7 месяцев назад

Can you rescind an instruction? Eg you later decide you don’t want emojis. If you say, “stop using emojis” will it add a new instruction or will it know that it can remove the previous instruction to achieve the same effect?

@theyvesloy 7 месяцев назад

I don't think thats working :(

@jzam5426 7 месяцев назад

Thanks for sharing!! So many questions! Is it really finetuning the model on the fly? how can one implement this using the API?

@kalahaval 7 месяцев назад

That's awesome stuff. Curious to learn how this is done under the hood.

@infocyde2024 7 месяцев назад

Yeah, is this dataset an embedding? Or is there some sort of automated fine tunning going on here?

@crotonium 7 месяцев назад

the description mentions that "then automatically use that feedback to improve over time. It does this by creating few-shot examples from that feedback and incorporating those into the prompt."

@insitegd7483 7 месяцев назад

@@crotonium I think that could be solved easily retrieving top-k last messages in a vector database or retrieving the last examples in a sql/nosql database to remember, but I am not sure.

@kirby145x 7 месяцев назад

My guess is it takes the top user scored items and then relays those if a related text is given to the agent. "Use these examples for a response that is desirable" etc.

@darwingli1772 7 месяцев назад

Thanks for the video. May I know if the token will be increased as user provides more feedback? I am not sure. But I assume the feedback from the user is also fed into the prompt to change the behavior of the LLM?