Evaluating LLM-based Applications

[Webinar] LLMs for Evaluating LLMs

Ледник 1:0 Мужик

Алан, ты в порядке? 🤪 #dota2 #teamspirit

КАК СТАТЬ ГУРАМОМ АМАРЯНОМ #иванабрамов #гурамамарян #пародия #shorts

КОГДА К БАТЕ ПРИШЕЛ ДРУГ😂#shorts

Mitigating LLM Hallucinations with a Metrics-First Evaluation Framework

Подписаться 347 тыс.

Просмотров 24 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

29 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 20

@ajeethkumar6296 4 месяца назад

Thanks for the clear cut explanation

@MMSS-e9o Год назад

The real contribution seems to be the prompt they used to generate the CoT and the metric value... Could you share the code used for the metric and the prompt for ChatPGT?

@HonestGraduate Год назад

Thank you for the presentation and demo!

@purvislewies3118 Год назад

Blessed love...givethanks...Cape Town

@KokkeOP Год назад

The paper and the Slides are both in the description, guys. :) read.

@MMSS-e9o Год назад

Nice talk! Could you please share the notebook?

@JuliusOpusprofundum Год назад

❤

@danteblink Год назад

Do you think human intervention in the evaluation process is going to last? It seems its a process that LLMs could achieve by themselves in the near future.

@senderlapin Год назад

Я из России. Спасибо за вебинар.

@JuliusOpusprofundum Год назад

F u. I AM FROM UKRAINE.

@zaursamedov8906 Год назад

Guys would u be able to drop the notebook please?

@hcrespo3 Год назад

I'm also interested, thanks

@komalmistry7284 Год назад

Could someone share the link to the paper that was mentioned here "ChainPoll" , I believe.

@Deeplearningai Год назад

It is in the video description!

@davidvilla2402 Год назад

I don't know how bt I searched the n word and it came up

Далее

Evaluating LLM-based Applications

33:50

Evaluating LLM-based Applications

Просмотров 27 тыс.

[Webinar] LLMs for Evaluating LLMs

49:07

[Webinar] LLMs for Evaluating LLMs

Просмотров 10 тыс.

Ледник 1:0 Мужик

00:53

Ледник 1:0 Мужик

Просмотров 1,3 млн

Алан, ты в порядке? 🤪 #dota2 #teamspirit

00:13

Алан, ты в порядке? 🤪 #dota2 #teamspirit

Просмотров 126 тыс.

КАК СТАТЬ ГУРАМОМ АМАРЯНОМ #иванабрамов #гурамамарян #пародия #shorts

00:27

КАК СТАТЬ ГУРАМОМ АМАРЯНОМ #иванабрамов #гурамамарян #пародия #shorts

Просмотров 331 тыс.

КОГДА К БАТЕ ПРИШЕЛ ДРУГ😂#shorts

00:59

КОГДА К БАТЕ ПРИШЕЛ ДРУГ😂#shorts

Просмотров 3,1 млн

How to Build, Evaluate, and Iterate on LLM Agents

1:02:12

How to Build, Evaluate, and Iterate on LLM Agents

Просмотров 38 тыс.

Deep Dive into LLM Evaluation with Weights & Biases

59:11

Deep Dive into LLM Evaluation with Weights & Biases

Просмотров 18 тыс.

Building Trusted AI with LLMs with Richard Socher of You.com

32:41

Building Trusted AI with LLMs with Richard Socher of You.com

Просмотров 9 тыс.

Session 7: RAG Evaluation with RAGAS and How to Improve Retrieval

37:21

Session 7: RAG Evaluation with RAGAS and How to Improve Retrieval

Просмотров 21 тыс.

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

15:21

Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use

Просмотров 95 тыс.

VulnerabilityGPT: Cybersecurity in the Age of LLM and AI

1:18:28

VulnerabilityGPT: Cybersecurity in the Age of LLM and AI

Просмотров 22 тыс.

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

34:22

How to build Multimodal Retrieval-Augmented Generation (RAG) with Gemini

Просмотров 67 тыс.

Solving Gen AI Hallucinations

36:22

Solving Gen AI Hallucinations

Просмотров 3,1 тыс.

Navigating LLM Threats: Detecting Prompt Injections and Jailbreaks

52:21

Navigating LLM Threats: Detecting Prompt Injections and Jailbreaks

Просмотров 9 тыс.

How to evaluate an LLM-powered RAG application automatically.

50:42

How to evaluate an LLM-powered RAG application automatically.

Просмотров 22 тыс.

Ледник 1:0 Мужик

00:53

Ледник 1:0 Мужик

Просмотров 1,3 млн