
Mitigating LLM Hallucinations with a Metrics-First Evaluation Framework 

DeepLearningAI
24K views

Published: 29 Oct 2024

Comments: 20
@ajeethkumar6296 4 months ago
Thanks for the clear cut explanation
@MMSS-e9o 1 year ago
The real contribution seems to be the prompt they used to generate the CoT and the metric value... Could you share the code used for the metric and the prompt for ChatGPT?
@HonestGraduate 1 year ago
Thank you for the presentation and demo!
@purvislewies3118 1 year ago
Blessed love...givethanks...Cape Town
@KokkeOP 1 year ago
The paper and the Slides are both in the description, guys. :) read.
@MMSS-e9o 1 year ago
Nice talk! Could you please share the notebook?
@JuliusOpusprofundum 1 year ago
@danteblink 1 year ago
Do you think human intervention in the evaluation process is going to last? It seems it's a process that LLMs could achieve by themselves in the near future.
@senderlapin 1 year ago
I'm from Russia. Thank you for the webinar.
@JuliusOpusprofundum 1 year ago
F u. I AM FROM UKRAINE.
@zaursamedov8906 1 year ago
Guys would u be able to drop the notebook please?
@hcrespo3 1 year ago
I'm also interested, thanks
@komalmistry7284 1 year ago
Could someone share the link to the paper that was mentioned here? "ChainPoll", I believe.
@Deeplearningai 1 year ago
It is in the video description!
@davidvilla2402 1 year ago
I don't know how, but I searched the n word and it came up