Why you should build an LLM benchmark [English] 

Big Data Demystified

📊 Dive Deep into the World of LLM Benchmarks! 📊
Objective: By the end of this session, you should have a good understanding of how to select and maintain your own LLM benchmark.
Agenda:
🔬 Demo!
🔍 Discover what ARC, HellaSwag, and MMLU are exactly
🧫 Learn how to select the right benchmark
🧪 Methods to test LLMs tailored to your unique use case
🧱 Q&A
Speaker: J. Yarkoni, ex-Google AI/ML Specialist (Shujin.AI)
Jonathan comes from a background of leading R&D teams. Previously he co-founded NAM, an advertising startup, and AA-TLV meetup, which at its peak had 3,500 members. Over the last six years, he spearheaded AI/ML initiatives at Google Cloud Israel. More recently, he established Shujin.AI, a consultancy specializing in ML projects with an emphasis on Generative AI.
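As a rough illustration of the agenda item on testing LLMs against your own use case: ARC, HellaSwag, and MMLU are all multiple-choice benchmarks, so a home-grown benchmark can start as a list of multiple-choice items scored by accuracy. The sketch below is not taken from the talk. The `Item` class, the `accuracy` function, the sample questions, and the `always_first` baseline are all hypothetical names made up for this example, and a real setup would replace the baseline with a call to the model under test.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Item:
    """One multiple-choice benchmark item (hypothetical structure)."""
    question: str
    choices: Sequence[str]
    answer: int  # index of the correct choice


def accuracy(items: Sequence[Item],
             model: Callable[[str, Sequence[str]], int]) -> float:
    """Return the fraction of items where the model picks the correct index."""
    if not items:
        raise ValueError("benchmark has no items")
    correct = sum(1 for it in items if model(it.question, it.choices) == it.answer)
    return correct / len(items)


# Placeholder items standing in for questions from your own domain.
ITEMS = [
    Item("2 + 2 = ?", ["3", "4", "5"], 1),
    Item("Capital of France?", ["Paris", "Rome", "Oslo"], 0),
    Item("Opposite of 'hot'?", ["warm", "cold", "dry"], 1),
    Item("Largest planet in the Solar System?", ["Mars", "Venus", "Jupiter"], 2),
]


def always_first(question: str, choices: Sequence[str]) -> int:
    """Trivial baseline that always answers with choice 0.

    Assumption: a real harness would call an LLM here and parse its
    reply into a choice index.
    """
    return 0


if __name__ == "__main__":
    print(f"baseline accuracy: {accuracy(ITEMS, always_first):.2f}")
```

Keeping the model behind a plain callable means the same item set can be re-run whenever a model or prompt changes, which is one simple way to maintain a benchmark over time.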
big-data-demys...

Published: 25 Aug 2024

Comments: 1
@jazzvids, 1 month ago
Thank you for this valuable talk! I am currently writing my master's thesis in NLP, and this is very helpful.