No video :(

Why you should build an LLM benchmark [English]

Big Data Demystified

Подписаться 627

Просмотров 1,8 тыс.

50% 1

Видео Поделиться Скачать Добавить в

📊 Dive Deep into the World of LLM Benchmarks! 📊
Objective: By the end of this session, you should have a good understanding of how to select and maintain your own LLM benchmark.
Agenda:
🔬 Demo!
🔍Discover what ARC, HellSwag, and MMLU are exactly
🧫 Learn how to select the right benchmark
🧪 Methods to test LLMs tailored to your unique use case
🧱 Q&A
Speaker: J. Yarkoni ex-Google AI/ML Specialist (Shujin.ai)
Jonathan comes from a background of leading R&D teams. Previously he co-founded NAM, an advertising startup, and AA-TLV meetup, which at its peak had 3,500 members. Over the last six years, he spearheaded AI/ML initiatives at Google Cloud Israel. More recently, he established Shujin.AI, a consultancy specializing in ML projects with an emphasis on Generative AI.
big-data-demys...

Опубликовано:

25 авг 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 1

@jazzvids Месяц назад

Thank you for this valuable talk! I am currently writing my masters' thesis in nlp and this is very helpful

Далее

Learn Core Machine Learning for FREE | Ultimate Course for Beginners

9:32:46

Learn Core Machine Learning for FREE | Ultimate Course for Beginners

Просмотров 439 тыс.

State of API Security Webinar - Wallarm + @gigaom

54:23

State of API Security Webinar - Wallarm + @gigaom

Просмотров 126

НАСКОЛЬКО ИЗМЕНИТСЯ ВЕС СИГАРЕТЫ НА ВЕСАХ?

17:24

НАСКОЛЬКО ИЗМЕНИТСЯ ВЕС СИГАРЕТЫ НА ВЕСАХ?

Просмотров 436 тыс.

z tutorial Up and Down #TREND 👀😂 #trend #tutorial #backstage #lol #magic #creative #shorts

00:18

z tutorial Up and Down #TREND 👀😂 #trend #tutorial #backstage #lol #magic #creative #shorts

Просмотров 626 тыс.

СМАЗАЛ ДВЕРЬ

00:31

СМАЗАЛ ДВЕРЬ

Просмотров 125 тыс.

Only Pro Knows this technique! Expert Hacks for Steel Ruler #shorts #diy #tips #tricks

00:25

Only Pro Knows this technique! Expert Hacks for Steel Ruler #shorts #diy #tips #tricks

Просмотров 3,4 млн

Understanding LLM Inference | NVIDIA Experts Deconstruct How AI Works

55:39

Understanding LLM Inference | NVIDIA Experts Deconstruct How AI Works

Просмотров 2,4 тыс.

Semantic Layer vs. Metric Layer in Business Intelligence [English]

59:37

Semantic Layer vs. Metric Layer in Business Intelligence [English]

Просмотров 930

Rise, Fall and re-Rise of the Semantic Layer [English]

32:04

Rise, Fall and re-Rise of the Semantic Layer [English]

Просмотров 383

How to Evaluate LLM Performance for Domain-Specific Use Cases

56:43

How to Evaluate LLM Performance for Domain-Specific Use Cases

Просмотров 1,2 тыс.

LLM Evaluation Essentials: Benchmarking and Analyzing Retrieval Approaches

53:47

LLM Evaluation Essentials: Benchmarking and Analyzing Retrieval Approaches

Просмотров 1,6 тыс.

Improving LLM accuracy with Monte Carlo Tree Search

33:16

Improving LLM accuracy with Monte Carlo Tree Search

Просмотров 10 тыс.

The moment we stopped understanding AI [AlexNet]

17:38

The moment we stopped understanding AI [AlexNet]

Просмотров 957 тыс.

What are AI Agents?

12:29

What are AI Agents?

Просмотров 207 тыс.

The Science of LLM Benchmarks: Methods, Metrics, and Meanings | LLMOps

45:03

The Science of LLM Benchmarks: Methods, Metrics, and Meanings | LLMOps

Просмотров 2,1 тыс.

Generative AI is just the Beginning AI Agents are what Comes next | Daoud Abdel Hadi | TEDxPSUT

13:16

Generative AI is just the Beginning AI Agents are what Comes next | Daoud Abdel Hadi | TEDxPSUT

Просмотров 175 тыс.

НАСКОЛЬКО ИЗМЕНИТСЯ ВЕС СИГАРЕТЫ НА ВЕСАХ?

17:24

НАСКОЛЬКО ИЗМЕНИТСЯ ВЕС СИГАРЕТЫ НА ВЕСАХ?

Просмотров 436 тыс.