ML Interpretability: feature visualization, adversarial example, interp. for language models 

Umar Jamil

In this video, I will be introducing Machine Learning Interpretability, a vast topic that aims to understand the inner mechanisms by which machine learning models make their predictions, so that we can debug them and make them more transparent and trustworthy.
I will start by reviewing deep learning and the back-propagation algorithm, which are necessary for understanding adversarial example generation and feature visualization for computer vision classification models. In the second part, I will show how we can leverage the knowledge built in the first part of the video and apply it to language models. In particular, we will see how to gain insight into the biases of a language model by generating a prompt that maximizes the likelihood of the next token being a certain concept of our choice (a minimal sketch of this idea follows the examples below). This allows us to answer questions like:
"What does my language model think of women?"
"What does my language model think of minorities?"
This video has been built in collaboration with Leap Labs, an AI research lab that works on machine learning interpretability and built the Leap Labs Interpretability Engine, which provides insights into how computer vision models work and how to improve them by generating prototypes, isolating features, and understanding entanglement between classes.
Leap Labs: www.leap-labs....
Leap Labs Tutorials: docs.leap-labs...
As usual, the code and PDF slides are available at the following links:
- PDF slides: github.com/hkp...
- Adversarial Example Generation (tricking a classifier): github.com/hkp...
- Generate inputs for language models: github.com/jes...

Published: 16 Sep 2024
