Offline Reinforcement Learning: BayLearn 2021 Keynote Talk

The Case for Real-World Reinforcement Learning

Толстая девчонка не заметила котенка🤷‍♀️⁠@titwow

Песня РАСПУТИН на русском!🔥

Women’s Celebrations + Men’s 😮‍💨

Антон Теляков просит у прохожих укусить мороженое

Deep Reinforcement Learning with Real-World Data

Подписаться 20 тыс.

Просмотров 9 тыс.

50% 1

Видео Поделиться Скачать Добавить в

Опубликовано:

4 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 3

@张力-u2i 7 месяцев назад

Offline RL for language models is indeed a promising direction to explore. It's worth noting that Sergey, an expert in the field, has expressed concerns about the feasibility of online RL with language models. This reminds me how brilliant of the RLHF approach is

@haonanchang908 Год назад

Very good explanation for offline-RL! Thanks for sharing.

@binjianxin7830 Год назад

1:44 Could you be more specific about prompt engineering? It seems an highly interesting topic about the internal probabilistic structures of large models explains how they are exploited by it or might be even edited.

Далее

Offline Reinforcement Learning: BayLearn 2021 Keynote Talk

45:14

Offline Reinforcement Learning: BayLearn 2021 Keynote Talk

Просмотров 9 тыс.

The Case for Real-World Reinforcement Learning

25:17

The Case for Real-World Reinforcement Learning

Просмотров 14 тыс.

Толстая девчонка не заметила котенка🤷‍♀️⁠@titwow

00:28

Толстая девчонка не заметила котенка🤷‍♀️⁠@titwow

Просмотров 220 тыс.

Песня РАСПУТИН на русском!🔥

00:56

Песня РАСПУТИН на русском!🔥

Просмотров 136 тыс.

Women’s Celebrations + Men’s 😮‍💨

00:20

Women’s Celebrations + Men’s 😮‍💨

Просмотров 5 млн

Антон Теляков просит у прохожих укусить мороженое

00:36

Антон Теляков просит у прохожих укусить мороженое

Просмотров 41 тыс.

Reinforcement Learning Pretraining for Reinforcement Learning Finetuning

51:03

Reinforcement Learning Pretraining for Reinforcement Learning Finetuning

Просмотров 6 тыс.

The Bitterest of Lessons: The Role of Data and Optimization in Emergence

46:49

The Bitterest of Lessons: The Role of Data and Optimization in Emergence

Просмотров 4,6 тыс.

Model Based RL Finally Works!

28:01

Model Based RL Finally Works!

Просмотров 34 тыс.

Reinforcement Learning Series: Overview of Methods

21:37

Reinforcement Learning Series: Overview of Methods

Просмотров 97 тыс.

A Gentle Introduction to Offline Reinforcement Learning

18:53

A Gentle Introduction to Offline Reinforcement Learning

Просмотров 8 тыс.

Keynote - Offline reinforcement learning

29:21

Keynote - Offline reinforcement learning

Просмотров 4,6 тыс.

MIT 6.S191 (2023): Reinforcement Learning

57:33

MIT 6.S191 (2023): Reinforcement Learning

Просмотров 133 тыс.

Reinforcement Learning, by the Book

18:19

Reinforcement Learning, by the Book

Просмотров 96 тыс.

GEOMETRIC DEEP LEARNING BLUEPRINT

3:33:23

GEOMETRIC DEEP LEARNING BLUEPRINT

Просмотров 183 тыс.

Making Real-World Reinforcement Learning Practical

38:23

Making Real-World Reinforcement Learning Practical

Просмотров 14 тыс.

Толстая девчонка не заметила котенка🤷‍♀️⁠@titwow

00:28

Толстая девчонка не заметила котенка🤷‍♀️⁠@titwow

Просмотров 220 тыс.