Deep Reinforcement Learning with Real-World Data 

RAIL
Subscribers: 20K
Views: 9K

Published: 4 Oct 2024

Comments: 3
@张力-u2i 7 months ago
Offline RL for language models is indeed a promising direction to explore. It's worth noting that Sergey, an expert in the field, has expressed concerns about the feasibility of online RL with language models. This reminds me how brilliant the RLHF approach is.
@haonanchang908 1 year ago
Very good explanation for offline-RL! Thanks for sharing.
@binjianxin7830 1 year ago
1:44 Could you be more specific about prompt engineering? It seems a highly interesting topic: how the internal probabilistic structures of large models explain the way they are exploited by it, or might even be edited.