Rich Sutton, Toward a better Deep Learning

Tea Time Talks 2024: Parham Panahi, Experience Selection in Deep RL

Израиль атакует весь Ближний Восток?

Позов, Бебуришвили, Матвиенко, Ваш, Korya_mc, Dina Blin, Гурам, Lixxx | WINLINE MEDIA POKER - Стол 1

КОНФЛИКТ. СВАДЬБА? КОНЕЦ ИСТОРИИ...

BMW всего за 2 миллиона рублей #автомобили #bmw

Tea Time Talks 2024: Mahshid Rahmani Hanzaki, Tile-coding for Count-based Exploration

Подписаться 5 тыс.

Просмотров 25

50% 1

Видео Поделиться Скачать Добавить в

Tea Time Talks are back for another year. This summer lecture series, presented by Amii and the RLAI Lab at the University of Alberta, give researchers the chance to discuss early-stage ideas and prospective research. Join us for another series of informal 20-minute talks where AI leaders discuss the future of machine learning research.
Abstract:
Exploration-exploitation tradeoff is one of the challenges in reinforcement learning where the agent must tradeoff between choosing actions that have previously been effective in producing rewards or trying actions it has not yet explored. Despite recent advances in reinforcement learning, most complex agents still rely on randomness to explore the environment because of its simplicity.
An alternative to random exploration is count-based methods, where actions with fewer visitation counts are preferred over those that have been visited more frequently. Despite their theoretical guarantees, count-based exploration methods have not been widely used with function approximation in practice.
In this talk, I will explain how tile-coding can be used as a simple method to generalize counts over states. I will highlight two experiments to demonstrate how tile-coding for count-based methods can lead to better exploration compared to randomness in certain environments.

Опубликовано:

4 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии

Далее

Rich Sutton, Toward a better Deep Learning

31:36

Rich Sutton, Toward a better Deep Learning

Просмотров 5 тыс.

Tea Time Talks 2024: Parham Panahi, Experience Selection in Deep RL

37:40

Tea Time Talks 2024: Parham Panahi, Experience Selection in Deep RL

Просмотров 47

Израиль атакует весь Ближний Восток?

00:43

Израиль атакует весь Ближний Восток?

Просмотров 257 тыс.

Позов, Бебуришвили, Матвиенко, Ваш, Korya_mc, Dina Blin, Гурам, Lixxx | WINLINE MEDIA POKER - Стол 1

6:35:46

Позов, Бебуришвили, Матвиенко, Ваш, Korya_mc, Dina Blin, Гурам, Lixxx | WINLINE MEDIA POKER - Стол 1

Просмотров 144 тыс.

КОНФЛИКТ. СВАДЬБА? КОНЕЦ ИСТОРИИ...

17:14

КОНФЛИКТ. СВАДЬБА? КОНЕЦ ИСТОРИИ...

Просмотров 815 тыс.

BMW всего за 2 миллиона рублей #автомобили #bmw

00:25

BMW всего за 2 миллиона рублей #автомобили #bmw

Просмотров 18 тыс.

Tea Time Talks 2024: Aidan Bush, Multi-agent Deflection Routing with Bandits

36:29

Tea Time Talks 2024: Aidan Bush, Multi-agent Deflection Routing with Bandits

Просмотров 28

The Tea Time Talks: Khurram Javed, Meta-Learning Representations for Continual Learning (June 17)

32:30

The Tea Time Talks: Khurram Javed, Meta-Learning Representations for Continual Learning (June 17)

Просмотров 2,2 тыс.

CA Prop 33 and 34 #CARentControl #CAProp33 #CAProp34 #AB1482 #CostaHawkins

13:18

CA Prop 33 and 34 #CARentControl #CAProp33 #CAProp34 #AB1482 #CostaHawkins

Просмотров 47

Revolutionizing Video Game AI w/ Artificial Agency’s Andrew Butcher | Approximately Correct Podcast

32:53

Revolutionizing Video Game AI w/ Artificial Agency’s Andrew Butcher | Approximately Correct Podcast

Просмотров 208

AI Seminar Series: Marlos C. Machado - Autonomous nav of stratospheric balloons using RL (Jan 22)

1:08:05

AI Seminar Series: Marlos C. Machado - Autonomous nav of stratospheric balloons using RL (Jan 22)

Просмотров 1,5 тыс.

The Tea Time Talks: Rich Sutton, Open Questions in Model-based RL (May 27, 2019)

33:57

The Tea Time Talks: Rich Sutton, Open Questions in Model-based RL (May 27, 2019)

Просмотров 1,8 тыс.

DLRLSS 2019 - What’s Next - Yoshua Bengio

1:24:29

DLRLSS 2019 - What’s Next - Yoshua Bengio

Просмотров 2,8 тыс.

UB 2023: Practical Reinforcement Learning: Lessons from 30 years of Research, Keynote by Peter Stone

1:09:10

UB 2023: Practical Reinforcement Learning: Lessons from 30 years of Research, Keynote by Peter Stone

Просмотров 1 тыс.

Rich Sutton (DeepMind Alberta, University of Alberta, Amii) - Experience and Intelligence

1:36:43

Rich Sutton (DeepMind Alberta, University of Alberta, Amii) - Experience and Intelligence

Просмотров 2,3 тыс.

AI Seminar Series 2024: Contrastive Decoding for Concepts in the Brain, Cory Efird

51:51

AI Seminar Series 2024: Contrastive Decoding for Concepts in the Brain, Cory Efird

Просмотров 154

Израиль атакует весь Ближний Восток?

00:43

Израиль атакует весь Ближний Восток?

Просмотров 257 тыс.