Mastering Robotics with Hindsight Experience Replay | Paper Analysis

Подписаться 42 тыс.

Просмотров 5 тыс.

50% 1

Hindisght experience replay works pretty simply: swap out the original goal your agent was trying to receive with one it actually received. It deals with environments with sparse rewards and large state spaces. Check out my analysis of the paper here.
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to sales@neuralnet.ai
www.neuralnet....
Or, pickup my Udemy courses here:
Deep Q Learning:
www.udemy.com/...
Actor Critic Methods:
www.udemy.com/...
Curiosity Driven Deep Reinforcement Learning
www.udemy.com/...
Natural Language Processing from First Principles:
www.udemy.com/...
Reinforcement Learning Fundamentals
www.manning.co...
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: bit.ly/3fXHy8W
Grokking Deep Learning: bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: bit.ly/2VNAXql
Come hang out on Discord here:
/ discord
Need personalized tutoring? Help on a programming project? Shoot me an email! phil@neuralnet.ai
Website: www.neuralnet.ai
Github: github.com/phi...
Twitter: / mlwithphil

Опубликовано:

13 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 22

@bobingstern4448 2 года назад

Hey! I would love to see you do this with the AlphaZero or AlphaGo Zero papers

@MachineLearningwithPhil 2 года назад

Sounds good. I've been meaning to do it for a while but haven't gotten to it

@christianleininger2954 2 года назад

yeah would be nice

@taneryilmaz3790 2 года назад

When will Curiosity RL come for tensorflow?

@kae4881 2 года назад

Ayy another top-class paper analysis!

@MachineLearningwithPhil 2 года назад

Thanks dude

@MuhammadWaqas-yx9ps 2 года назад

In your last video you mentioned the Never Give Up algorithm, I hope some day you can cover that paper too!

@johanngerberding5956 2 года назад

Very cool channel! Keep going, Sir!

@ikvpyuifq3126 2 года назад

thanks always, please IMPALA, pytorch tutorial

@billykotsos4642 2 года назад

Wasn’t Mujoko acquired by. Deepmind a few months back? I believe its open to wider use now

@MachineLearningwithPhil 2 года назад

Perhaps! I found the open source solution some time ago and just ran with it.

@VermontStrolls 2 года назад

Thanks. Please keep this great job.

@MachineLearningwithPhil 2 года назад

Thank you Farshad

@AKNiloy 2 года назад

Sir could you do a walkthrough video on D4PG? would be off great help

@MachineLearningwithPhil 2 года назад

Lemme add it to the list

@pisoiorfan 2 года назад

I'm not convinced by domain expertise argument. If current general intelligence uses domain expertise in order to get best results in e.g. brick laying - when it is taught by an experienced mason not a nurse or gardener, why is RL trying to outperform that? Is it that important for a data scientist to be ignorant about anything except ML?

@scalbylasusjim2780 2 года назад

Hello Phil! Just curious if you have studied and/or would consider making a video on model based methods such as Meta Policy Optimization (MB-MPO) and other model ensemble RL methods?

@MachineLearningwithPhil 2 года назад

I haven't studied but would be happy to learn. Thanks for the suggestion.

@scalbylasusjim2780 2 года назад

@@MachineLearningwithPhil Awesome! Thanks