Reinforcement Learning from scratch

How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.
0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost
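
The policy-gradient and probabilistic-policy chapters listed above can be illustrated with a small Python sketch. This is not the code from the video or from Karpathy's blog post. It assumes a hypothetical one-step toy task (the ball is either above or below the paddle), a single-weight logistic policy, and a made-up reward of +1 for moving toward the ball and -1 otherwise. The names `prob_up` and `train` are invented for this example.

```python
import math
import random


def prob_up(w: float, x: float) -> float:
    """Probabilistic policy: chance of moving the paddle up given feature x."""
    return 1.0 / (1.0 + math.exp(-w * x))


def train(steps: int = 2000, lr: float = 0.1, seed: int = 0) -> float:
    """Run a REINFORCE-style update on a one-step toy task and return the weight."""
    rng = random.Random(seed)
    w = 0.0  # single policy parameter, starts with no preference
    for _ in range(steps):
        # Hypothetical state: +1 if the ball is above the paddle, -1 if below.
        x = rng.choice([-1.0, 1.0])
        p = prob_up(w, x)
        went_up = rng.random() < p  # sample an action from the policy
        # Assumed reward: +1 for moving toward the ball, -1 otherwise.
        toward_ball = (went_up and x > 0) or (not went_up and x < 0)
        reward = 1.0 if toward_ball else -1.0
        # Gradient of log pi(action | x) with respect to w.
        grad_log_pi = (1.0 - p) * x if went_up else -p * x
        # Policy gradient step: make rewarded actions more likely.
        w += lr * reward * grad_log_pi
    return w


if __name__ == "__main__":
    w = train()
    print(f"P(up | ball above) = {prob_up(w, 1.0):.3f}")
    print(f"P(up | ball below) = {prob_up(w, -1.0):.3f}")
```

In this toy setting every update pushes the weight in the same direction, so the policy quickly learns to move toward the ball. Real Pong from pixels needs a larger network and rewards that arrive many steps after the actions that earned them.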

Published: 16 Jun 2024

Comments: 41

@darthvader4899 2 months ago

This video is super underrated. In fact the whole channel is underrated.

@themax2go 3 months ago

agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))

@themathguy3149 8 months ago

Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!

@tushargupta1999 3 months ago

This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.

@ashketchum1244 10 months ago

I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.

@metaljacket8102 2 months ago

This is really awesome! It's the best video that explains DRL in such an easy to understand way!

@codybarton2090 10 days ago

I agree once you see how it all works it seems like 1s and zeros give me some feed back on r/grand unified theory or cosmo knowledge

@a.aspden 9 months ago

Your videos are great. Looking forward to more!

@marcinstrzesak346 8 months ago

Great video, very helpful, easy to understand.

@gmjammin4367 10 months ago

Amazing video as always :)!

@CptDoge-rn3ou 7 months ago

I really like the way you visualize what you are talking about. Thank you for putting in the effort!

@cloudysh 2 months ago

This was so surprisingly great :3

@moldo800 5 months ago

Excellent. Congratulations ❤

@luiseduardocraizer7416 27 days ago

Excellent content!

@mado.madeleine 10 months ago

Super helpful! Thank you 🙏🏽

@jameslibby5215 9 months ago

Very very underrated channel

@benc7910 5 months ago

Underrated, two Rs

@jameslibby5215 5 months ago

@benc7910 thank ya sir

@mohajeramir 2 months ago

Excellent

@nikbivation 10 months ago

thank you for this!

@edvinbeqari7551 5 months ago

What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.

@ireoluwaTH 10 months ago

Thank you!!!

@BlueBirdgg 9 months ago

Can you playlist each one of your topics plz? I wanted to post on Twitter(X) your video topics but could only post a single video at a time. Great content by the way. Ty very much. Your perspective on some topics helped me a lot to get a more intuitive understanding.

@g5min 9 months ago

Good idea!
Here's one on generative AI: ru-vid.com/group/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo
Here's one on reinforcement learning: ru-vid.com/group/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL
Here's one on LLMs + text-to-image: ru-vid.com/group/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu

@BlueBirdgg 9 months ago

@g5min Ty!