Тёмный

Reinforcement Learning from scratch 

Graphics in 5 Minutes
Подписаться 21 тыс.
Просмотров 41 тыс.
50% 1

How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT.
Part 1 of 3.
0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost

Опубликовано:

 

16 июн 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 41   
@darthvader4899
@darthvader4899 2 месяца назад
this is video is super underrated. In fact the whole channel is underrated.
@themax2go
@themax2go 3 месяца назад
agi: 1. ai develops understanding of win-loss conditions and sets policy params (inputs & actions) accordingly. 2. ai creates (= designs & builds) training env(s). 3. ai iterates, avals & adjusts policy parameters accordingly 4. done (or validation run(s) w/ human(s))
@themathguy3149
@themathguy3149 8 месяцев назад
Your Channel IS SO GREAT, I share with all my eng friends for you to get more visibility!
@tushargupta1999
@tushargupta1999 3 месяца назад
This video is amazing. You explained everything in such a simple manner. I am feeling really motivated to learn more about reinforcement learning and neural networks after watching this.
@ashketchum1244
@ashketchum1244 10 месяцев назад
I don't know how I stumbled upon this video but that was very interesting and intuitive to understand. Thank you.
@metaljacket8102
@metaljacket8102 2 месяца назад
This is really awsome! It's the best video that explains DRL in such an easy to understand way!
@codybarton2090
@codybarton2090 10 дней назад
I agree once you see how it all works it seems like 1s and zeros give me some feed back on r/grand unified theory or cosmo knowledge
@a.aspden
@a.aspden 9 месяцев назад
Your videos are great. Looking forward to more!
@marcinstrzesak346
@marcinstrzesak346 8 месяцев назад
Great video, very helpful, easy to understand.
@gmjammin4367
@gmjammin4367 10 месяцев назад
Amazing video as always :)!
@CptDoge-rn3ou
@CptDoge-rn3ou 7 месяцев назад
I really like the way you visualize what you are talking about. Thank you for putting in the effort!
@cloudysh
@cloudysh 2 месяца назад
This was so surprisingly great :3
@moldo800
@moldo800 5 месяцев назад
Excellent. Congratulations ❤
@luiseduardocraizer7416
@luiseduardocraizer7416 27 дней назад
Excellent content!
@mado.madeleine
@mado.madeleine 10 месяцев назад
Super helpful! Thank you 🙏🏽
@jameslibby5215
@jameslibby5215 9 месяцев назад
Very very underrated channel
@benc7910
@benc7910 5 месяцев назад
Underrated, two Rs
@jameslibby5215
@jameslibby5215 5 месяцев назад
@@benc7910 thank ya sir
@mohajeramir
@mohajeramir 2 месяца назад
Excellent
@nikbivation
@nikbivation 10 месяцев назад
thank you for this!
@edvinbeqari7551
@edvinbeqari7551 5 месяцев назад
What is your reward function for the pong game? I did a similar pong game and I couldn't get it to learn.
@ireoluwaTH
@ireoluwaTH 10 месяцев назад
Thank you!!!
@BlueBirdgg
@BlueBirdgg 9 месяцев назад
Can you playlist each one of your topics plz? I wanted to post on Twitter(X) your video topics but could only post a single video at a time. Great content by the way. Ty very much. Your perspective on some topics helped me a lot to get a more intuitive understanding.
@g5min
@g5min 9 месяцев назад
Good idea! Here's one on generative AI: ru-vid.com/group/PLWfDJ5nla8UoR8P7AGqVw7ZPjXajUFLMo Here's one on reinforcement learning ru-vid.com/group/PLWfDJ5nla8UoexEaLqVMw7q3Ft0vRYscL Here's one on LLMs + text-to-image ru-vid.com/group/PLWfDJ5nla8UoG2mvvHs_OS0asAKC5HJeu
@BlueBirdgg
@BlueBirdgg 9 месяцев назад
@@g5min Ty!
@solveigberling1662
@solveigberling1662 3 месяца назад
That was dope
@kniv0gaffel
@kniv0gaffel 7 месяцев назад
Brilliant
@bombur9007
@bombur9007 2 месяца назад
how many layers should such network have
@mineq4967
@mineq4967 2 месяца назад
but by what number do you change the weights like you never told us
@maxim_ml
@maxim_ml Месяц назад
that was good
@axe863
@axe863 7 месяцев назад
Simple Reinforcement learning is extremely dangerous in certain nonstationary environments 😅
@nischalyou
@nischalyou 9 месяцев назад
whats the name of this video game ?
@mind6861
@mind6861 3 дня назад
Can we have the code for this
@gaydemaupassant6263
@gaydemaupassant6263 3 дня назад
Pls o want the code plsss
@herikaniugu
@herikaniugu 8 месяцев назад
Imagine using reinforcement learning in quantitative finance 😊
@FRANKONATOR123
@FRANKONATOR123 9 месяцев назад
Can you share the source code for this project
@g5min
@g5min 9 месяцев назад
You can follow the link to the Karpathy site at the end of the video, repeated here: karpathy.github.io/2016/05/31/rl/
@macratak
@macratak 10 месяцев назад
ah yes, reinforcement learning. a fundamental computer graphics technology
@g5min
@g5min 10 месяцев назад
I think that character/game-AI is pretty central to graphics
@pw7225
@pw7225 10 месяцев назад
Why so negative?
@revimfadli4666
@revimfadli4666 10 месяцев назад
​@@g5minespecially AI image generation or processing nowadays
Далее
Reinforcement Learning:  AlphaGo
8:14
Просмотров 10 тыс.
Just try to use a cool gadget 😍
00:33
Просмотров 63 млн
The Most Important Algorithm in Machine Learning
40:08
Просмотров 264 тыс.
Large Language Models from scratch
8:25
Просмотров 336 тыс.
An introduction to Reinforcement Learning
16:27
Просмотров 642 тыс.
Reinforcement Learning, by the Book
18:19
Просмотров 76 тыс.
AI Learns to Walk (deep reinforcement learning)
8:40
Why Does Diffusion Work Better than Auto-Regression?
20:18
Just try to use a cool gadget 😍
00:33
Просмотров 63 млн