How Robots Learned to Act | The History of Reinforcement Learning

Art of the Problem

4.9K views

Published: 16 Sep 2024

Comments: 77

@Tubeytime 7 days ago

Nothing makes me as excited about history, computer science, or mathematics as this channel. The lightbulbs were exploding with this one.

@ArtOfTheProblem 7 days ago

Thrilled to hear it. Curious, did you see the video the first time it came out?

@Tubeytime 7 days ago

@ArtOfTheProblem I didn't watch it for sure, but I can't remember if I skipped the notification and forgot to come back. Some days you're in a different mood 😅

@ArtOfTheProblem 7 days ago

@Tubeytime thanks, I asked because I was messing with settings when I first posted it. Stay tuned! It'll be a few months before the next one.

@mohamedjama7089 2 days ago

@ArtOfTheProblem it came up on my feed like 40 minutes after it was uploaded.

@KyleRichter23 4 days ago

PLEASE keep these videos coming, this series is awesome

@ArtOfTheProblem 4 days ago

Thank you, I'm confused why this video never got any momentum. But I will make more

@puneetkumarsingh1484 2 days ago

The video actually delves into all the relevant terms we hear about in reinforcement learning but have to go through an hour- or two-hour-long lecture to understand. I will keep coming back to this video again and again while learning the topic ❤😊

@ArtOfTheProblem 1 day ago

yes! this was my hope. the field is a bit of a mess in terms of how things are explained

@ivanczarapanau9417 5 days ago

This is the youtube we deserve

@ArtOfTheProblem 5 days ago

thanks :)

@sirkiz1181 4 days ago

Very informative, I loved the ending note of using successful architectures in completely different domains. Especially interesting with the recent release of OpenAI’s o1

@ArtOfTheProblem 4 days ago

yes! I was thinking of a follow-up

@sirkiz1181 4 days ago

@ArtOfTheProblem very excited to see it! As someone who loves to learn about AI but has no formal education in it (at least yet), these videos are my lifeblood and only make me more energetic to learn more

@ArtOfTheProblem 4 days ago

@sirkiz1181 beautiful! love to hear this. btw I have a patreon for those who want to support future videos: www.patreon.com/artoftheproblem/

@ArtOfTheProblem 8 days ago

I posted this video a month ago, but due to incorrect settings, it wasn't shared. I'm reposting it now for another chance to fly! I'd greatly appreciate if you could like, comment, or share it this week. FULL AI series: ru-vid.com/group/PLbg3ZX2pWlgKV8K6bFJr5dhM7oOClExUJ

@paedrufernando2351 8 days ago

I am an avid fan of your video and style of presenting. Keep posting more

@sumertuncay 7 days ago

fingers crossed for this one then! I'd enjoy watching it again anyway

@hemasunchu 6 days ago

"Robots learned to act through reinforcement learning by trial and error, mimicking how animals learn. Over time, algorithms evolved, allowing robots to make better decisions and adapt to complex environments."

@HoboGardenerBen 5 days ago

Really cool video, thanks. I don't understand most of it, but the implications are clear, robots with general intelligence. That's huge. I haven't been jazzed about the language models, they are being used to take away creative work from people. But intelligent robots could take away the shitty work, leaving people to do creative things. I suspect that our inherent human fuckery will prevent it from being a utopia. We aren't built for endless pleasure so we will produce strife if the situation doesn't present us with any. I'm thinking of Anna Lembke with all that, how our brain tries to maintain an even mix of pain and pleasure as part of homeostasis. Perhaps that emotional need can be expressed completely in the digital space so that the war instinct stops making fuckery in real life. The momentum of the nation-state global structure we've had for a while will be hard to overcome, it directly incentivizes conflict.

@SHAINON117 3 days ago

Amazing, it's as if all these videos are set up like a documentary of what's happening, so in future years it's like the most accurately recorded development in history ❤❤❤❤❤❤❤

@ArtOfTheProblem 3 days ago

@SHAINON117 thank you!

@dadsonworldwide3238 5 days ago

Thanks, this was very helpful and informative. If you're middle-aged like me, then you grew up around lots of leftover early vacuum-tube and transistor antique tech, very sensitive electrodynamical systems if shorted or flawed. You've played with ham radios or bunny ears and distortion in amps, or even a loosely screwed-in light bulb on a shaky table making its plasticity strobe like gates or switches opening and closing. That's something I think younger generations are lacking, though after all their screen time they may have more understanding on this level of the algorithm, code, or step-by-step functions. I retired from precision machining, where early G-code CNC was very widespread before the internet really took off, but there wasn't much to be learned about its history.

@ArtOfTheProblem 5 days ago

thank you for sharing, I appreciate it. I agree... I worked with my kids on circuit projects as much as I could to expose them to this

@ks0ni 7 days ago

Awesome! I'd watched the original one when it was posted. Commenting for the algorithm.

@ArtOfTheProblem 7 days ago

appreciate it

@aidankennedy6656 7 days ago

One of the best high-level introductions to Reinforcement Learning I've found. Superbly done.

@ArtOfTheProblem 7 days ago

Thank you, I worked really hard on this one as I found high-level intros were indeed lacking

@coreycoddington8132 6 days ago

Nice work!!! Great information!!!

@ArtOfTheProblem 6 days ago

appreciate it corey

@duke8925 3 days ago

Thank you, very interesting video!

@notu483 6 days ago

Thanks for the video! ❤😊

@TropicalCoder 6 days ago

Thanks for alerting me to this video. I have now subscribed so I won't miss any future videos. There was so much food for thought presented, and it filled many blanks in my knowledge.

@ArtOfTheProblem 6 days ago

thrilled to hear it!

@electronicwoe 7 days ago

Superb work as always! I ordered some empty matchboxes and beads to try out MENACE for myself!

@ArtOfTheProblem 7 days ago

:) super cool demo

@thegeneralist7527 7 days ago

Extremely well explained, especially the original research. That's worth a sub.

@ArtOfTheProblem 7 days ago

thank you, appreciate you saying this. I'm curious how you found the video since it's so new?

@thegeneralist7527 7 days ago

@ArtOfTheProblem It came up as a suggestion on my YouTube home page. What caught my attention was the phrase "learned to act", and you explained that concisely. Best of luck and keep up the great work.

@ArtOfTheProblem 7 days ago

@thegeneralist7527 appreciate this feedback, I struggled for a while to find a title that works. thanks!

@Iknowwereyousleep289 7 days ago

This was the best thing ever. It makes me think about reinforcement and art and voting

@ArtOfTheProblem 7 days ago

ooo, tell me what it makes you think about voting and art??

@Iknowwereyousleep289 7 days ago

The easiest example I can think of: it’s slightly dystopian, but the cart and pole kind of reminds me of Twitch streamers trying whatever to get engagement. So it makes me wonder if you can RL engagement, and maybe ratings

@ArtOfTheProblem 7 days ago

@Iknowwereyousleep289 exactly, RL plays a role in the social media algorithms.

@Iknowwereyousleep289 6 days ago

@ArtOfTheProblem But without the human in between, just directly optimizing for engagement-generating content. I’m quite frightened of that, and intrigued. Too much power

@trepanobiopsja 7 days ago

great video

@ArtOfTheProblem 7 days ago

thank you!

@vinniepeterss 7 days ago

❤❤

@sureshkeerthi9820 7 days ago

Loved it

@karmasource 6 days ago

Do you have a link to the studies? I'm particularly interested in the literature on domain randomization. ty and nice video :)

@ArtOfTheProblem 6 days ago

thank you, here is a big one: arxiv.org/abs/1703.06907

@easlern 7 days ago

Love this series, it’s really helped me better understand AI. Appreciate you sharing it with the world!

@joaoguerreiro9403 7 days ago

Really appreciate the computer science content, great video ❤️

@ArtOfTheProblem 7 days ago

thanks for the feedback stay tuned!

@molugusatyapriya2 6 days ago

AI robots have the potential to revolutionize industries, but their development must be guided by responsible principles.

@mostlynotworking4112 7 days ago

❤

@greenstonegecko 7 days ago

Stunningly interesting video! Would love to know more about Action Pathways. Seems like it could be an entire video on its own.

@ArtOfTheProblem 7 days ago

@greenstonegecko yes, this will come next, action pathways = behaviour

@puneetkumarsingh1484 2 days ago

Perhaps, if you have understood the topic itself in depth, you could make a series out of reinforcement learning just like your cryptography one 😅

@filippobardi2663 7 days ago

Good video; one piece of feedback: the background music is really bad, I think you should change it. The rest is perfect

@neurodivr 7 days ago

I found it so interesting to watch I didn't even hear the music! I had to go back and check it out.

@user-yu3tp8cl7q 7 days ago

I thoroughly enjoyed the video, hope the algorithm picks it up this time!

@ArtOfTheProblem 7 days ago

I thank you for this, much appreciated

@sureshkeerthi9820 7 days ago

Wow

@ArtOfTheProblem 7 days ago

thank you for sharing it

@vinniepeterss 7 days ago

but the original is still up though?

@ArtOfTheProblem 7 days ago

Yes it is, I left the link up

@sifodyas_ 7 days ago

Awesome as always.

@ArtOfTheProblem 7 days ago

curious, had you seen this one already?

@thunderwh 7 days ago

Loving the video but the music is giving me a headache. It's like trying to think near a railroad crossing signal that just won't stop ringing. I don't mind music in videos but I'm a firm believer that it should only be there when I'm not trying to listen to anyone talking.

@uvectoru1327 7 days ago

Super interesting stuff! But maybe don't encourage people to apply their expertise devising ways for wealthy capital owners to more effectively extract wealth from the working class? Machine learning has a tremendous list of potential beneficial applications, so using it to solidify existing wealth and power hierarchies seems counter-productive to me.

@ArtOfTheProblem 7 days ago

@uvectoru1327 this is a great question. I’m actually working on a series on economics right now. It will look at all sides of that question, such as the efficient market hypothesis, access to markets, and value itself (power and risk)

@uvectoru1327 7 days ago

@ArtOfTheProblem I'll definitely check it out! I'd be really interested if there are any applications being worked on in the leftist economic sphere (e.g. cybernetic algorithms applied to resource distribution, like Project Cybersyn from the '70s).