CURIOSITY (Artificial Intelligence)

Подписаться 10 тыс.

Просмотров 5 тыс.

50% 1

In this video I discuss the problem of sparse reward enviroments and how OpenAI managed to solve it through the use of curiosity. The method is called Reinforcement Learning with Prediction-Based Rewards and it incentivizes visiting unfamiliar states by measuring how hard it is to predict the output of a fixed random neural network on visited states.
Support me on Patreon: www.patreon.com/user?u=25285137
Keep in touch: / sebastianschuc7
More on the topic: openai.com/blog/reinforcement...

Наука

Опубликовано:

26 июл 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 19

@quite_dumb_profile_picture7687 2 года назад

Machine learning is such a new thing to me and it feels like it has no boundaries in terms of getting knowledge from it.

@pulsarhappy7514 11 месяцев назад

I thought I left a comment on this a year ago but apparently not. This is truly amazing stuff and is very well explained.

@ZalvadorZali 4 года назад

This video is amazing! What a wonderful job explaining a curiosity centric neural network!

@bigedwerd 4 года назад

I would love to see more on curiosity.

@sharannagarajan4089 9 месяцев назад

Amazing video invoking my curiosity

@srikanthganta7626 4 года назад

Just discovered you. Loved every video

@Bishop_t Год назад

haha that AI watching TV made my day 😅

@wilhelminarandtke9380 2 года назад

This is such a great explanation of getting AI to be curious.

@ONDANOTA 4 года назад

YES, I have a request. ODE machine learning

@jonajo261 2 года назад

POUAHHHHHHHHHHHHHHHHHHHHHHH mind blowing !

@PeterBarnes2 4 года назад

I do like this, but I also liked the demonstrations of AIs.

@mangolinolino 4 года назад

Amazing work!!! I would love an AI video on a concept applied to financial markets!

@SebastianSchuchmannAI 4 года назад

Thank you! :)

@flurishart612 2 года назад

Is it possible to use ML-agents with financial markets? I know it's possible to connect to APIs with Unity, but don't know if this is a terribly inefficient approach to this or something