Let's teach our AI how to get from point A to point B of a Frozen Lake environment in the most efficient way possible using dynamic programming. This is considered reinforcement learning and we'll trying two popular techniques (policy iteration and value iteration). We'll use OpenAI's Gym environment and pure python to do this.
Code for this video:
github.com/llSourcell/navigat...
Please Subscribe! And like. And comment. That's what keeps me going.
Want more inspiration & education? Connect with me:
Twitter: / sirajraval
Facebook: / sirajology
More learning resources:
ocw.mit.edu/courses/aeronauti...
uhaweb.hartford.edu/compsci/cc...
/ deep-reinforcement-lea...
www.cs.cmu.edu/afs/cs/project...
cs.stanford.edu/people/karpath...
www.quora.com/How-is-policy-i...
www0.cs.ucl.ac.uk/staff/d.silv...
Join us in the Wizards Slack channel:
wizards.herokuapp.com/
And please support me on Patreon:
www.patreon.com/user?u=3191693 Instagram: / sirajraval Instagram: / sirajraval
Signup for my newsletter for exciting updates in the field of AI:
goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available):
www.wagergpt.co
17 июл 2024