J Jonah Jellynose suspects Spiderman is an AI. Captain Blubber is arrested twice. A phone screen is smashed. What is happening?
0:00 Intro
0:30 Basics
1:30 States, Actions and Rewards
2:45 Discount Factor
4:09 Neural Networks
5:59 PPO
7:03 Policy Gradient
9:54 Clamping the Policy
10:34 What the AI Learned
13:05 Just Swinging
White paper on how to create an AI like this from scratch:
docs.google.com/document/d/1F...
Download this AI: github.com/b2developer/Spider...
Discord: / discord
Reddit: / b2studios
Twitch: / b2studios
Useful Links:
huggingface.co/blog/deep-rl-p...
fse.studenttheses.ub.rug.nl/2...
iclr-blog-track.github.io/202...
Jun 7, 2024