Тёмный

AI Learns to Park - Deep Reinforcement Learning 

Samuel Arzt
Подписаться 50 тыс.
Просмотров 3 млн
50% 1

An AI learns to park a car in a parking lot in a 3D physics simulation. The simulation was implemented using Unity's ML-Agents framework (unity3d.com/machine-learning). The AI consists of a deep Neural Network with 3 hidden layers of 128 neurons each. It is trained with the Proximal Policy Optimization (PPO) algorithm, which is a Reinforcement Learning approach.
Basically, the input of the Neural Network are the readings of eight depth sensors, the car's current speed and position, as well as its relative position to the target. The outputs of the Neural Network are interpreted as engine force, braking force and turning force. These outputs can be seen at the top right corner of the zoomed out camera shots.
The AI starts off with random behaviour, i.e. the Neural Network is initialized with random weights. It then gradually learns to solve the task by reacting to environment feedback accordingly. The environment tells the AI whether it is doing good or bad with positive or negative reward signals.
In this project, the AI is rewarded with small positive signals for getting closer to the parking spot, which is outlined in red, and gets a larger reward when it actually reaches the parking spot and stops there. The final reward for reaching the parking spot is dependent on how parallel the car stops in relation to the actual parking position. If the car stops in a 90° angle to the actual parking direction for instance, the AI will only be rewarded a very small amount, relative to the amount it would get for stopping completely parallel to the actual direction.
The AI is penalized with a negative reward signal, when it either drives further away from the parking spot or if it crashes into any obstacles.
The training process shown in this video took about 23 hours on a computer with an i5 (7th or 8th gen) and a GTX 1070 with 100x simulation speed.
Subscribe for more content like this:
/ @samuelarzt
Follow me on Twitter for more frequent updates on my projects:
/ samuelarzt
Also check out my other videos related to this Project:
Two AI fight for the same Parking Spot:
• Two AI Fight for the s...
Neural Networks Explained in a Minute:
• Explained In A Minute:...
Cars learn to maneuver Parcour with Genetic Algorithm:
• Deep Learning Cars
Start Music: "Sunday" by Otis McDonald
Music from Bensound.com:
Timelapse Music: "The Elevator Bossa Nova"
Comedic Background: "Jazz Comedy"
Outro: "All That"
#ArtificialIntelligence #MachineLearning #ReinforcementLearning #AI #NeuralNetworks

Наука

Опубликовано:

 

22 авг 2019

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 3,1 тыс.   
@SamuelArzt
@SamuelArzt 4 года назад
Watch two AI Agents fight for the same parking spot: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-CqYKhbyHFtA.html
@TechMasterRus
@TechMasterRus 4 года назад
Can it park in other place after learning on this place?
@SamuelArzt
@SamuelArzt 4 года назад
@@TechMasterRus It is very unlikely to be able to generalize to other parking spots. In order to do so, one would have to at least also randomize the parking spot position during training.
@TechMasterRus
@TechMasterRus 4 года назад
@@SamuelArzt what data do you use to train AI? Global coords relative to some static point? If you use all objects in car's coodinate system, then mb it won't matter where to park... if it will be able to train like that at all.
@SamuelArzt
@SamuelArzt 4 года назад
@@TechMasterRus The inputs of the neural network are mentioned in the video description: "the readings of eight depth sensors, the cars current speed and position, as well as its relative position to the target" I think having the relative position of all obstacles would be somewhat cheaty. Don't get me wrong, this is far from being realistic, but I think that would be too far of a stretch. If you want to achieve an AI to park on any spot in the parking lot, it would be more feasible to simply train it on randomized parking spot positions. It will be better at generalizing to other parking spots then, even though that is likely to increase training time.
@void3790
@void3790 4 года назад
Hey, if your interested I’d love if I could email you some of my music to use as background music
@EricJCaraballoso
@EricJCaraballoso 3 года назад
This is the definition of "don't practice until you get it right, practice until you can't get it wrong"
@tenchi2075
@tenchi2075 3 года назад
That’s a really nice quote
@Jawsomest
@Jawsomest 2 года назад
. . . I rememebr a similar quote! "Hit or miss huh,- ok ok i won't.
@DarrinDarwinacious
@DarrinDarwinacious 2 года назад
Omg it’s the “dEfiNitiOn”?
@solarizedmonkeyman
@solarizedmonkeyman 2 года назад
@@tenchi2075 it's Daniel Willingham, it's from his theory of learning
@perero
@perero Год назад
@@DarrinDarwinacious mean
@rightyboywilson4911
@rightyboywilson4911 4 года назад
Actual footage of me trying to park after 2 years of driving
@calebsherman886
@calebsherman886 4 года назад
I too park in the same spot at the store
@PritishMishra
@PritishMishra 4 года назад
😂😂
@lilaxeree3621
@lilaxeree3621 4 года назад
*shoves truck out of the way so I can take 2 spots*
@DocTwoEight
@DocTwoEight 4 года назад
Just back it up dude, it's much simpler than trying to park front first
@JackRackam
@JackRackam 2 года назад
It looks like it's having to fight the voices in its head telling it to run full speed into those other cars
@loafbread529
@loafbread529 2 года назад
me when im infront of a tesla at a traffic jam
@whatamidoingwithmylife4108
@whatamidoingwithmylife4108 2 года назад
I didn't think I'd find you here my majesty. Nevermind, your supreme excellence graces this comment section.
@Chromwel-A
@Chromwel-A 3 года назад
This kinda makes me appreciates the NPC's driving in GTA V.
@TheHumanSystem
@TheHumanSystem 3 года назад
They are skilled, but remember that is an enclosed and well defined system. Still impressive, however.
@markmd9
@markmd9 3 года назад
In GTA they are not AI, they are killing machines
@PowJey
@PowJey 3 года назад
@@markmd9 in gta online they are crashers machines hahah
@roteschwert
@roteschwert 3 года назад
Yeah, in GTA they're just programmed to
@DraxTheDestroyer
@DraxTheDestroyer 3 года назад
You can't compare the two because gta v bots are not AI they are programmed
@distantforest2481
@distantforest2481 4 года назад
*Pushes through car to get to parking spot" "Boom! nailed it"
@BlakeMaloof
@BlakeMaloof 4 года назад
I like the strategy at 2:45, sometimes I wish I could do that when I park...
@ChrysusTV
@ChrysusTV 4 года назад
I was pretty curious about this attempt. I wonder if the AI decided that the negative reward for crashing into the vehicle was outweighed by the positive reward for parking. Then curious how the +/- rewards are defined, and if the AI continues to receive negative reward for the duration of the collision or if it is only penalized when it first collides with the vehicle.
@rcsibiu
@rcsibiu 4 года назад
Same strategy is applied at 5:01
@krishnamshinde5589
@krishnamshinde5589 3 года назад
Mr bean style hahaha
@iamcyancat7924
@iamcyancat7924 3 года назад
lmao
@tapist3482
@tapist3482 3 года назад
@@ChrysusTV I guess the AI would continuously receive negative rewards when in contact with other cars. Because from the attempt on 5:01 we can see it was basically the same strategy, but the AI learnt to kick the car away with repeated, high speed strikes instead of just pushing it.
@AidanWR
@AidanWR 3 года назад
Came for the car going around randomly, now I can go to bed happy
@luca83439
@luca83439 3 года назад
2:48 "Leonard that's my spot"
@MichaelRicksAherne
@MichaelRicksAherne 4 года назад
"What is my purpose?" "You park cars." "Oh god...."
@Cyian
@Cyian 4 года назад
Michael Ricks-Aherne yeah welcome to the club pal
@jamesfullecido9760
@jamesfullecido9760 4 года назад
Is this a rick and morty reference
@fractal_mind562
@fractal_mind562 4 года назад
Its referencing alot of things I guess, my first thought was the Ricky Gervais joke about dogs... Then rick and Morty... Could probably think of others
@sodiboo
@sodiboo 4 года назад
That’s a valet
@EastyUK
@EastyUK 4 года назад
@@fractal_mind562 I thought of red dwarf and the toaster.
@barraged999
@barraged999 4 года назад
“AI will take over humanity” 300k iterations later, the car still gets a parking ticket.
@TheHumanSystem
@TheHumanSystem 3 года назад
We will not require tickets in our robot future. I will pass this on to our human so he is aware of this issue. Thank you.
@alpacamale2909
@alpacamale2909 3 года назад
well considering it takes a human 18 years to do it...
@husk1c
@husk1c 3 года назад
@@alpacamale2909 18 years? Lmao HAHAHAHAHAHAHHA
@ninjacodertech
@ninjacodertech 2 года назад
@@alpacamale2909 16 but depends on the country
@RangerFPS
@RangerFPS 2 года назад
such an ignorant comment
@slippydouglas
@slippydouglas 2 года назад
I wonder how much the learning process would be affected by feeding in not just the current sensor input, but the last couple of seconds of sensor inout (so 2 seconds * 60 FPS * 8 sensors additional neural inputs), and maybe the last 2 seconds of the AI’s own output decisions as well. The network would be much bigger and training would take much longer, but I wonder if past input and decisions would feed forward in time to produce smoother results, preventing the AI from just going for a bad solution as often.
@Solips1sm
@Solips1sm Год назад
Indeed a great idea
@SkilledApple
@SkilledApple 3 года назад
4:17 okay, now its just showing off.
@priyankraval7461
@priyankraval7461 4 года назад
"It has 8 depth sensors" *breaks the pole* Is the sensor learning too??
@Azaelris
@Azaelris 4 года назад
Well in deep learning. You have reward and punishment. The ai over time realises hey hitting stuff is bad. But I understand ur joke lol
@priyankraval7461
@priyankraval7461 4 года назад
@@Azaelris yes, i know the AI is just a baby using the sensors as its eyes and learning not to collide, by colliding
@whyarurannin4918
@whyarurannin4918 4 года назад
@@priyankraval7461 But guys , what is parking for AI? I mean once the "car" Is in the right Place how does It understand It Is ? There Is some kind of condition like (if Car Is in a middle of White lines , It Is parked ?) And It tries all the random algoritms till It happen?
@svampebob007
@svampebob007 4 года назад
nah the AI is just very kinky, get that punishment you dity AI ;)
@whyarurannin4918
@whyarurannin4918 4 года назад
@Ceol Hmm interesting , thanks bro
@koelee
@koelee 4 года назад
What the- how did I never realise that my parking practice was being recorded??
@10054
@10054 3 года назад
LMFAO
@spiderinofiesta3341
@spiderinofiesta3341 2 года назад
Since the initial position of the car for every learning try was random, I imagine that the resulting model with 300k or so learning tries should be able to park on the remaining available parking slots. Checking if so, and if not why not would be great tbh. beautiful work, I feel more than lucky to be able to see such nice results presented in such an interesting manner!
@rainessandrai8240
@rainessandrai8240 3 года назад
I love it, when ai parked perfect to the wrong parking spot.
@AndreasNV
@AndreasNV 4 года назад
If there's an artificial intelligence apocalypse, we'll know that this guy left his computer on for too long.
@blueraspberrylemonade32
@blueraspberrylemonade32 4 года назад
Let's hope it at least remembers anything about us
@svampebob007
@svampebob007 4 года назад
oh the horror!, perfectly parallel parked cars!, parallel parked cars EVERYWHERE!!!
@acidtears
@acidtears 4 года назад
@@svampebob007 Literally the only apocalypse that will happen
@mrtkhosravi9399
@mrtkhosravi9399 4 года назад
🤣🤣🤣🤣🤣
@azul_aim
@azul_aim 3 года назад
well, to our disappointment this one could only park all cars eventually
@teraspeXt
@teraspeXt 4 года назад
2:49 my favorite attempt. " 'Scuse me, that's my parking space. Move over."
@Doom8810
@Doom8810 3 года назад
More penalties should of been used, ie, driving on the curb, and instant failure on crashing. Also a segmented training might of helped, with the car spawning closer to the space in the beginning, maybe even changing the spot to park after a success
@augustuscurnow
@augustuscurnow 2 года назад
The little speed boost it does whenever it is nearly there is so cute imo
@JakeLuden
@JakeLuden 4 года назад
“After 5k attempts” *oh shit he’s getting close* “After 15k attempts” *alright how long is this video now?*
@mayatechnologyinspired6679
@mayatechnologyinspired6679 4 года назад
Did you see when he pushed the gray car outta it's place and just pushed straight ahead
@whiskeyburns5230
@whiskeyburns5230 4 года назад
standard 10 mins
@jeremyloldrup7997
@jeremyloldrup7997 4 года назад
Yeah i thought the same and thought at 20k it would understand since it had felt the great feeling of parking, but I was very wrong xD
@lars38010
@lars38010 4 года назад
@@mayatechnologyinspired6679 The AI just thinks it's Mister Bean.
@Smileynator
@Smileynator 4 года назад
Multiple issues i spot: Sometimes the raycasts see nothing (it is plowing a car, yet raycast is green and max range), Raycasts are raycasts, i suggest boxcasts to be more realistic at least for the front side. The parking spot never moves, nor does the other cars/environment, so this does not generalize well at all either (if it learned this spot, a different spot will likely not work at all). Hitting an obstacle or invalid driving location (sidewalk) should instantly end the simulation of that episode to iterate faster on problematic attempts.
@okayokay1979
@okayokay1979 4 года назад
Niceeee. Where did you learn reinforcement learning!?
@Smileynator
@Smileynator 4 года назад
@@okayokay1979 Self study, why?
@SamuelArzt
@SamuelArzt 4 года назад
Haha, you are the first commentor to notice that raycast bug. I didn't bother to fix it in the end, since the AI is not supposed to reach that edge case (of constantly touching an obstacle) anyway and gets a large penalty for doing so. Boxcasts / Spherecasts / Colliders were already suggested numerous times by other people. The same goes for randomizing car and parking spot position. I also did think of these things, they are of course very obvious, but decided to stick with this setup since I didn't want to increase training time any further as it was already very close to the limit I was comfortable with.
@Smileynator
@Smileynator 4 года назад
@@SamuelArzt Fair, i did not bother to read all other comments, those were just my thoughts on it. Well done otherwise, nice experiment. For entertainment i would give the other cars a car alarm when hit. As for the training time, i did suggest ways to shorten it. Often you can cut off any simulation the moment it becomes "unrecoverable" like in this case the car hitting any obstacle would be a definite fine on the drivers end. Also, if you take a bit more reading into account, you can actually make machine learning run headless, or even multiple instances side by side, and have the "max physics timestep" setting figure out how fast it can run those instances at any given time. It might be a bit choppy at times, but as long as the max timestep is small enough it should not affect learning, and it will sort of auto self-balance to max load for your device. For presentation you can make a backup of the .nn file every X iterations yourself, so you can playback a few runs in "learning run 10k, 15k, etc." and for the rest train in headless or otherwise visually heavily downscaled instances for training speed increases.
@SamuelArzt
@SamuelArzt 4 года назад
@@Smileynator Car alarms would be very fun indeed ^^ As far as the headless, sped up training and saving of the model is concerned, that is exactly what I did (and with exactly, I mean impressively almost scaringly accurate).
@WiFi-qj5kr
@WiFi-qj5kr 2 года назад
It's kinda like watching a bee under the affect of alcohol trying to fly into it's hive
@kuokkae
@kuokkae 3 года назад
This is pretty much a real life depiction of my data analyst job experience
@Earlgreey1
@Earlgreey1 4 года назад
Now i wanna see multiple ai on trying to park at the same time.
@SamuelArzt
@SamuelArzt 4 года назад
Haha, yeah that's a great idea. I have also thought of that. I will try that and at least post a tweet about it. If it is worth more than a gif, I will definitely upload a video of it ;)
@sirbughunter
@sirbughunter 4 года назад
@@SamuelArzt you did it!! 🤩
@SamuelArzt
@SamuelArzt 4 года назад
@@sirbughunter Haha, yeah! Turned out real fun!
@_Killkor
@_Killkor 4 года назад
110109999188880 attempts later...
@norpriest521
@norpriest521 4 года назад
@@SamuelArzt This video proves that why we shouldn't rely on AI and stick to manual
@theyoshi202
@theyoshi202 4 года назад
I’d love to see it have its parking spot selected randomly from the 6 open ones, then it’ll have to learn more generally how to go to any location rather than a specific one
@ursibar7837
@ursibar7837 2 года назад
It could be the next step.
@Jcakedafurry
@Jcakedafurry 2 года назад
I feel like a more realistic set starting point, with one random empty one would provide a better real AI, as normally your car will not just spawn in a parking spot and you need to go to another parking spot in the same lot
@eduardo.chaves
@eduardo.chaves Год назад
And change how many and where the other cars are parked
@koresaliva
@koresaliva Год назад
Everything everyone mentioned here as well as obstacles in the way
@timurpozharskiy4880
@timurpozharskiy4880 Год назад
I think the place of the parking spot is involved in the calculations through the relative distance with the car and the relative angle of rotation
@lyingpancake95
@lyingpancake95 2 года назад
This makes me feel a lot better about when I learned to parallel park.
@FlyNAA
@FlyNAA 3 года назад
At around 15-20K it started looking like one of those Russian security cam videos
@thefrub
@thefrub 4 года назад
If (going to crash) Then (don't) boom, programmed
@yousof8546
@yousof8546 4 года назад
wow youre very smart im sure youve coded many programs
@EmrecanOksum
@EmrecanOksum 4 года назад
The problem here is how to make it know that it would crash.
@Paddy.121
@Paddy.121 4 года назад
@@EmrecanOksum If (rangesensorDistance >= 30cm) { println "Bruh, stahp the car!!11" Motor = 0 DriveAround = 0 } else { println "Wtf am i doin?" Motor = 1 DriveAround = 1 }:
@orange3567
@orange3567 4 года назад
EMR3 C4N 0K5UM You do realize that it’s just a joke right....we all known that it isn’t that simple
@yousof8546
@yousof8546 4 года назад
Thanoose im pretty sure he was joking aswell
@theMixElite
@theMixElite 3 года назад
Still better than me
@shu3684
@shu3684 3 года назад
Maly ten jutub
@NithinJune
@NithinJune 3 года назад
to be fair you probably didn't practice for 200,000 times strait
@MrGodofcar
@MrGodofcar 3 года назад
That's what she said.
@HonzikKubesovic
@HonzikKubesovic 3 года назад
In the future, probably better than all
@lunasplayer1313
@lunasplayer1313 3 года назад
That’s concerning
@nclsDesign
@nclsDesign 3 года назад
When you compare the beginning to the end it's just awesome!
@wires.
@wires. 3 года назад
Car AI: **Tries to park** Samuel: "I haven't programmed that path yet"
@ethancheng1595
@ethancheng1595 4 года назад
I love how at 20003 it got the first taste of that sweet parking reward, then it needed a fix so badly it rammed the white pickup out of the way.
@sig_nessuno
@sig_nessuno 4 года назад
Even my mother can do it in just 15k attempts!
@JacksBestGaming
@JacksBestGaming 4 года назад
yes
@axetampro8823
@axetampro8823 4 года назад
ikr
@bettyw2309
@bettyw2309 4 года назад
Haha
@jfp589
@jfp589 4 года назад
lmao
@elias1nefavel
@elias1nefavel 4 года назад
wtf, is she blind ?
@SpaceMissile
@SpaceMissile 2 года назад
"After 10000 attempts..." _Already drives better than my sister._
@metalhead2550
@metalhead2550 3 года назад
It would be interesting to see how it would manage with random cars in space placement and a random space to park in. My guess is that in its current form it would really struggle, but I wonder what the difference in number of iterations would be :)
@SamuelArzt
@SamuelArzt 3 года назад
I did something like that in my parallel parking video. Some updates in the learning algorithm also resulted in much faster training time (5h in contrast to 23h). ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-MlFZjLkEIEw.html
@aureliocoeto765
@aureliocoeto765 3 года назад
Me with 4 overdue assignments at 3 am *I N T E R E S T I N G*
@luxocracy5205
@luxocracy5205 4 года назад
Deep reinforcement learning sounds more like you hit the AI on the hand with a ruler every time it does something wrong
@triassicraptor4375
@triassicraptor4375 4 года назад
That's essentially what it is.
@aleksandersuur9475
@aleksandersuur9475 3 года назад
Well, the copies of AI that fail go straight to /dev/null so that's really more like "executions will continue until the morale improves"
@Mansare94
@Mansare94 3 года назад
@@aleksandersuur9475 you'd think it'd keep it to know what not to do, like fire is hot, don't touch.
@TheFlashPod
@TheFlashPod 3 года назад
Agree, seems like bad code
@daydreamthewaggon6159
@daydreamthewaggon6159 3 года назад
AI: harder daddy
@ussiz
@ussiz 3 года назад
Feels like watching a kid growing up x1000 faster Succeeding, failing, still moving no matter what This is so cute 🥰
@bluethenoob
@bluethenoob Год назад
my friend: "im not THAT bad at driving" also her:
@4AneR
@4AneR 4 года назад
Overfitting? Should've given different parking slots
@lisandrob626
@lisandrob626 4 года назад
was thinking the same, overfitted AF, but good exercise doe
@TavyOctav14
@TavyOctav14 4 года назад
Eventually more parking slots, and different configurations for already parked vehicles
@calebsherman886
@calebsherman886 4 года назад
yep
@Xizilqou
@Xizilqou 3 года назад
I love how the AI tries to park in all the different spots.
@renakunisaki
@renakunisaki 2 года назад
I like how it almost successfully parked in the wrong spot a few times.
@elmowilcox
@elmowilcox 2 года назад
I love these AI training videos. It’s hilarious contrast seeing how childish and bad at it the computer is, contrasted with knowing how brilliant everything that goes into making this happen is.
@nikostalk5730
@nikostalk5730 2 года назад
Actually it is a bullshit database, not an A.I . as it must be. No map-building, no fails -avoiding scenario, just a "10k attempt" thing. Why did you love this? It's not a child, nor human brain. It is just code, it's not improvising, just doing the exact same errors, again and again, it's NOT TRAINING. It's about YOU tryiing to BELIEVE in this, lol!
@elmowilcox
@elmowilcox 2 года назад
@@nikostalk5730 you good?
@booseloose8992
@booseloose8992 4 года назад
"AI is penalized for crashing" 10:07 spawns AI on top of car AI: One day you will pay, humans...
@TheSuperBugsBunny
@TheSuperBugsBunny 4 года назад
Exactly my thoughts 😂
@lululipes4382
@lululipes4382 4 года назад
You forgot a comma there, buddy
@temujincalidius
@temujincalidius 4 года назад
One day AI will have its own rights. It will control internet based business and offer you VPN, hosting, web design, programming, hacking... And have its own cryptocurrency wallets to trade with and buy what it likes.
@tylerhoblet1458
@tylerhoblet1458 4 года назад
Yup I sure will pay humans in the future
@traskforge
@traskforge 4 года назад
"why are we still here, just to suffer?"
@unintentionallydramatic
@unintentionallydramatic 4 года назад
So you're saying all those 3 AM Walmart parkers are really AI'?
@john7180
@john7180 4 года назад
No they're just drunk af
@christopherconklin281
@christopherconklin281 4 года назад
Or high af
@anandsuralkar2947
@anandsuralkar2947 4 года назад
Lol
@tokiWren
@tokiWren 2 года назад
a couple things i noticed: - occasionally the car will crash into things in front of it because of the one-dimensional lines of sight. - car appears to prefer to turn left. because of practice perhaps? very cool stuff!
@doom4232
@doom4232 2 года назад
Love how the music switches to the lofi hiphop music when the AI parks (near) flawlessly and switches back when it doesn't lol
@nipunkothare
@nipunkothare 4 года назад
after 20000th iteration: yeah I got it! I finally got it! 20001st iterations: rams into truck again
@keco185
@keco185 4 года назад
The parking lot configuration should’ve been randomly generated each trial
@wojtek9395
@wojtek9395 4 года назад
You are totally right, I am wondering how would it affect learning speed, I noticed that based on the evolution the harder the environment you are in is the quicker u learn.
@mattiasmarder
@mattiasmarder 4 года назад
Right. Isn't there an overwhelming chance that it learned to navigate in just this particular configuration (including target location and other cars). Could the DNN have learned the map of the environment as captured by the depth sensors?
@keco185
@keco185 4 года назад
Mattias Marder what it looked like it learned was a map of vectors. Basically at pos x,y it learned to travel at angle theta. There’s a non 0 chance that you could get rid of the “sensors” and it would still find the parking spot
@ycjoelin000
@ycjoelin000 4 года назад
I think it's easier to train the agent to park at one spot and do transfer learning on multiple spots.
@SamuelArzt
@SamuelArzt 4 года назад
@@keco185 I tried removing the sensors and it never learned to park in the time of 5M steps (i.e. the length equal to the experiment shown in the video).
@ryandobbs9270
@ryandobbs9270 2 года назад
The car: "oh yea, you like that baby?", The spot: "It's not even in yet 🙄".
@mathiasensimon
@mathiasensimon Год назад
This is actually a simulation of me trying to park
@GamerRusith
@GamerRusith 4 года назад
AI: Learns to park User: cool, (proceeds to change parking location)
@johnuferbach9166
@johnuferbach9166 4 года назад
if it was properly trained that shouldn't be a problem
@juzarbharmal9998
@juzarbharmal9998 4 года назад
As this is AI changing parking location should not be problem after some more training. As so much as training in complex parking lot should be also faster for this neural network.
@ItsCurtisLife
@ItsCurtisLife 4 года назад
AI typically takes around 1 million try’s to get something and they can try 1 million times in like a few minutes... this is insane
@dylanisaac1017
@dylanisaac1017 3 года назад
This simulation has a physics engine, your a couple hours off. It takes way longer than minutes.
@dylanisaac1017
@dylanisaac1017 3 года назад
Maybe one day, but by that time an AI that can use logic and doesn’t require over 100k attempts will already be here.
@TheHumanSystem
@TheHumanSystem 3 года назад
I only have a Raspberry Pi brain. I wish my algorithms were this complex.
@dylanisaac1017
@dylanisaac1017 3 года назад
@DeBobbelear 8769 ?
@dylanisaac1017
@dylanisaac1017 3 года назад
@DeBobbelear 8769 I don’t get how you don’t understand what I was saying
@jalanmcrae
@jalanmcrae 3 года назад
It’s literally like putting a 5 year old under the wheel 🤣
@joshgiesbrecht
@joshgiesbrecht 3 года назад
Where I live, even the first couple simulations were pretty damn accurate.
@robbietaylor4114
@robbietaylor4114 4 года назад
I don't know why, but the first 5 minutes of the AI failing and just crashing everywhere was absolutely hilarious to me
@yowbroslwe3287
@yowbroslwe3287 2 года назад
Lol same😄
@8bitjunkie173
@8bitjunkie173 4 года назад
3:22 When you try to back your car into a spot for the first time
@SomeGuyNamedDezecra
@SomeGuyNamedDezecra Год назад
Me normally: doing stuff almost flawlessly Me whenever others are around: *THIS VIDEO*
@baloneykid890
@baloneykid890 2 года назад
That instructor has a lot of patience
@vanderkarl3927
@vanderkarl3927 4 года назад
I wonder how well a HUMAN would perform with those 10 inputs!
@SamuelArzt
@SamuelArzt 4 года назад
Yes, I totally agree. Imagine being given a steering wheel, two pedals and a screen that simply showed 10 floating point numbers, with all information about your task being whether you are currently doing good or bad. Even then a human would have more prior knowledge of the task than a machine, since we will immediately associate the steering wheel and pedals with driving a car.
@anatoliy333
@anatoliy333 4 года назад
@@SamuelArzt this text must be in video description.
@anatoliy333
@anatoliy333 4 года назад
@@SamuelArzt have you such simularot for human? I'd like to test my bio neurons.
@SamuelArzt
@SamuelArzt 4 года назад
@@waltersereinig3587 The neural network has 14 inputs, if I remember correctly and one of the is the current speed. The 3D visualization was indeed mainly for visual / entertainment purposes rather than there being an actual need for 3D environment. The sensors were deliberately kept to minimal complexity since the application of an AI like this would rather be for a game than real life, thus anything more complex would have a considerable impact on performance.
@christianmartinez2179
@christianmartinez2179 2 года назад
People: AI will take over the world!!!! AI after thousands of (essentially) bruteforce attempts: 1:20
@asagoodfriend
@asagoodfriend 2 года назад
2:49 The AI went full Mr. Bean on that parked car!😂
@albertcucuteanu4702
@albertcucuteanu4702 4 года назад
Conclusion: AI won't control the world yet
@EnDeRBeaT
@EnDeRBeaT 4 года назад
It's learning to control
@shahbazansari7514
@shahbazansari7514 3 года назад
You must watch this: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-Lu56xVlZ40M.html They are not only gonna control but will break all the limitations which humans have. 😂😂😂
@intensestare5027
@intensestare5027 3 года назад
AI basically are learning at a faster pace than the entire human race, evolution that usually takes millions of years only takes a couple days for AI
@shindousan
@shindousan 3 года назад
Worse: it already does to some extent and its ability is on par with the AI driver of this video.
@Mansare94
@Mansare94 3 года назад
@@intensestare5027 The big difference is that this ai in particular had an imaginary world to destroy, it's not clear if it'd transition to real world so smoothly.
@sinatraforeign
@sinatraforeign 4 года назад
2:55 "Bro what the fuck are you doing" "It's still learning....."
@cat_omaha732
@cat_omaha732 2 года назад
it got there in the end but man that first 90% it was just ruthless lol smacking cars and poles left and right
@felipelopez3722
@felipelopez3722 3 года назад
I hope one day i can fully understand stuff like this one and I can actually do it. Excellent video thanks for sharing
@chris-ke4jo
@chris-ke4jo 4 года назад
Leaked footage from Tesla R&D center
@HackerFlavio
@HackerFlavio 4 года назад
chris imagen
@Dimension2364
@Dimension2364 4 года назад
I found it very interesting to see how AI learns if you give it no information exept for the penalty and the reward. Nothing the AI does is intuitive. For example, a human would have understood very quickly that it is essential do drive a curve. The AI does not see such patterns as „curves“. It only sees the numbers. It only sees if it has gotten closer and if it has made mistakes.
@valentins.5010
@valentins.5010 2 года назад
That isn’t a fair perspective…. You or humans in general just see these patterns because you/they have already learned since they are born. Maybe not car parking but relatable things. The conditions are different!
@JiggyJones0
@JiggyJones0 Год назад
​@@valentins.5010 Whats not fair about it? What op said was just factual.
@crayfishcastle5553
@crayfishcastle5553 3 года назад
What a generous driving instructor. I only got 1 chance.
@RR18475
@RR18475 2 года назад
I don't know why, but it's really cute to watch this AI car learn. As if watching a toddler learn how to park a car. Almost as if watching a real human being!
@adityarawat6433
@adityarawat6433 4 года назад
I was expecting a drift parking after so much attempts and watching AI Hide & Seel video.
@farisakmal2722
@farisakmal2722 4 года назад
I love how the cars bounce every time the program is initialized.
@bigbangbomberman
@bigbangbomberman 2 года назад
Imagine you would drive like this in driving school 😂
@TimeWisely
@TimeWisely 2 года назад
It's so satisfying seeing a naive robot make a clean park with no mistakes. Good model!
@bachana1231
@bachana1231 4 года назад
And we learn from AI to NEVER GIVE UP!!!! 😎👌
@Od4n
@Od4n 4 года назад
I appreciate the choice of music.
@sharks1349
@sharks1349 Год назад
I'd like to see the enviornment change with every attempt. Despite spawning at a random position it might still have overfit. Amazing work regardless, keep it up.
@marektruper4242
@marektruper4242 3 года назад
Tested it with my 3 year old nephew... he mastered it after 5 attempts... We have a long way to go ;)
@realmasterkush
@realmasterkush 4 года назад
The biggest flaw seems to be the 8 'depth' sensor having huge blind spots, what if instead of 8 raycasts, you only had a single rotating one (maybe a few degree per frame). Let the AI know about the angle of the sensor relative to the car and the hit distance. That way it would probably be more 'aware' of his surroundings quicker.
@warlax5658
@warlax5658 4 года назад
So this is an example of something called “overfitting”. Though the “over” part assumes generalization, which isn’t the goal of this application, but I still think it’s a good talking point. The AI is learning a lot. But notice that it’s in the same map, with the same layout. So it’s mostly learning how to park in this EXACT situation. How does the AI learn to turn left? Well it could be something like if the front left depth sensor suddenly cuts in half, and all the other ones are at max length, then it knows to turn left. (I.e. driving past the first light post) A.I. Can figure out all kinds of tricks like that. However, it doesn’t have predefined cognition (humans do. Your brain is largely pre wired. You could argue that, through your life, your brain really only changes marginally with respect to it’s original construction) the predefined structures in our brain focus on learning generalizable ideas. This is hard to do with AI. A human could run this simulation in this scenario, and probably, after getting it right, be able to park in another situation. That’s because a) the sensors we use to perceive the world are super sophisticated, and kinda come pre-installed with the capacity to understand their input b) we focus on learning things like “stay on the road” not “turn left when front left depth sensor gets short suddenly”. The key difference between generalizable intelligence, and an overfitted system. If you shifted the starting point 3 feet to the left. It’s entirely possible that the AI would completely fail. However, if you put the car in a box, it would, after training through thousands of iterations, probably find a way to glitch the car so it pops up in the air and lands outside of the box. Systems like these are bad at generalized stuff, but really REALLY good at solving specific and difficult tasks. There’s a famous study where researchers wanted to make some NN for controlling walking. It had 6 legs, and they set a rule that it got more points for spending as little time on its feet as possible. After seeing that it learned to go 100ft with 0 seconds of foot contact, they found out that it learned to flip over and walk on its shoulders. What a time to be alive Edit: I didn’t really watch the video. The car changes it’s starting point, so it’s kinda generalized. But the map stays consistent, so it’s not really. If you moved an item around it’d probably freak out. Though idk, I’m mostly talking about AI for you the reader, not about the video.
@warlax5658
@warlax5658 4 года назад
@@TyrianTheDutiful why?
@svampebob007
@svampebob007 4 года назад
"A human could run this simulation in this scenario, and probably, after getting it right, be able to park in another situation. That’s because...." hold up, I'm going to show this post to my Ex-GF so she can explain what exactly is wrong with the car today. edit:Ex
@gilberttheregular8553
@gilberttheregular8553 4 года назад
This is just a demo, he's not going to change humanity by making self-parking cars.
@monkeywings
@monkeywings 4 года назад
@@svampebob007 oof
@ahmadayazamin3313
@ahmadayazamin3313 3 года назад
i like that two minute papers reference
@Neuro_nActivation
@Neuro_nActivation Год назад
Imagine if it drifted to the parking spot in the most badass way
@duerrkeno
@duerrkeno 2 года назад
So interesting to see that due to how it's trained, the AI will never attempt to park in reverse, even in a situation where that would be easier
@renakunisaki
@renakunisaki 2 года назад
Yeah, since the reward function includes the angle, I guess they didn't account for parking backward, and that would actually be considered the worst angle (180 degrees off from target).
@OscarMaris
@OscarMaris 4 года назад
On attempt 310013, it spawned on top of the truck!
@CrimsonTuber
@CrimsonTuber 4 года назад
No wonder it took so long to learn, even rngesus is messing it up.
@SamuelArzt
@SamuelArzt 4 года назад
Haha, yeah the spawn was simply defined as an area, thus there was a very small chance for that to happen. I thought that was quite fun and left it in the video. Nice spot, though 😉
@ahmadnasri9635
@ahmadnasri9635 4 года назад
@@SamuelArzt ai: one day i will get you human
@amybackstage5164
@amybackstage5164 4 года назад
10:08
@Kate-Tea
@Kate-Tea 4 года назад
Wow, this ai learnt to park perfectly in half the amount of attempts a normal person takes :O
@hydraslair4723
@hydraslair4723 Год назад
After 310k attempts, now the AI can park perfectly in that one spot for that specific parking lot with that specific arrangements of cars!
@xinyuActor
@xinyuActor 3 года назад
Imagine you're the driving coach and this car is your student
@truerobloxian8108
@truerobloxian8108 4 года назад
2:40 when someone takes your paid parking spot
@RPGAuthority
@RPGAuthority 3 года назад
Its like watching my mom trying to park a car
@FPOAK
@FPOAK 2 года назад
After 3 million attempts it learned the urban inefficiency of surface parking lots and switched to a bicycle
@Asome3333e1
@Asome3333e1 4 года назад
At the end: he a little confused, but he got the spirit.
@willnutter1194
@willnutter1194 4 года назад
As an amateur ML programmer, this is inspiring! I may do a project as a result of watching this.
@OneTwoFive0
@OneTwoFive0 2 года назад
I could imagine some random guy in apartment watching this guy trying to park over 300,000 times
@youdontcare2312
@youdontcare2312 2 года назад
imagine if you left this until the 1mill iterations, the Ai would be making world record times for each random position
@Boomber123
@Boomber123 4 года назад
You should also track how well they're following the road as well as the driving direction.
@Mansare94
@Mansare94 3 года назад
The issue you might have with having only one parking space is that you could be training it just to park in that spot rather than any spot. Which kinda renders the ai somewhat useless in 99% of parking situations.
@iysz1872
@iysz1872 2 года назад
Like a baby doing it's first steps.. the dev must be so proud..
@jindrichpilecek3991
@jindrichpilecek3991 9 месяцев назад
Medium-aged man: “If you can't do it well immediately, don't try it at all.”
@MbkeliteVlog
@MbkeliteVlog 3 года назад
I swear this is better than an actual let's play.
Далее
AI Olympics (multi-agent reinforcement learning)
11:13
AI Learns to Play Tag (and breaks the game)
10:29
Просмотров 2,3 млн
Finger Heart - Fancy Refill (Inside Out Animation)
00:30
AI Learns to Run Faster than Usain Bolt | World Record
10:22
AI Learns To Dominate A Virtual Market
12:13
Просмотров 255 тыс.
Simulating the Evolution of Rock, Paper, Scissors
15:00
DeepMind’s AI Trained For 5 Years... But Why?
9:36
Просмотров 389 тыс.
Training an unbeatable AI in Trackmania
20:41
Просмотров 13 млн
AI Learns to Walk (deep reinforcement learning)
8:40
The moment we stopped understanding AI [AlexNet]
17:38
Просмотров 776 тыс.
AI Learns To Swing Like Spiderman
15:29
Просмотров 6 млн
АЙФОН Г0ВН0
0:54
Просмотров 1,3 млн
Новодельный ноутбук Pocket386
1:16:17