Great to see that you’re still making improvements! Have you considered trying the NEAT algorithm? It’s genetic and I’ve had some pretty cool results with it. It may be too simple for Mario Kart though. Keep up the videos!
A few years back, before I got into Reinforcement Learning, I actually started my AI journey doing genetic algorithms, NEAT included! In fact, my first ever AI project was using NEAT for Flappy Bird. Sadly I doubt it would perform too well on Rainbow Road (especially with items and CPUs), but I could be wrong, would love for someone to try it
I do think I could get it to play every track, it would just take sooo long to train! Maybe if I get some time where I can't be trying new stuff I'll leave it running and see what I can do!
Man, right at the end, watching it hop around that banana in its way... that was probably the biggest testament to how much it had learned. Your reward structure didn't even include rewards for faster times, just getting to the next checkpoint. Very cool.
Just a small suggestion: Next time you show off a different inferior algorithm, maybe don't use that black box filter over it. I could barely see the actual gameplay of the inferior AI because the black box filter combined with youtube's video compression made it a chunky mess to look at. For a better filter, maybe you could use a CRT filter instead?
@@thegamesguy2263 It would either have to learn it on its own (which has a very low chance of happening accidentally), or he'd have to change the reward system to force it to take that specific route, but that might break it
Great video, love your stuff. The people who think up novel approaches (or even iterative improvements to existing approaches) to AI are absolute geniuses, but applying existing ones to video games is something I always want more of. I also think it helps that you use Mario Kart Wii, because so many people have good memories with that game. Better than something obscure where I can't gauge how proficient a player is from footage.
Ah, that isn't to diminish your videos on other games. I just can't personally estimate how good the ai is with something I have very little experience with. Still great videos though!
Yeah, coming up with new AI techniques that actually work is a nightmare; I learned that constantly during my PhD haha. I do enjoy applying them to new situations though, it's always interesting to see how AI techniques can be used in different ways. Yeah, Mario Kart Wii really has the nostalgia factor for many people
This made me think of an idea: an AI able to finish the race could be implemented in the base game and improve gradually relative to the player's skill. If the AI beats the player, don't save what it learned that race. This way the AI and the player stay close to each other in skill.
humans tend to learn much faster than AIs (to an extent, AIs sometimes get ahead later on), so this would result in great boredom for many players waiting for the AI to catch up with them. I think it would be better to have a series of AIs which are all pre-trained, then players can choose which AI they want to play against - e.g. you could have the AI after training for different amounts of time (1 day, 2, 3, a week etc) so they have a range of differently skilled AI opponents to play against. Basically like current CPU opponent difficulties, but with an AI that hopefully makes the difficulty of the hardest CPUs higher.
I know this would be very (very) difficult to program and get working but I just want to say that [hypothetically] I think it would be really cool to see the AI do a shortcut or skip. Love your videos! Keep up the amazing work 👍
Heyy, I appreciate your work! Great job, maybe you can create a tutorial on how you did it? Maybe start a little tournament where people can submit their own AIs that then compete against each other? Wouldn't that be fun?
What needs to be done is to allow the AI more control over the inputs it can use - aka, standard turning, drifting with the three different levels of steering input, etc. That will allow the AI to be significantly more precise and dramatically improve its time.
Have you thought about analog controls for the AI? As in, let the AI control the polar (or Cartesian?) coordinates of the joystick input. If so, why did you decide against it?
I've thought about it, but haven't really looked into doing it. The only real reason is that I specialise in value-based Reinforcement Learning, which doesn't really deal with analog controls. I do think it's an interesting idea though
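(A possible middle ground, sketched below under my own assumptions, not anything the video confirms: value-based methods need a discrete action set, so the analog stick could be discretized into a fixed grid of steering angles and magnitudes. All names and bucket choices here are hypothetical.)

```python
# Hypothetical sketch: discretize an analog stick so a value-based agent
# (DQN-style) can pick from a finite action set. The bucket counts below
# are made up for illustration.
import itertools

STEER_ANGLES = [-1.0, -0.5, 0.0, 0.5, 1.0]   # full left .. full right
MAGNITUDES = [0.5, 1.0]                       # half tilt, full tilt
DRIFT = [False, True]                         # drift button held?

# Enumerate every combination into one flat discrete action space.
ACTIONS = list(itertools.product(STEER_ANGLES, MAGNITUDES, DRIFT))

def action_to_stick(action_index):
    """Map a discrete action index back to a (stick_x, drift) controller state."""
    angle, magnitude, drift = ACTIONS[action_index]
    return angle * magnitude, drift

print(len(ACTIONS))  # 5 * 2 * 2 = 20 discrete actions
```

The trade-off is resolution versus action-space size: finer buckets give smoother steering but make the Q-value head wider and slower to learn.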
Let this learn for a really long time, and when AI is more developed in the future, use this as a benchmark: give a newer AI the same number of hours (or fewer) and see if it reaches the same result
Thanks! I looked a while back into getting it to play online. I haven't made much progress in that area, but it's still something that will always be a goal of mine! I've got some more interesting stuff in the works with Mario Kart, so who knows what'll end up happening...
As a human who time trials Mario Kart Wii, I'd love to see an AI attempt it. Although I'm not sure it would do too well due to the sheer complexity and lack of strat knowledge, I would love to be proven wrong. Unless you use some kind of method to help the AI learn from time trial footage, then maybe I could see it happening
I'm relatively new to studying RL so please correct me if I'm wrong, but instead of training two agents independently and taking whichever one does better overall, can't you combine the results of the two agents to make one better agent? By this I mean for each state, we evaluate the state-action values for each agent-action combination, and we choose whichever agent's action yields the higher state-action value. This may account for one agent being better at one part of the track while the other agent is better at another part of the track. Great work by the way!
Yes, that is possible, and I guess it would be an "ensemble" network. It's hard to say exactly how much better it would get, as RL agents can overestimate values, so only taking the highest values may not always lead to better performance, though it likely would. Thanks!
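The ensemble idea being discussed can be sketched in a few lines. This is a toy illustration under my own assumptions, with stand-in dicts instead of real trained networks: for each state, query both agents' action values and act with whichever agent's best action has the higher predicted value.

```python
# Hypothetical sketch of the two-agent Q-value ensemble from the comment
# above. q_a / q_b map state -> list of per-action value estimates.

def ensemble_action(state, q_a, q_b):
    """Act with whichever agent predicts the higher best-action value here."""
    values_a, values_b = q_a[state], q_b[state]
    best_a = max(range(len(values_a)), key=values_a.__getitem__)
    best_b = max(range(len(values_b)), key=values_b.__getitem__)
    if values_a[best_a] >= values_b[best_b]:
        return best_a, values_a[best_a]
    return best_b, values_b[best_b]

# Toy data: agent A is better on the hairpin, agent B on the straight.
q_a = {"hairpin": [0.9, 0.2], "straight": [0.3, 0.4]}
q_b = {"hairpin": [0.5, 0.6], "straight": [0.1, 0.8]}

print(ensemble_action("hairpin", q_a, q_b))   # (0, 0.9) -- agent A wins
print(ensemble_action("straight", q_a, q_b))  # (1, 0.8) -- agent B wins
```

Note the caveat from the reply: because Q-learning tends to overestimate, always trusting the larger value can systematically favour whichever agent is more over-optimistic, not whichever is actually better.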
Seeing you have two computers to use for training AI now, I can see why the quality of your videos has gone up! I just hope your electricity bill hasn’t gone up too high with it 😅 Thanks for the great vid as always!
I've got a question: you said that your AI was training for 60 consecutive hours. Are those "real time" hours? And if so, isn't there a way to artificially speed up the process without altering the validity and reliability of the results?
Thanks! My code is not yet publicly available (though later down the line I do plan on open-sourcing it). If you're looking to get into this stuff, look at Reinforcement Learning Gym/Gymnasium; there are lots of easier problems to get into and play around with if you're just starting out
is the end goal to generalize against any track vanilla or custom against real world players? I wanna see this thing rip through CTGP worldwides one day.
I know you use Mario Kart Wii bc it's very popular and likeable (attracts views and such). But if you ever develop such an impressive AI that the Mario Kart community cannot ever take a shit again without thinking about your AI, would you consider applying the AI knowledge to an A Button Challenge?
I'm curious how much better the AI would perform with more RAM watches as NN inputs, for all the important data that TASers use. I bet it would be scary fast, though not the same as reading the screen pixels.
It's something I'm looking into at the moment! Deciding what exactly to include is a little difficult though, since the RAM is way too large to just include it all. I'll be doing a video at some point going into that!
Hi Tango! I'm curious, from your point of view and considering the AI already programmed in NPCs and such: would it be possible to train a brand new neural network to play Elden Ring, for example? I know shooter games have botting already, but for the RPG counterpart, is it reasonable to think AI can eventually learn these layered tasks? I imagine devising the reward system and all the input/output would be complex, but beyond that, I'd be curious to see how the intelligence adapts in a more open environment.
These types of tasks have typically been challenging for AI, since most AI heavily relies on there being a very set, direct task. Some AI models, however (Dreamer v3 comes to mind), have been able to adapt to these types of tasks. In that paper the AI was actually able to find diamonds in Minecraft! This works by the AI having its own "intrinsic" reward to guide it, though this often comes at the expense of needing a LOT of training time
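The general shape of "intrinsic + extrinsic reward" can be shown with a deliberately simple stand-in: a count-based novelty bonus that shrinks as a state is revisited. (Dreamer v3 itself uses a learned world model, not this; the sketch and all names are my own illustration.)

```python
# Hypothetical sketch: add a novelty bonus to the environment's reward so
# the agent is nudged toward states it hasn't seen much.
from collections import Counter
from math import sqrt

visit_counts = Counter()

def total_reward(state, extrinsic, bonus_scale=1.0):
    """Extrinsic reward plus an intrinsic bonus that decays with revisits."""
    visit_counts[state] += 1
    intrinsic = bonus_scale / sqrt(visit_counts[state])
    return extrinsic + intrinsic

print(total_reward("cave", 0.0))  # first visit: 0.0 extrinsic + 1.0 bonus
print(total_reward("cave", 0.0))  # second visit: bonus shrinks to ~0.707
```

Even with zero extrinsic reward, the agent still gets signal from exploring, which is exactly why such methods can make progress on open-ended tasks but also why they burn so much training time.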
Truth be told, I didn't even set the speed, I just let the AI run as fast as it wants. If I do something like time trials where the speed is really important, I'll definitely show it at normal speed though
There's a much better option than making the visual input black and white: make it all bright colors - round each pixel value to a pure hue, with maximum saturation and medium lightness. And if it's very close to black, white, or medium grey, round the pixel to those colors. This would remove a huge amount of noise and give the AI the information that the game designers thought was important enough to give a color. It would also give the AI very clear shapes with sharp edges. Forcing the AI to recognize thousands of complex grainy shapes that might differ from each other by only a few pixels sounds inefficient. Of course it had trouble with the banana peel.
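The suggested preprocessing can be sketched per pixel. This is my own hypothetical interpretation of the comment (the saturation threshold and grey buckets are made-up numbers), not anything from the video's actual pipeline:

```python
# Hypothetical sketch: snap each 0-255 RGB pixel to a small flat palette.
# Near-greyscale pixels become black / medium grey / white; everything
# else is pushed toward a pure, fully saturated hue.

def quantize_pixel(r, g, b):
    """Round one RGB pixel to a small palette of flat colors."""
    lo, hi = min(r, g, b), max(r, g, b)
    if hi - lo < 30:                      # low saturation -> greyscale bucket
        grey = (r + g + b) // 3
        if grey < 64:
            return (0, 0, 0)              # black
        if grey > 192:
            return (255, 255, 255)        # white
        return (128, 128, 128)            # medium grey
    # Otherwise saturate: dominant channel -> 255, the rest snap to 128 or 0.
    return tuple(255 if c == hi else (128 if c > (lo + hi) // 2 else 0)
                 for c in (r, g, b))

print(quantize_pixel(200, 40, 35))    # reddish pixel -> pure red (255, 0, 0)
print(quantize_pixel(120, 125, 118))  # near-grey pixel -> (128, 128, 128)
```

Applied over the whole frame this collapses compression grain into flat regions with sharp edges, which is the commenter's point about giving the network cleaner shapes to learn from.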
Imitation AI strategies are around, however I typically don't look into them since the result is basically guaranteed to be worse than the thing it's imitating
If I were to write a paper, it sadly wouldn't be on Mario Kart as I don't think that's a standard benchmark! The algorithm I used though, I might be writing a paper for quite soon!
you're one of the most cracked coders on this platform, but unless you can nail the storytelling aspect of these videos they won't get the views you deserve
I do wonder if the AI can learn certain top level player tricks, like using a mushroom at the right time before a Blue Shell impacts, and holding on to useful items to defend itself from random Red Shell attacks instead of using the item instantly as soon as it gets it. Time Trials would certainly be a lot more interesting to see though.
How has this not blown up!? Combining Mario Kart Wii and newbie AI... Having played this game since I was little, it amazes me to see the AI go from being as bad as I was when starting, to being sorta decent like me, to playing like the top players. Simply incredible!
@@aitango As you probably know, MvC2 is a 3 on 3 Tag Team fighting game with three different assist types for each character. It would be absolutely hilarious to see a team of the worst characters make it all the way to the final boss of arcade mode. AI-controlled Roll, Servbot, and Dan wiping the floor with Abyss would make me laugh until my sides hurt
What we have to ask ourselves is at what point companies will use AI to make game NPCs instead of handcrafting the behaviour. I bet for many games this would create a strong NPC far more easily than doing it manually
For this number of frames (around 50M), Rainbow is still pretty much the best out there, since there's been barely any other research since. Lots of big research labs like Google focus on either ultra sample-efficient (100k frames) or virtually unlimited frames (up to like 10B). Sadly there isn't much in between, though I am looking to publish the algorithm I'm using here at some point
This is a value-based method, not a policy gradient method, so it's quite different from PPO; it's much more closely related to DQN. I don't use a softmax; rather, the network's outputs are just the AI's value predictions (hence why they don't add up to one). Also, whilst I guess RNN is technically correct (residual neural network), I don't typically hear it called that, because RNN usually refers to recurrent neural networks, which are very different!
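The distinction in that reply can be shown numerically. A minimal sketch with made-up final-layer activations: a policy head (PPO-style) applies a softmax so outputs form a probability distribution, while a value head (DQN-style) emits raw per-action return estimates with no normalisation at all.

```python
# Hypothetical sketch: the same 3 raw network outputs read two ways.
from math import exp

raw_outputs = [2.0, 0.5, -1.0]   # final-layer activations for 3 actions

# Policy head: softmax -> probabilities over actions, summing to 1.
exps = [exp(x) for x in raw_outputs]
policy = [e / sum(exps) for e in exps]

# Value head: the raw outputs ARE the predicted Q-values; acting greedily
# just means taking the argmax, and the values need not sum to anything.
q_values = raw_outputs
greedy_action = max(range(len(q_values)), key=q_values.__getitem__)

print(round(sum(policy), 6))  # 1.0 -- probabilities sum to one
print(sum(q_values))          # 1.5 -- Q-values don't
print(greedy_action)          # 0  -- highest predicted value
```

This is exactly "hence why they don't add up to one": the numbers on screen in the video are value estimates, not action probabilities.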
After the AI completes the race once, how long until it's consistent at the course? Also, are you training the model with a single agent or is it multithreaded?
For most of my other AIs, after completing a single lap they almost immediately complete the entire track. This AI had some more trouble however, not due to the track, but just because with items, lots of crazy stuff happens over the course of 3 laps, so it has to learn to deal with all of that; it easily took another 20+ hours in this case. In short it's multithreaded, but if you want to know how it works, be sure to check out my last video, Evolution of my Mario Kart AI
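The multithreaded setup mentioned in that reply usually means several environment instances collecting experience in parallel into one shared buffer. A minimal sketch, assuming fake placeholder environments (the video doesn't describe the actual architecture, so every name here is hypothetical):

```python
# Hypothetical sketch: 4 worker threads each step their own stand-in
# environment and push transitions into one shared, thread-safe buffer.
import threading
import queue
import random

replay_buffer = queue.Queue()

def worker(env_id, steps):
    """Stand-in rollout loop: one emulator instance per thread."""
    state = 0
    for _ in range(steps):
        action = random.randint(0, 3)   # placeholder policy
        next_state = state + 1          # placeholder dynamics
        reward = 1.0                    # placeholder reward
        replay_buffer.put((env_id, state, action, reward, next_state))
        state = next_state

threads = [threading.Thread(target=worker, args=(i, 100)) for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(replay_buffer.qsize())  # 4 workers * 100 steps = 400 transitions
```

A single learner thread would then sample batches from the buffer, which is how parallel collection speeds up wall-clock training without changing the underlying algorithm.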
@@aitango Interesting, I've done a bit with NES games and the models have struggled with consistency. Thanks for the reply, I watched the last video, must have forgotten you mentioned it. Really been enjoying the videos, excited for more.