We train an Artificial Intelligence with Reinforcement Learning to play the game Trackmania Nations Forever, and post videos showcasing the progressive improvement of our A.I. This channel is a collaboration between pb4 (github.com/pb4git) and Agade (github.com/Agade09).
You can contact us at the address pb4videos (at) gmail.com, via our github, or on Discord (server: discord.gg/tD4rarRYpj and channel: discord.com/channels/847108820479770686/1150816026028675133).
I heard there is a method where GPT trains another AI. I don't know if it's possible to do in this case, but it would be fun to see an AI train an AI from scratch.
It actually is already doing a small one in the E03 run, at this part of the run: ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-GFOTtl4LbBY.html But the AI will have to be improved before it can find this technique more broadly, on more maps.
How about implementing the ability for the AI to reset the run, so it can learn to recognize the difficult parts of maps, like a human trying again and learning? I mean the AI learning to store information about the last run and interpret it, not learning by modifying the model. Give it a maximum number of resets, like 10 times or so, otherwise it will kill the overall computer performance. After the 10 runs, it gets scored and punished or rewarded based on the best of the ten runs.
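The best-of-N reset scheme suggested above fits in a few lines. A toy sketch: `run_episode` is a hypothetical callable standing in for one attempt on the map, not part of the actual project.

```python
def best_of_n(run_episode, n_resets=10):
    """Let the agent retry the same map up to n_resets times and score
    it on its best attempt. run_episode(attempt) returns one attempt's
    score (e.g. the negated race time, so higher is better)."""
    return max(run_episode(attempt) for attempt in range(n_resets))

# Toy usage: three pretend attempts cycling as the ten resets play out.
times = [-52.3, -48.1, -50.0]  # negated race times in seconds
score = best_of_n(lambda i: times[i % len(times)])  # -> -48.1
```

The max over attempts is exactly the "rewarded by the best out of ten runs" rule; the cap on resets bounds the extra compute per scoring round.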
What if you split the AI into two parts: a Pathfinder that tries finding different routes and gives the general "strategy" of the map, while the Driver then uses those paths to figure out how much they can be optimized, as well as performing the run.
Well, mostly it is A and less I. The potential is massive. Still, I hope this is just a showcase and never ever finds its way into TMNF, which is where human skill is required. We already have much too much AI on YouTube with stupid voiceovers. I will soon start a human league whose content is AI-free.
Is there any way to input past runs from the best human players? This would theoretically allow the AI to mimic the best run (reducing the initial learning curve) and then optimize it further, just like how humans learn from their own mistakes; we also learn from others' mistakes and successes. IDK that much about this stuff, but meh, just an idea 🤷🏽♂️
Would it not be possible to insert a player-driven run into the learning pool of the AI to introduce shortcuts, or something like a wallbang? Or would this corrupt the learning pool, with the AI trying to go off-track and wallbang in completely irrelevant places everywhere?
It is possible in principle because the algorithm we are using is "off-policy": it can learn by watching someone else play. But we haven't tried. There are practical roadblocks: e.g. currently the algorithm plays at 20 Hz, whereas the human we would try to learn from does not follow this format of inputs. And some open questions, like: how many replays would we need to cause meaningful learning? Would 1 human replay added to the pool be enough?
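The reason off-policy learning allows this is that the learner samples from a pool of stored transitions without caring who produced them. A minimal sketch of that idea, with hypothetical names (this is not the project's actual code):

```python
import random
from collections import deque

class ReplayBuffer:
    """Minimal off-policy replay buffer. Because learning is off-policy,
    transitions recorded from a human replay can be mixed in with the
    agent's own experience and sampled identically."""
    def __init__(self, capacity=100_000):
        self.data = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.data.append((state, action, reward, next_state, done))

    def add_human_replay(self, transitions):
        # A real pipeline would first resample the human's inputs onto
        # the agent's 20 Hz action grid; here we assume that's done.
        for t in transitions:
            self.add(*t)

    def sample(self, batch_size):
        # Uniform sampling; agent and human transitions are mixed.
        return random.sample(list(self.data), batch_size)
```

The open question in the reply above maps directly onto this sketch: one human replay contributes only a few thousand transitions to a buffer of a hundred thousand, so its sampling weight may be too small to cause meaningful learning without some form of prioritization.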
Did you try training a model on the graphical output from the game? I feel like giving the AI access to the game engine grants it an unfair advantage over human players, which makes it more reasonable for it to compete with TAS. Still great work, though.
AI vs (non-AI) TAS would be fun to try. How *smart* is the AI if we remove slow human reflexes? Alternatively, you could force the AI to have lag and time-jitter to simulate human limitations. Can it still be smarter than Wirtual?
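The lag-and-jitter idea above can be simulated by feeding the agent's chosen actions through a delay line measured in control ticks (at 20 Hz, one tick = 50 ms). A toy sketch with assumed parameters, not part of the actual project:

```python
import random

class DelayedActions:
    """Delay each chosen action by a base latency plus random jitter,
    emitting a no-op until the first delayed action becomes due."""
    def __init__(self, base_delay=4, jitter=0, noop=0):
        self.base, self.jitter, self.noop = base_delay, jitter, noop
        self.pending = {}  # release tick -> action
        self.tick = 0

    def step(self, action):
        # Schedule the new action for a future tick, with random jitter.
        release = self.tick + self.base + random.randint(0, self.jitter)
        # If jitter lands two actions on one tick, keep the earlier one.
        self.pending.setdefault(release, action)
        # Emit whatever action (if any) is due this tick.
        out = self.pending.pop(self.tick, self.noop)
        self.tick += 1
        return out

# With a fixed 2-tick delay and no jitter, actions come out shifted:
da = DelayedActions(base_delay=2)
outs = [da.step(a) for a in (1, 2, 3, 4)]  # [0, 0, 1, 2]
```

A base delay of 4 ticks (200 ms) would be in the ballpark of human reaction time, but the right numbers for a fair handicap are an open question.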
Hot take: the results the AI produces are not viable, as the AI is basically just doing a TAS run; therefore the AI won't ever be better than a human player, since the AI can't play the game without making it a TAS run. While a player's TAS run might be about brute-forcing each minute situation in a constant back and forth until they reach peak results, the AI is driving every single iteration of that minute advancement all the way over the finishing line. Humans kind of cut out the wasted time while TASing due to human constraints; the AI doesn't need to, as it doesn't have those constraints. But in the end, it is still a TAS run the AI is doing, just with WAY more time wasted ^^
As someone who creates maps but isn't a stellar driver, one of the issues I run into is building around incorrect racing lines. I may build turns that will be taken a different way by a better player, resulting in far more speed than expected on the exit. This causes the flow to be completely wrong and the maps to break. At these times I often wish I had access to more strong players to test my map so I could build around these issues. With this AI, I could theoretically have it run on the map instead and learn from how it drives my maps. I could then make improvements and increase my map quality. Amazing work, and excited for the future.
First, I know I complained a bit, but I did like and subscribe, so I hope we hear more about how your AI does. I can't drive those races, though I did try a few; for me, coming in like 10000th place or worse is an achievement :) Later.
OK, very interesting. I have to ask: why are you trailing so much? Even in the last race, basically right from the start you are already losing before anything has happened; it seems you must be missing something to always come out behind at the start and then have to speed up. Also, when you know a shortcut exists and that using it on lap two is bad because you will lose time, why can't you teach the AI to use it on lap three, when it would be faster? It must be faster, since you ended up behind. I get how wheels on the track accelerate the car and wheels in the air do nothing, but still: in the race you won, you had to come from behind, when you could have been a touch slower but farther down the track and had a bigger lead on the human. I know this is teaching it how to measure best performance, but I bet you could teach it more. Maybe it needs to try jumps that it will fail; then it could rank the jump as undoable at speed x so it doesn't attempt it, and when it has more speed it considers it again, learns it's still slower, and marks the jump at that speed as also a bad choice; then on lap three, when it's going even faster, it tries again and sees the better result. I get that this might be hard in your algorithm or web of choices, but you seem smarter than me, and I would just like to see the AI really kick human asses even harder, though I do have to dislike the use of the AI you're using :)
Wait, you're saying an AI can't learn as fast as a human? That seems like maybe your code is just fucked in the head. Can't the AI review the track design, then guesstimate the best speed it could drive through the area, then figure out how good a bounce would be at, let's say, 20 points around the curve, try those 20 spots, and see which one is best, then try 20 spots around the known current best and repeat? If none are better, it has the best one it can know. I mean, how does the human player hit every wall, ram/crash, and drive out at speed for every corner? Obviously an AI should be able to try and test these 24 hours a day and find the route?
Wonder how hard it would be to train it to get vertical setups by itself and start noseboosting all over the place. PS I think the setup is far harder to train than the noseboosts themselves
In the video you give rewards to the agent for following the line of the course. How do you initialize the line? Is it just defined by the track itself, or do you hardcode a line for every track yourself? If you do hardcode it, how? Then how do you assign the reward? Is it just based on distance to the line? Because wouldn't that mean that if it moves backward, it still gets a reward for being close to the line? How do you make sure it moves forward with your reward function?
It uses existing replay files: it extracts the positions the player had from the replay file and creates the virtual checkpoints from them. It's not rewarded based on how close it is to the line; it only checks whether it is still within a certain reach of the line.
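The checkpoint scheme described in the reply above answers the backward-driving concern: reward comes from passing new checkpoints, not from proximity to the line. A rough sketch of the idea, with illustrative numbers (10 m spacing, 15 m reach) that are assumptions, not the project's actual parameters:

```python
import math

def make_checkpoints(replay_positions, spacing=10.0):
    """Thin recorded (x, y, z) positions from a reference replay into
    virtual checkpoints roughly `spacing` meters apart."""
    cps = [replay_positions[0]]
    for p in replay_positions[1:]:
        if math.dist(p, cps[-1]) >= spacing:
            cps.append(p)
    return cps

def progress_reward(car_pos, cps, last_idx, reach=15.0):
    """Reward only forward progress: +1 for each new checkpoint whose
    reach the car has entered. Hovering near the line, or driving
    backward past old checkpoints, earns nothing."""
    idx = last_idx
    while idx + 1 < len(cps) and math.dist(car_pos, cps[idx + 1]) <= reach:
        idx += 1
    return idx - last_idx, idx
```

Because `last_idx` only ever increases, a car reversing along the line collects no further reward, which is exactly the forward-progress guarantee the question asks about.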
What if the progress line were also part of the AI network? Since the distance to the line also has an impact on the reward, maybe giving the AI the ability to modify the line would make even better optimized racing lines possible.