AI Plays Trackmania - Bloopers

Подписаться 27 тыс.

Просмотров 3,6 тыс.

50% 1

In this video, and AI is trained with reinforcement learning to accumulate speed and finish a map as fast as possible.
The AI learned a behavior where it turns around right before the finish line. This is not a one-off mistake, the AI repeatedly did similar things in back-to-back runs. Can you guess why ?

Опубликовано:

28 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 14

@krembananowy Год назад

Because it was rewarded for speed and not for finishing the track?

@linesight-rl Год назад

It was rewarded for both, but not enough for finishing. You are right 🙂

@Dream2503XD Год назад

thats so similar when i play the game myself

@pajcsilol Год назад

Great work tho! I saw the best run and the Wirtual video too. I would like to ask some questions. The AI's job is probably finishing the track as fast as it can, but can it actually "see" turns and the track, or you have to train it again and again if you put it on a new map? I am curious if you can train an AI which can finish any non complicated map at a decent time first time or no.

@cyb3r._. Год назад

it can actually "see" turns and the track, as it "sees"/uses 20 fps black and white "screenshots" (idk if they actually go and take screenshots or just have it play on 20 fps colorless or something)

@Dizastermaster. Год назад

It depends. Usually it works best with a top down camera cause 3d shit makes it a lot harder for AI to know what they're looking at. But some do and some don't. Depends on the model

@ArKantiK Год назад

Do you think this AI could play ZRT Trackmania cup circuits ? i'm wondering if it would be possible, but don't really know how to use it and have the time to try on my side.

@linesight-rl Год назад

Are those circuits available on Trackmania Nations Forever ?

@ArKantiK Год назад

@@linesight-rl I don't think so... but does the AI could work on TM² Stadium ?

@downpipe9935 2 месяца назад

@@ArKantiK well well well

@masonlyons2959 Год назад

What's the best way to contribute changes to the code? I've got a fork with some changes so far.

@linesight-rl Год назад

Thank you for your interest. This is a rapidly evolving 2-person project. At least in the short term, we prefer to keep it small. I'd be happy to have a look at changes you make, but will likely not merge your branch in our private repo. We will probably have a more open approach in the future and welcome contributions. How can we inform you when we decide to open our private repo and welcome contributions?

@masonlyons2959 Год назад

@@linesight-rl I've got notifications turned on for the channel so if a video is uploaded I'll see it. I'd suggest putting any open source notification there. As far as changes go I've done the following: Moved to logging instead of print Optimised variable access Made resuming last training easier Moved to automatic timestamp based filenames for saved data My to do is currently: Look into using mss instead of dxcam to see if it's quicker, from profiling it seems to screenshot creation is the longest part of the script. Introduce the surface and wall contact into the reward function to stop it trending towards ramming into the walls. Upon going onto the grass outside of the track ending the run and heavily punishing reware