In this video, an AI is trained with reinforcement learning to accumulate speed and finish a map as fast as possible.
The AI learned a behavior where it turns around right before the finish line. This is not a one-off mistake: the AI repeatedly did similar things in back-to-back runs. Can you guess why?
Sep 28, 2024