Just a simple implementation of a neural net for evolving a car to finish the track. The neural network itself doesn't evolve in shape, only in its neuron connection weights. Made in Unity. Music: "Righteous" by Silent Partner
To be honest, the code itself is not very complex; it's only a few hundred lines of code. I pretty much followed the Cokemons path. There are plenty of amazing tutorials on this matter; I started with a channel called "The Coding Train".
I find it highly intriguing that it seems to be right-dominant, and I can't help but wonder: if it were given virtual hands, would it be right-handed? Which sparks a thought: the first part of the track had higher error potential on the left side than it did on the right, so the neural net had to "focus" on the challenge of the left-hand side and the ease of the right; perhaps _this_ contributed to its right dominance.
I agree with Pablo Pazos; it would be cool if you could give an overview of what is really happening to the neural network algorithm during a test, and what the changes are for each generation.
This is not a NEAT implementation, so the network structure doesn't evolve, just the weights. After each population is done I use a genetic algorithm to generate a new population: I use ranking-based selection combined with stochastic universal sampling to select the mating pool. Then each new genome gets its weights from either parent A or parent B; I tried crossing over entire layers, but this didn't give me good results. Then each weight has a chance to mutate, meaning it will get a new random value within the weight range.
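The selection and variation steps described above can be sketched in Python. This is a minimal illustration only; the function names, mutation rate, and weight range are my own assumptions, not taken from the actual Unity project:

```python
import random

def sus_select(population, fitnesses, n_parents):
    """Rank-based selection with stochastic universal sampling:
    individuals are weighted by rank (1 = worst), then n_parents
    evenly spaced pointers sweep the cumulative weight line."""
    ranked = [p for _, p in sorted(zip(fitnesses, population), key=lambda t: t[0])]
    weights = list(range(1, len(ranked) + 1))
    step = sum(weights) / n_parents
    start = random.uniform(0, step)
    cumulative, running = [], 0
    for w in weights:
        running += w
        cumulative.append(running)
    selected = []
    for k in range(n_parents):
        pointer = start + k * step
        for i, c in enumerate(cumulative):
            if pointer <= c:
                selected.append(ranked[i])
                break
    return selected

def crossover(parent_a, parent_b):
    # Each weight is inherited wholesale from parent A or parent B.
    return [wa if random.random() < 0.5 else wb
            for wa, wb in zip(parent_a, parent_b)]

def mutate(genome, rate=0.05, lo=-1.0, hi=1.0):
    # Each weight has a small chance to be replaced by a fresh
    # random value within the allowed weight range.
    return [random.uniform(lo, hi) if random.random() < rate else w
            for w in genome]
```

A single pass of SUS uses one random starting point and equally spaced pointers, which keeps the spread of selected parents closer to their expected counts than repeated roulette-wheel spins.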
Oh yeah, I could see in the video that the network structure doesn't evolve. Actually I had a question about the hidden nodes: how did you choose the number of those "hidden" nodes? Does it depend on something, or did you just choose a number without any clear reason? This is very interesting anyway, and the way your NN masters the track at generation 33 is very impressive. Thank you for your explanation.
Well, from what I learned the number of hidden nodes is chosen arbitrarily; that's why NEAT is much better, because it skips the limitations of this prerequisite. I also learned somewhere that in general it's good to have at least inputs + 1.
Very nice. How did you go about using a genetic algorithm? Based on one of your previous comments, I see that you only passed the weights, meaning an offspring got 100% of each weight from either A or B? What about your biases? Were these static, or initialized with every new vehicle? Thanks for sharing your research! :D
The offspring is created through node-level crossover, so each neuron gets all of its incoming weights from one of the parents, chosen at random. In my implementation the bias is an additional node in the previous layer, and its weights are subject to crossover and mutation just like any other weight. The bias value itself is constant, though.
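That node-level crossover could be sketched like this. It's a hedged illustration; the genome layout, where each neuron is represented by the list of its incoming weights with the bias weight included, is my assumption:

```python
import random

def node_crossover(parent_a, parent_b):
    """Node-level crossover: each neuron inherits ALL of its incoming
    weights (including the weight from the constant bias node) from
    one parent chosen at random, rather than mixing weights within a
    single neuron. Parents are lists of layers; each layer is a list
    of neurons; each neuron is its incoming-weight list."""
    child = []
    for layer_a, layer_b in zip(parent_a, parent_b):
        child.append([list(random.choice((neuron_a, neuron_b)))
                      for neuron_a, neuron_b in zip(layer_a, layer_b)])
    return child
```

Keeping a neuron's incoming weights together preserves whatever "feature" that neuron has learned, which is likely why it outperformed layer-level crossover here.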
Ah, yes! I hypothesized that your method would be an optimal way to train a reinforcement-learned neural net, although I have yet to train my first one. Have you compared the convergence of this method with other genetic methods, say, where every parameter is a gene instead of each node? (Sorry to probe you; I am trying to get as much data from you as I can before I write one xD) ALSO, WHAT'S THE FUNCTION TO DRAW LINES IN UNITY? (via your NN visual in the top right corner)
Hi Jabrils, what do you think about a hybrid approach, i.e. a combination of a genetic algorithm with other evolutionary strategies such as particle swarm or a gravitational search algorithm? (Sorry for my English.)
I don't have the slightest clue as to how these work or are programmed, but this fascinates me beyond belief :D I remember watching something with Super Mario 3 some years back, about a neural net learning to play.
Very fancy demonstration. I personally started looking into neural networks, and I was wondering if your code is open sourced; I'd be interested in attempting a NEAT-based implementation for personal use :)
I did something very similar (without the cool graphics) using TD(lambda) in 1998 with the goal of fastest laps using a reward function of "-1" each second and something really severe (-100?) for crashing. It worked pretty well, though occasionally I'd grow a pathological variation that would minimize the function by crashing as soon as it was able. This is really cool stuff. Thanks for sharing.
One suggestion for a fitness function that would take speed into account (i.e. reward faster runs, punish slower runs) is to have a constant living penalty: each time interval (e.g. every second), the fitness score decreases by a fixed amount. This means that for faster cars, the fitness increases at a greater rate. However, you would either want to apply this only once the cars can sort of get around the track, or make it non-linear; otherwise they'd learn too slowly.
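The living-penalty idea amounts to something like this sketch (the penalty constant is arbitrary, not from the video):

```python
def fitness_with_living_penalty(distance, elapsed_seconds, penalty_per_second=1.0):
    """Distance covered minus a constant penalty per second alive:
    of two cars covering the same distance, the faster one scores
    higher, so speed is rewarded on top of progress."""
    return distance - penalty_per_second * elapsed_seconds
```

With this shaping, a car that covers the same distance in less time keeps more of its score, which is exactly the "reward faster runs" effect described above.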
whitepen I will test this and let you know for sure. But in theory it doesn't perceive the track itself; it only reacts to sensor input, so my guess is that it should do just fine on any track, with the exception of ones having sharper curves than those in the learning session. In that case the car might not know that it has to decelerate. I will let you know when I test it.
Most of these neural networks do not consider past actions at all, so it is not a matter of memorization; however, it is possible that the decision making was improperly optimized due to a coincidental order of operations. In other words, certain combinations of turns may expose flawed neural connections. Also, overtraining a neural network in a sterile test environment could cause the network to completely fail to adapt to changes when brought to a less sterile environment.
I was about to ask the same thing as @whitepen. I think that a little overfitting may be produced due to imperceptible patterns in the track being used for training. Would it be better if a set of tracks were used for training, let's say 5 tracks, each with different degrees in the turns or even different track widths? Also, is this code available on GitHub? I'd be glad if it is.
So I tested it by adding two new tracks. I paused the best genome, which had beaten the #1 track, and switched the track. The simple track (#2) with soft turns was beaten without a problem, but another track (#3) with sharp turns couldn't be beaten. So I continued to run the genetic algorithm, and the car was able to beat the #3 track after 10 more generations. Then I swapped back to track #1 and it failed to beat it; it needed another 6 generations to finish it.
If you cast even more input-signal rays, the car will take the turns more smoothly. The rays should reach further down the road. It would also be good to add rays on the sides.
Hi! I am doing a very similar project in UE4. What kind of mutation did you use? Also, did you use recombination? What kind: one-point crossover, two-point, or uniform? Thanks!
very cool video. I'd love to hear an explanation of what's going on in that top right neural network map as it's happening... I couldn't quite figure out what that changing normal map actually meant
I did something similar and came across NEAT neural networks. These work in much the same way. The problem you will have though is that your evolution becomes part of your overall algorithm. The evolution may end up causing your networks to be overtrained. They might not be able to generalise. When you bring pedestrians into the mix you might get problems as people are unpredictable.
Have you tried adding recurrent connections, so that the car has some sense of velocity/memory? Also, why not give it more or longer distance sensors?
I think it would be more accurate to use another form of crossover, such as blended crossover, because a neural network is defined over a continuous domain, not a discrete one. In your case, though, it's treated as a discrete domain in the sense that you are using each node as a single point of a multi-dimensional vector!
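For reference, blended crossover (BLX-alpha) on a single weight looks roughly like this (a sketch; alpha = 0.5 is a common default, not a value from the video):

```python
import random

def blx_alpha(weight_a, weight_b, alpha=0.5):
    """BLX-alpha: draw the child weight uniformly from the parents'
    interval extended by alpha times its span on both sides, so the
    child can explore slightly beyond both parents."""
    lo, hi = min(weight_a, weight_b), max(weight_a, weight_b)
    span = hi - lo
    return random.uniform(lo - alpha * span, hi + alpha * span)
```

Unlike picking a weight wholesale from one parent, this exploits the continuous domain by producing intermediate (and slightly extrapolated) values.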
I've always wondered if fluid dynamics could be used for more natural AI pathing. If you watch the way people walk around things, it's similar to the fastest part of water flowing through rocks. The water in contact with the rock creates a kind of barrier that makes all subsequent water flow more efficiently and naturally.
Hey, can you tell me what strategy you used when mutating the weights? Also, how many networks were discarded and replaced in each generation? Thanks.
Are those sensors boolean sensors? What if you extended them and fed in the 3 distance vectors instead (9 ANN inputs vs. 3)? That way it could speed up on straightaways and slow down around curves.
We're trying to build a similar project. Can you please explain the measure of fitness and the fitness function in your implementation? We're planning to make the AI control just the steering; the acceleration will be fixed, so the time elapsed seems like a good measure for fitness, isn't it?
Hurdalık Cini I had distance + the square root of average velocity, or something like this. But with constant speed, just distance should do for a simple project; otherwise you can also measure dangerous situations, e.g. getting too close to a wall.
Tomek S thanks for the reply, we will use distance. One more thing: what about the threshold values of the neurons? Are they constant, or are they included in the chromosome and evolved with the weights?
Is there steering sensitivity or variable speed controlled by the neural network? If not, why did you use a hidden layer? Just connecting the inputs to the outputs would work.
I think it could perform better if it were a recurrent neural network (e.g. LSTM, GRU, or a plain RNN), since it would know what it was doing. For example, it could accelerate more if it has been driving straight for a long time.
Nice! Try seeing if PSO-based weight changing might perform better in terms of learning speed per time interval. Could you put this on git? It would be fun to test some new ideas with 🤘
You might get better results if you added two more 'whiskers' that extend 3-4 car lengths ahead and are even with the left and right of the car. The two sideways whiskers you already have don't really give it much of a chance to do longer-but-smoother turns; extra whiskers would give the neural network a better chance to 'know' the angle of a curve, which it currently can't, not without becoming a recurrent neural network. I'd be curious to see how it'd act if you added virtual barriers that it can see but pass through, like lane markers. See if it picks up bad habits, like cutting corners or drifting into other lanes.
Really nice. Like you, I also started with The Coding Train. I'm probably going to create a basic bit-based chromosome and train against a fitness function. I'm impressed by the use of neural networks in place of chromosomes. I guess you already made them fully connected and used this genetic algorithm for training the weights only.
Are all the output values calculated only from the three vision inputs? It seems to me that there is no internal memory in the computation that would keep a record of the past. I mean, the neural network will not be able to take account of the curvature of the track, because it only knows how close it is to the left and to the right. Am I correct?
Watching the red and green lines, it appears to me that it learned that if it is able to turn right, it should veer right, or if it can go left, it should veer left, and it should only go straight if that's the only option available. Does that seem accurate?
Is your fitness just the distance the car goes, or does it also use the time? If you used the time and made the max speed unlimited, it would be possible to get the AI to learn to go even faster and become more efficient, right?
Truly awesome. Can you please tell me how to integrate AI into Unity, and what is your GPU configuration? Your help will be appreciated, because I also want to work on a similar project.
Any reason to use genetic programming instead of gradient descent for this? Gradient descent is more efficient as it will change weights in the "right direction" instead of doing it randomly.
Tomek S isn't it also because GD requires the knowledge of the right output for every training sample? As far as I understand the correct output is unknown in this scenario
GD requires just a (differentiable) objective function, nothing else. It's only that in supervised learning settings we need ground truths to evaluate our objective function; otherwise, GD doesn't put any restrictions on the availability of ground truths. If you are faced with a problem setting where you can compute the utility/objective function even without ground truths (such as this one), GD will perform just as fine, I think.
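To illustrate the point: gradient descent only needs a differentiable objective, not labelled data. A toy example minimizing f(w) = (w - 3)^2 directly from its gradient, with no ground truths anywhere:

```python
def gradient_descent(grad, w0, lr=0.1, steps=200):
    """Plain gradient descent on any differentiable objective;
    no ground-truth labels are involved, only the gradient."""
    w = w0
    for _ in range(steps):
        w -= lr * grad(w)
    return w

# f(w) = (w - 3)^2 has gradient 2 * (w - 3); the minimizer is w = 3.
w_opt = gradient_descent(lambda w: 2.0 * (w - 3.0), w0=0.0)
```

The practical catch for the car is whether the objective (simulated track progress) is differentiable with respect to the weights at all; when it isn't, gradient-free methods like the genetic algorithm remain the natural fit.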
Hello, thank you very much for your video; it's indeed really interesting. I've got a question (because I'm new at this and I can't seem to grasp the answer). Let's take an example: at 1:28 you've got generation 8, genome 7. It gets around the first turn okay, then it crashes around the second turn. Now we go to the very next one: generation 8, genome 8. The code has updated itself (in the weights of your neurons) between the 7th and the 8th genome. Am I correct so far? And it's its first breakthrough, so the correction of the weights should be minimal, because it is on the right track and has covered the maximum length ever! So why on earth does it crash on the first corner right afterwards? Why can't it, from one genome to the next, get to at least the point where it crashed before? Is it that the correction is "too strong" and it messes up something somewhere earlier, so that it can't take the first corner anymore? If so, how could one make the correction "less radical" so that it "learns faster"? I hope I made some sense =) Thank you for the answers!
I answered another question like this. The car can drive an easier course but needs additional training on a harder one. Easier or harder is defined by the sharpness of the curves.
Try running parallel simulations. Run multiple variants of the car simultaneously and wait for all to have failed or completed. The top contender of each track is tried on all other tracks as well as being used to bias the adaptations placed upon the other experimental algorithms. It should result in a program that can more reliably respond to changes in the environment rather than creating one that only works effectively on a set environment. Additionally, use a second factor like time in addition to distance or apply time constraints once the distance target is achieved. What I believe would work effectively is to track distance and time/distance ratio. Once an iteration reaches the distance target, begin tracking speed.
I see you used a random function to generate the random internal nodes. I'm thinking about my own approach: how to implement nodes in the places where they are most needed, and how to figure out where a new node should be added. That way the AI should learn much faster than placing nodes randomly and checking whether the added node improved the functionality of the whole neural network or made it worse, and when to break connections between nodes if the value arriving at the next node through a weight is small.
I can see the future going backwards from here. Remove human brains from driving, replace with perfected driving matrix, create a robot neural network and slowly introduce brains back into driving. It will be kind of like the Server / Client of the 70's moving to the personal computer of the 80's & 90's now moving back to the Server (Cloud) / Client (Device) model.
How many weights did you use? What is the output vector? And how did you calculate the fitness value: just by distance, or something else? Sorry, I'm so curious.
You state you have used tutorials to produce this within Unity. Any chance of pointing to them here or in the description? I would like to extend this to my own application: * have the sensors be float distance values (same layout) * change car dynamics to fit my physical chassis * once track is completed procedurally generate a new one
Ritchie Wilson For evolutionary algorithms I watched The Coding Train channel; for neural networks, just theory from some YouTube videos. The rest came from random articles and books on the topic. The implementation is straightforward; the problems come after that, because there are dozens of choices to be made when designing an experiment like this, e.g. what the input should be, how to do crossover, whether to use fitness-based or rank-based selection, etc. This field of science is very wide; this example merely covers a fraction of it.