MENACE: the pile of matchboxes which can learn

Подписаться 1,2 млн

Просмотров 435 тыс.

50% 1

See more data and check out what we changed on the second day (which caused MENACE to learn a different strategy) in the second video: • Day 2: Revenge of MENACE
Check out Matt Scroggs’s original blog post about MENACE and in the amazing Chalkdust magazine.
www.mscroggs.co...
chalkdustmagazi...
Play against the online version of MENACE:
www.mscroggs.co...
This is the original 1961 “Experiments on the mechanization of game-learning” by Donald Michie.
www.dropbox.co...
Thanks to Katie Steckles for organising our stall at the Manchester Science Festival and Antonio Benitez for giving us the space.
The MENACE crew were:
Alison Clarke, Andrew Taylor, Ash Frankland, David Williams, Katie Steckles, Matthew Scroggs, Paul Taylor, Sam Headleand and Zoe Griffiths
Get your MENACE data here!
www.dropbox.co...
CORRECTIONS:
None yet. Let me know if you spot anything!
Thanks to my Patreon supporters who made this possible! Here are the random subset I read out during the video:
Ben White
Scott Robinson
Nelson Emerson
Amy Sandland
Neil McGovern
Support my channel and make more videos like this possible!
/ standupmaths
Music by Howard Carter
Filming and editing by Trunkman Productions
Audio mastering by Peter Doggart
Design by Simon Wright
MATT PARKER: Stand-up Mathematician
Website: standupmaths.com/
Maths book: makeanddo4D.com/
Nerdy maths toys: mathsgear.co.uk/

Опубликовано:

18 сен 2024

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист

Посмотреть позже

Комментарии : 1 тыс.

@HagenvonEitzen 6 лет назад

9:20 That suggests to build Menace A and Menace B - and have them both learn by only playing against each other

@Eurley66 6 лет назад

Would actually work, adversarial machine learning is quite interesting.

@Anvilshock 6 лет назад

And don't forget to let Robert Miles know!

@aidangarner1181 6 лет назад

This is how we end up with the matrix.

@crashdemons 6 лет назад

This model of Menace just builds a [physical] FSM (a Finite-State-Machine knowing all game states) and slowly prunes edges that lead to known failure states. In particular, this works on games that are trivial (we can iterate all the states and their moves), and it can be done by just tracing edges from each failure state back and removing that edge - something done faster without humans or matchboxes. Also, it relies on human knowledge to solve the problem since so much is already represented by these connections (box-bead-box) - so it's really questionable if you can call this machine learning versus just filtering a state-machine. [For example: if we have a phonebook of all numbers in the world and if we randomly call a number and remove it if it's disconnected, we will eventually get a phonebook of all connected numbers - does the phonebook learn?] In larger real problems you need to both be able to explore the problem space, identify undesirable states and optimize at the same time, not just prune from all possible moves.

@KuraIthys 6 лет назад

You make a valid point. And that's the challenge isn't it? The only guaranteed optimal solution is to examine the entire possibility space of a problem and find the optimal point (or points, if there are solutions of equal weight) in that space. Fine with small problem spaces, but impractical with larger ones, thus we need a way of getting a good (but not necessarily ideal) solution with less effort...

@BazzFreeman 6 лет назад

So, when I lose a game I can honestly say "I am dumber than a box of matches"

@AlexKing-tg9hl 5 лет назад

pile of matchboxes

@garychap8384 4 года назад

No, but clearly something _could_ be said about the arrangement of your _"marbles"_ ; )

@Septimus_ii 3 года назад

Yes, but the pile of matchboxes has practiced more than you

@266art 6 месяцев назад

Not necessarily @@Septimus_ii

@CractusJohn 6 лет назад

"Can a Match Box?" "No, but it can learn."

@andymcl92 6 лет назад

The secret alternative answer to the impossible quiz...

@JL-zw7hi 6 лет назад

John Joubran No but a tin can

@EtzEchad 3 года назад

I remember that Martin Gardner article (I believe he published it in Scientific American) and I built this and played it as a teenager in the 60s. This was one of the first steps I took toward becoming a Computer Scientist. That was fun!

@ADHD_Gamer Год назад

reading that book I do not remember that many boxes. I believe he removed the mirror layouts. not sure. but yet, got me into A.I. LOL

@cosmicjenny4508 6 лет назад

"This must be what procreating feels like." UM. Okay, Matt...

@bpery1614 6 лет назад

He's a mathematician, he wouldn't know otherwise

@mdfogarty 6 лет назад

Quote is at 8:32, had the same reaction as you.

@CapaTwoZero 6 лет назад

A real Parker analogy.

@PhilBagels 6 лет назад

Trust me on this: It feels different.

@patrese993 6 лет назад

Who is looking for backdoors in the AI then.......?

@MisterAppleEsq 6 лет назад

Matt 'chbox' Parker

@cosmicjenny4508 6 лет назад

+Mister Apple Damn you!

@onecommunistboi 6 лет назад

Classic Parker box.

@StuziCamis 6 лет назад

A Parker pun 👍

@dijek5511 6 лет назад

@Richard_is_cool 6 лет назад

He is a Parker matchbox, basically.

@Quintkat 6 лет назад

This is secretly one of the best and simplest videos explaining machine learning

@kxuydhj 2 года назад

"I never thought i'd have a sense of pride over a sentient pile of matchboxes, but here we are." This line was great enough by itself, but he really perfected it by saying "This must be what procreating feels like".

@amyshaw893 6 лет назад

well, i know what im coding tonight

@trickytreyperfected1482 6 лет назад

Nillie The whole point to getting good at coding is to first code what has already been coded. That way, you can then know lots of new stuff to use in your own projects.

@nix207 6 лет назад

You know what, I'm gonna try this too now.

@ragnkja 6 лет назад

That makes sense, Trey Atkins and Elf Friend. Thanks for taking the time to make me a bit less ignorant.

@amyshaw893 6 лет назад

i was just kinda bored and wanted to code something...

@Periiapsis 6 лет назад

Elf Friend coding algebra ( ͡° ͜ʖ ͡°)

@kayleighlehrman9566 6 лет назад

MENACE, for when the machine goes first, and DENNIS, for when the human goes first

@IceMetalPunk 6 лет назад

DENACE: Dueling, Educable Naughts-And-Crosses Engine. When they're pitted against each other in true adversarial learning fashion, they're still DENACE the MENACE :)

@yoyoyonono 4 года назад

Dennis liao

@linamishima 6 лет назад

Menace doesn't die, it just learns that the only way to win is not to play :D

@keithkrummel9344 6 лет назад

ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-6DGNZnfKYnU.html

@smaug131 3 года назад

Or Menace loses all hope, poor thing

@MarcelPogorzelski 6 лет назад

Corner is by the way the best opening move against humans because it's an unusual move. It's still a drawn game if played right, but people who aren't familiar has a greater chance of doing the wrong move.

@JavierSalcedoC 6 лет назад

Not because it's an unusual position but because is mathematically the best starting position

@aarondavis5386 6 лет назад

Like the person before me said: corner is the best position to open with once you know the moves if you start in the corner if the you will win 100% of the time if your opponent goes anywhere but center, if that happens take the opposite corner and you still win 100% of the time your opponent doesn't take a side space, and only in that situation are you forced to draw.

@FinetalPies 6 лет назад

Sorry but center is the best move. What's the counter to your opponent going corner first? Go center. As long as you know that the center is the most important position, its very hard to lose.

@asherael 6 лет назад

the game can reliably be won or tied starting in the corner, Menace gets to go first, it needs to take the corner.

@pedroteran5885 3 года назад

Marcel is simply right. You will get a win (at least once) against most humans by giving them a chance to use their usual centerplay strategy in cornerplay. But you will get only draw after draw after draw if you play center.

@thejunkman 6 лет назад

Obligatory quote "The only winning move is not to play"

@vpheonix 6 лет назад

"War Games" - a great movie.

@Graknorke 6 лет назад

How about a nice game of chess?

@rcb3921 6 лет назад

No. Let's play Global Thermonuclear War.

@jwgmail 5 лет назад

Hello Joshua

@Ritefita 5 лет назад

I've seen that AI's decision in some AI youtube

@gloweye 4 года назад

I'd say, start with like 4 of each color in each box, so it's harder to kill off routes early in development. It should learn a bit slower, therefore keeping it more fun at the convention, and it should end up knowing *all* Paths to Victory.

@samrichardson8388 6 лет назад

As a dad, I can tell you that procreation carries a wide range of emotions, with pride being a small part. Fear and frustration are much more common.

@andymcl92 6 лет назад

You don't think searching for the right box so you can add or remove some beads all day would be frustrating?

@samrichardson8388 6 лет назад

andymcl92 I can't speak to that. He said it was like procreation, and it may be. I only know the procreation part

@PatPatych 6 лет назад

As your mum, I disapprove this comment.

@npc6817 5 лет назад

You made a whole child? How many known universes could fit inside the sphere of radius in centimeters equal to the number of boxes that it took?

@jimnelsen2064 3 года назад

when two matchboxes love each other very much.......

@KarnKaul 6 лет назад

8:33 @Matt, that's kinda what programming feels like too! The satisfaction of your watching your theory autonomously running, and correctly... Bliss!

@DataCab1e 6 лет назад

No, no, no... Use Tic Tac boxes containing differently-colored toes!

@johncameron1935 6 лет назад

DataCab1e that took me a second.

@lucianodebenedictis6014 6 лет назад

Abandoned for the lack of toe donations

@jeremybuchanan4759 6 лет назад

Really puts the 'cure' in pedicure!

@alexwang982 5 лет назад

@npc6817 5 лет назад

@@lucianodebenedictis6014 if the machine can't survive a lack of toes then could we say it is... lack-toes intolerant?

@Zephyrio 6 лет назад

I remember in elementary school, thinking myself pretty good at the tic-tac-toe. But then a friend beat me with a corner starting move. I was quite amazed and have played with a corner starting move ever since. I'm surprised at the disparity between greens and blues in the starting box. Corner move is pretty awesome...

@mage3690 Год назад

Corner move is a very specific way to win that requires you to pick a specific corner relative to your starting corner in the second round. This is one of those "local minima" problems that crops up an awful lot in machine learning, and it's why you need very specific reward structures to teach the machine right. In this case, it doesn't make the reward structure particularly more difficult: you just need to punish it for picking center. But the problem expands exponentially, just like any problem involving decision trees not reduced by real intelligence.

@thecakeredux 5 лет назад

This is absolutely amazing. I love the cross-over of high and low tech and this is the perfect synergy.

@achu11th 6 лет назад

Parker sentient beings.

@crobes4155 6 лет назад

The human race is going to be destroyed by matchboxes!

@achu11th 6 лет назад

TheTopazRobot they are just Parker sentient. They can learn how to draw with the human race only.

@EPMTUNES 6 лет назад

hes such a MES

@achu11th 6 лет назад

EPMTUNES wrong channel, but nice to meet you. Here I prefer Parker Square jokes as you may have guessed already. So I would be considered a Parker MES.

@EPMTUNES 6 лет назад

achu11th good idea. I’m going to start to make Parker square references on mes’ vids

@kwinvdv 6 лет назад

You could also teach matchboxes to play Dr. Nim.

@Laceykat66 5 лет назад

Back in the 1960s Reader's Digest had a "Book of Adventures" that had stories, puzzles, games and activities, all in hard bound. One of the activities was building a "computer" that would play "Hex-a-pawn." This was a game that used the nine square board (3x3) and three pawns on each side. The paws moved as traditionally and the object was to get your color in your opponent's home row. Like this experiment, you had matchboxes with the various board configurations on them and inside were colored beads to indicate the move. I came across this book in the 1970s (computers were becoming more of a reality by then) and spent a snowed-in weekend building the "machine" and playing the game. It was a lot of fun and taught me how programmes worked (basic anyway) and how a computer CAN make a mistake.

@SchutzmarkeGMBH 6 лет назад

I love that it can die out. The way to win is not to play at all.

@keithkrummel9344 6 лет назад

ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-6DGNZnfKYnU.html

@pannegoleyn9734 2 года назад

I love this! When I was 9 or 10, I got a copy of Martin Gardner's "Mathematical Carnival", which contains his piece about matchbox computers, and I was absolutely fascinated by it, though I never tried to build one. Forty-something years on, it still sticks in my memory -- I know exactly where I was (in a dinner queue at school) when I read it! It's great to see it in action. (Actually, I've been mourning for that book, unable to find it for years, and it's been out of print. Happily, a couple of years ago, an ex-colleague from my first job met my ex-partner, and returned it -- apparently I lent it to him sometime in the early 90s -- and I've very happily re-read it quite recently 🙂 )

@utl94 6 лет назад

8:32 "This must be what procreating feel like." Lol.

@matterwiz1689 6 лет назад

Classic mathematitian

@KoneSkirata 6 лет назад

Pretty sure he said "proof-creating" :'D

@npc6817 5 лет назад

when you've gone too far down the nerd hole you start referring to your machine learning algorithms as "your babies"

@NoIce33 5 лет назад

Ages ago I found a description of a similar learning pile of matchboxes from an old Soviet-time puzzle book. That game was different (a breakthrough of pawns on a 3x3 cheassboard), but it inspired me to make a tic-tac-toe version. I took rotations and reflections into account and didn't need that many boxes (only about 20, don't remember how many exactly); I also used a simpler algorithm where nothing was added, only in case of loss the last move indicator was removed (and if this emptied a box then the used move indicator from the previous box et c.). The simpler algorithm was, of course, worse, because it didn't distinguish between wins and draws (this feature was carried over from the original pawn game where draws were not possible), so in the end my fully trained machine mindlessly cruised into draw even in winning position. I think I grew bored before coming up with the idea of rewarding wins by adding indicators. A slight problem with this algorithm seems to be that it quickly becomes a fan of lines that have brought success. I don't think that corner opening is any worse than centre opening; one might say it is better (because it only leaves the opponent one non-losing move, while the centre opening leaves four in a way), but MENACE apparantly happened to score its first win or two with centre opening and this filled the opening matchbox with green beads, after which it, of course, started to open with centre move and kept scoring its wins with that, and so it snowballed.

@mage3690 Год назад

The path to victory in corner move first is much more narrow than center move as well, though. The first move reduces the second move to one possibility as well, so both you and your opponent are stuck with one winning move on corner move. It's actually a fantastic example of a local minima, and it's why ML models need good reward systems to achieve the right outcome.

@chinareds54 6 лет назад

How many matchboxes would be needed to learn Global Thermonuclear War?

@Kaiwala 6 лет назад

And how much would it cost to buy enough for the nuclear winter DLC by EA?

@jaewok5G 6 лет назад

settle down, joshua

@ajreukgjdi94 6 лет назад

Inspired by this and a previous video, in a fit on boredom, i programmed a bot to play Nim and let it go second 300 times against a perfect opponent, and the only reason it wasn't infallable is because i wouldn't let the probability of any move drop to 0. But with only 11 possible board states, it made for a very easy introduction into learning programs vs. trying to teach it 300-some board states and how to recognize reflections and rotations.

@entropyzero5588 6 лет назад

I might have missed this in the video, but I think an important thing to mention is that the initial state of the boxes _isn't_ one bead of every possible colour, but instead 8 each in the first box, 4 for the second moves, 2 for the third and one each in the rest (something which isn't even covered in the blogpost in the description…). The way Matt explained the setup would have a high likelihood very quickly dying out…

@pineapplegodguy Год назад

Yeah... nice catch

@MNalias 6 лет назад

I feel like this is the machine that RU-vid uses for there adbot.

@oledakaajel 6 лет назад

Nah. This is too advanced.

@anjopag31 6 лет назад

Probably does use something similar. Inputs are what you like, a few hidden layers perform calculations, and then the output is the type of ad. Your feedback rewards or punishes the network.

@MrGeocidal 5 лет назад

Machine learning only works when it makes mistakes. Google is unaware of that fact.

@benadians1769 4 года назад

@@MrGeocidal when was the last time you rated an ad?

@souravzzz 6 лет назад

The one dislike is from the person who lost to MENACE.

@thesuomi8550 6 лет назад

U Wot M8 now there are 7 of them

@thesuomi8550 6 лет назад

MENACE is getting better

@matthewwriter9539 5 лет назад

As of Dec 26, 2018 it is 92 dislikes.

@minecraftermad 5 лет назад

@@matthewwriter9539 ppl suck at tic tac toe lmao

@tranl1050 6 лет назад

ONE OF My MOST FAVOURITE VIDEOS ON RU-vid

@badlydrawnturtle8484 6 лет назад

“I am now joined by the guy who's fault it is!” This is the reason I follow you. Well, that and computers made of matchboxes.

@Morturious 6 лет назад

If this really proves anything, it is that anything can "learn" how to do anything as long as it gets some kind of feedback from its environment. This, more than anything, is simple, concrete proof that intelligence and understanding of abstract things can arise from simple physical items and processes. ... We all are just a pile of matchboxes.

@Alex2Buzz 6 лет назад

"It's learned to resign on the first move." So basically, all it's learned in that case is that it's bad at noughts and crosses.

@VFella 4 года назад

I made one a lot of time ago. This is amazing as it demonstrates the very basics of what we call "Artificial Intelligence" or Machine Learning.

@Izandaia 6 лет назад

Now that Matt Scroggs "has" this contraption... He can't be blocked except by two or more creatures.

@untitled6087 6 лет назад

What is this, some kind of _magic?_ What would a _gathering_ of matchboxes do to help him with that?

@davidjackson2114 6 лет назад

Great fun, I did this at school at the end of the 1970's also inspired by the brilliant Martin Gardner :)

@spinnwebe_ 6 лет назад

Oh my god I was at the museum last week! I practically could’ve run into you!

@AbiGail-ok7fc 6 лет назад

Over 40 years ago, following instructions from a popular science magazine, I build a similar machine out of matchboxes. It was for a different game with less states than tic-tac-toe (so I didn't require that many boxes). But the learning strategy was different: wins were never rewarded. For a loss, you'd remove the bead of the last move where the machine still had a choice left (more than one bead in the box). This, IMO, is a superior strategy for several reasons: 1) You don't need an large supply of beads, and ever expanding boxes. 2) The machine will "die" if and only if the start position is a losing position. (And not "about 10%" as it is for Menace). 3) The opponents cannot cheat. With the learning strategy of Menace, you can manipulate it in making a bad first move by first, on purpose, losing a bunch of games. Once it has a fondness of a bad first move, you can exploit that. And since the rewards for "wins" (3 more beads for every move) are much greater than for losses (lose a random bead), for such a machine, it's much harder to unlearn bad moves. (It doesn't apply that much for tic-tac-toe where the machine goes first, as there's no bad move -- but you can exploit that strategy if you'd use Menace to learn second-player tic-tac-toe).

@mage3690 Год назад

The perfect first move response is still center on a "bad" first move, though. Heck, it's the _only_ winning response on first move to corner. The only thing that changes is the second move response, which only depends on the second move. You _could_ teach it a bad second move response, but only if you didn't allow rotation and reflection. Since this machine depends wholly on unique game states that don't affect other possibility trees, that's a non-issue.

@charlotte1924 6 лет назад

How many match boxes would it need to learn how to play Mario?

@AtlasReburdened 6 лет назад

At most, the number of pixels to the power of the number of colors to the power of the number of degrees of freedom the player has to the power of the number of possible in game coodinates, or roundabouts.

@theleftuprightatsoldierfield 6 лет назад

Atlas WalkedAway in other words, a big-ass number

@OriginalPiMan 6 лет назад

Functionally (but not literally) infinite.

@LexanPanda 6 лет назад

One step at a time. We need to get command blocks playing MarI/O first.

@binaryteddybear8741 6 лет назад

Atlas WalkedAway well, there is only one speed in Mario, right? You could divide it up in to steps, that would make it almost feasible

@Grizzly01 3 года назад

0:25 Yay! Katie Steckles from the Puzzle Hunters on Only Connect!

@RaiCar1005 6 лет назад

Don’t beat yourself up about it. Tic Tac boxes are transparent

@Zero-ks3pc 6 лет назад

Rai Car but they can be shaken up and the piece delivered without human influence, not to mention a bit of tape could cover the clear bits. The bigger issue would be size limitation as it would fill up quickly and as it approaches its limit the ability for the pieces to move freely and any piece be equally possible begins to drop to almost zero.

@tapashalister2250 5 лет назад

* the best move if you are going first is corners (in which you can actually win most times playing optimally), and if you are going second it is the edge (in which you will draw versing an optimal player)

@trobin 6 лет назад

Thanks for the vid

@standupmaths 6 лет назад

+Starrgate Thanks for watching!

@haxxx0rz 5 лет назад

1:16 "... the box that matches". I see a connection.

@partynchill6455 5 лет назад

From now on Ill be counting things in "metric universes" xD

@fredg8328 6 лет назад

I saw this a long time ago in a french science magazine. Thank you very much to bring back this memory. I always thought it was from Von Neumann

@standupmaths 6 лет назад

+Fred G Glad I could remind you! Michie was the same era as Von Neumann but was over in Bletchley Park during WWII.

@badelementofstyle5238 5 лет назад

It seems like a little part of you died when you called it "Tic Tac Toe"

@DanielPowell9992 6 лет назад

Now I want to get back into my attempt at programming a neural network into a MUD engine... I mean, thinking in terms of my favorite (PennMUSH), I have rooms (containers that players and items can occupy) and exits which connect rooms. All object types have programmable attributes....and those could be weighted values. Such as "likelihood that this exit is used by a wandering object when it picks one at random" (a rat, maybe). But tic-tac-toe would be a much easier to start with...and the immersive quality of a MUD could make for some fun roleplay effects. A rat maze, however, would be way more inline with the dungeon crawler intention of a MUD.

@danielleanderson6371 6 лет назад

The irony of it honing in on center moves is that if you know what you're doing, corners are much better, but I doubt this thing is capable of thinking ahead, since a poorly-played corner game is much more likely to lose than any center game.

@madhuragrawal5685 6 лет назад

That's not actually true, is it? Is there some way for us to play multiplayer tic tac Toe online so we can talk about this?

@Ecl1psed276 6 лет назад

Actually I believe Danielle was right. If you start in a corner, your opponent had better play in the center, otherwise you can always beat them if you play correctly. Lots of people don't know that, so you can often beat people by going in the corner. If you start in a center, your opponent had better play in the corner. This is more widely known by most people, so you are more likely to end up with a draw in this case. And finally, just don't start on the side. It is possible to win in this case, but you probably won't unless your opponent makes a dumb move or something like that.

@computerfis 6 лет назад

puzzling.stackexchange.com/questions/30/what-is-the-optimal-first-move-in-tic-tac-toe

@AshleyFrankland 6 лет назад

MENACE works on a natural selection type system, so given enough opportunity it would eventually be able to avoid losing to a strong corner game.

@ljfaag 6 лет назад

But it has the first move, so it probably won't find that strategy

@Jaburesu 6 лет назад

It's interesting that MENACE learned risk aversion and tended towards the safer "draw" as opposed to the riskier attempt to win outright.

@rohitraghunathan 6 лет назад

8:32 "This must be what procreating feels like" Oh Matt! I pity your better half.

@thejunkman 6 лет назад

If math nerds don't have sex, how do we get more math nerds?

@Richard_is_cool 6 лет назад

We get MENACE.

@Kaixo 6 лет назад

Wow, this is, in a way, machine learning brought outside of the machine!! I am currently doing a project on Neural Networks for school and this fits so perfectly well with that project! It basically is machine learning! Love it, never thought it would be possible with matchboxes tho...

@guilhermekobori3155 6 лет назад

Is there a reasoning behind the rewarding distribution being +3 win, +1 draw and -1 loss?

@gordonrichardson2972 6 лет назад

Guilherme Kobori I've seen that ratio in other simulations. Its probably the smallest set of prime integers that converge nicely, without wild gyrations, or risk of dying prematurely.

@TingTang1234567 Год назад

Having watched this video years ago I was properly tickled when it came on QI this week, also presented by a math nerd called Matt 🤣

@CormacMacCormac 6 лет назад

the only problem is every game of tic tac toe is a draw, unless one person is an idiot.

@ilya8914 6 лет назад

CormacMacCormac IKR

@minecraftermad 5 лет назад

@@ilya8914 worst game ever... i play the infinte version tho with the one who has a 5 in a row wins

@garychap8384 4 года назад

You know a person is an idiot if they don't : - place their opening mark in the corner when starting, or - or the centre when going second. Anything else, betrays a complete lack of strategy... The corner square IS the strongest _(the centre square is poisonous and prevents hidden forks)_ ... but almost nobody realises this. If you make a rule that nobody can take the centre until they have a mark on the board, then every game can be won by force.

@anandsuralkar2947 2 года назад

true and thats why i dont even count it as a game its just game for kids when u grow up it seems useless

@ShortNecked_GreenGiraffe 2 года назад

@@garychap8384 oh YES! i was hoping someone else realised! (idk but almost everyone i play with still plays the centre first it's annoying... haha i got bored once while waiting in the paediatrician back when i was 13 or something so i just started playing with myself)

@Zalied 6 лет назад

people knowing how to play and it being a solved game definitely makes it tougher. it would be interesting to see this sytem vs only children (people who almost never have strategy) or a version of itself that does the other side

@ge2719 6 лет назад

if the first box runs out surely the solution is the put one of each bead back in and keep going?

@GEM4sta 6 лет назад

Unsure of whether this would work, since you also removed beads further down the tree. I don't really want to think about it though.

@Benny_Blue 6 лет назад

GEM4sta And there might also be a halting problem here - how could it self diagnose to know what forfeits are justified, and what forfeits are not?

@youtubeuniversity3638 6 лет назад

Simple: Forfeit means loss, so it shouldn't forfeit at any point.

@cmck362 6 лет назад

By forfeit I think it's meant that there are no beads in the box. That indicates to the stack of matchboxes that all moves and their continuations are losing in that position therefore the game is lost. Basically a forfeit. If you relate that to chess it doesn't matter if it's a mate in 1 or a mate in 5. Either way the game is over so don't waste my time making me play out a formality. Basically you should resign/forfeit. At least then you can say that you saw the mate.

@eugenecbell 6 лет назад

I have never seen anyone forfeit a game of Tick-Tack-Toe. I say never give up.

@goswinvonbrederlow6602 2 года назад

The way I remember the learning algorithm from long ago was to remove the loosing move from the last box. Only when that box is empty remove the loosing move from the previous box recursively. That way it only prunes loosing strategies and the only way it could die is if there was a strategy for the second player to always win. Regarding using different flavors of tic-tacs the winner could get to keep the tic-tacs as a reward.

@Toreno13 6 лет назад

I wouldn't call those matchboxes sentient. The matchboxes simply store the learned information, the one doing the learning here is actually the human using the matchboxes.

@standupmaths 6 лет назад

+Toreno13 What if a different human did each move for MENACE? They would not even have to be told why they are getting a bead and drawing a circle, just the steps to follow. Would you say the crowd of humans involved are learning even though no one person knew what they were doing?

@Toreno13 6 лет назад

standupmaths yes, with "humans doing the learning" I meant, that they are the process which is responsible for the distribution of colored beads in each matchbox in the end. Or the instructions themselves are the process that's doing the learning. Like for a processor executing instructions (itself not knowing what it's actually doing), and the memory (where the information of the matchboxes is stored), I wouldn't say that the memory is sentient, but the processor is doing the learning and storing the progress in memory.

@MrCmon113 5 лет назад

I don't think this system is conscious, but your reason given is rather silly. Whether machine learning happens via metal wires or humans counting beads is irrelevant.

@ssrreevvaadd 3 года назад

I would agree it’s not sentient. To me the term machine “learning” implies sentience as I suppose it does to most people outside of computer science. Industries have a tendency to develop their own terms as a way to raise the barrier of entry and it can lead to real miscommunication with the public at large.

@Eleni_E 6 лет назад

As a future maths teacher with experience in museum design, this makes me itch to go get a big pile of matchboxes and build one of these stateside....

@MasterHigure 6 лет назад

10:11 "Metric universes"

@Zero-ks3pc 6 лет назад

MasterHigure a ‘metric’ is a generic term for measurement, the ‘metric system’ is the standard units for a distance using meters. So you can have metric smoots, metric universes, metric Pomeranians and it is referring to the standard set by the companion word. Metric meters I guess would be more accurate but not necessary as it is the common use and when not speaking of it, you add the secondary defining word to define the standard you are using.

@boltstrikes429 6 лет назад

What a Parker square of a measurement unit

@johnladuke6475 2 года назад

It seems like it would teach us a lot more about how the machine learns to program Menace B so that humans can move first. A deep analysis of the data generated by Menace A's centre-first strategy, compared to what Menace B does when the centre is left open by a human first move, might reveal some really interesting patterns. I think it could also learn very differently depending on how often it samples its results. Having its number of beads updated after _every_ player would respond in a different way than playing fifty games at a time and updating all those results simultaneously from the same start position. Or updating after every hundred games, two hundred, etc. Starting each box with multiples of each bead could also help smooth the numbers. A pre-learning state of having every option represented in triplicate might bring some insights. For example, those few anomalous blue beads in the opening-move box represented corner moves, and presumably led to some clever corner-based strategies that force mistakes. The machine might learn to give those strategies more weight if it has more opportunities to start with them.

@pkeshish 6 лет назад

HOW ABOUT A NICE GAME OF CHESS?

@gwenynorisu6883 6 лет назад

Horsey to King Prawn 4.

@jimmysuperchannel1527 5 лет назад

@@senik_8766 d5

@cristinaalexe7454 2 года назад

This is brilliant, both the principle and its use at a science festival!

@siekensou77 6 лет назад

tic tac would have been more interesting esp cuz you can reward the winner with a tic tac

@RichardDamon 4 года назад

Yes, that was my thought, if the player won, let them have one of the tic tacs that was drawn.

@hugme9592 5 лет назад

I'm a security engineer but my major was in AI. I have done a lot of thinking about this project and how it works. I've tried to come up with ways of improving it and while there are several there is one I think would be the most helpful with both the software and matchbox version. It also gives clues as to how to write the same software for a more complicated game like chess. That is don't start with all the boxes, start with one. Many of the boxes and paths will never be used. There are lots of outcomes which will just not happen in any game. Start with one box, all blank. When the first move is made create a new box, each move after that will create new boxes. Now you have a smaller set to work with. For both versions this will give you fewer boxes to sort through making the game much faster. This will be absolutly critical when creating the same functions for a game like chess. Programing this for chess however is a different beast, you will want to add some AI functions in addition to the machine learning. Though with the above change you could do it with only machine learning. I am considering writing my own software version of this and seeing how few lines of code I can make it happen with. I may be able to do it in less than 20.... Also, maybe using something silly, like bash.

@dannygjk 3 года назад

The problem is even with slick optimizations it will take a lifetime to train it on a PC to play strong chess, (assuming a practical PC quantum computer is never made available to you).

@kavigollamudi 6 лет назад

Not a Parker Pile of matchboxes then?

@Jakromha 6 лет назад

It kinda is, because it's playing centre instead of corner.

@EPaulIII 5 лет назад

I believe Scientific American published an article on this around 60 years ago. But they suggested that you use a lot fewer match boxes by allowing for reflections and rotations. I believe only two or three dozen boxes were needed. And they suggested that when the boxes lost, the last bead selected in the last box should be removed. Nothing was added for a win. After loosing several dozen games to me, the match boxes became totally unbeatable if they had the first move. A lot simpler than what you are doing here. And it worked to perfection.

@moogthedog2816 6 лет назад

"It's 10 to the 27 metric universes across?" What about the old imperial universes?

@alexwang982 5 лет назад

moogthedog *imperial March plays*

@Coen80 6 лет назад

favorite channel. keep up the good work. really love the mix of humour and information!

@harshzhoshi 6 лет назад

What happens if Menace plays Menace? Edit: Also, extremely sensitive to initial conditions!

@bloomtwig76 3 года назад

I believe the phd student Matt was far underestimating the volume the match boxes needed to make a chess-menace. This is because a lot more options exist at each move, here you have at most 9 options per move chess gas sixteen pieces each side and each piece can typically go to several different squares, thus you would need larger match boxes!

@dannygjk 3 года назад

Yes it would not only be impossible to do with real life matchboxes it would also be impossible to do on a computer because the number of possible states of chess is ridiculously high. Another problem is that training method converges extremely slowly and as the number of game states rises the training time blows up exponentially.

@MrSimpsondennis 6 лет назад

but, if you start with 1-1-1 in each box, doesn't that completely erase an option upon losing? instead of just lowering the odds? Also, Menace going 2nd should result in more interesting results, since the opening move is a variable (humans don't always start center), so the countermove will have more variety and as such the result may vary more.

@damienporter5345 6 лет назад

But only the last box contains just 1 of each bead. Which is fine as a loss from there should be discarded immediately. The ealier boxes contain multiple copies of each bead.

@joshuarosen6242 6 лет назад

@Damien Porter While that would make perfect sense, did he say so? If so, I missed that bit.

@damienporter5345 6 лет назад

Joshua Rosen I don't think he says it, but it is in the discription that he links to.

@joshuarosen6242 6 лет назад

Damien Porter Which I didn't read. Thank you, I now shall.

@half_pixel 6 лет назад

Love the music in this one!

@unvergebeneid 6 лет назад

"This is what procreation must feel like." 😂😂😂 Wow, this is one of the saddest sentences I've ever heard!

@stuartcoyle1626 6 лет назад

I remember doing this when I was a kid based on a Martin Gardiner article. Thanks for the memories.

@senshtatulo 6 лет назад

I did the same thing.

@robertofontiglia4148 6 лет назад

"This must be what procreating feels like" -- Oh Matt...

@christopherpellerito3809 6 лет назад

I'm pretty sure that Martin Gardner did this in the 1950s, but I am glad that Matt Parker is keeping the tradition alive.

@danjtitchener 6 лет назад

So you wanted to make matchboxes learn to win noughts and crosses but it only learnt to draw? That's a real Parker Square of a machine learning routine...

@kellel5610 6 лет назад

Daniel Titchener tic tac toe is a sufficiently easy game that each player can force a draw or win provided that one of the players uses the best strategy

@MrCmon113 5 лет назад

You cannot win. The game is so simple that a human without a severe mental disability will always force a draw, no matter how much more intelligent or skilled you are.

@kailenlee33 4 месяца назад

I love the derision with which he says "or Tic-Tac-Toe".

@Parax77 6 лет назад

"If the first box runs out, it has learnt to resign on the first move, and that is Bad......" BUT Wargames taught us that is the correct move! ru-vid.com/video/%D0%B2%D0%B8%D0%B4%D0%B5%D0%BE-6DGNZnfKYnU.html

@oz_jones 6 лет назад

*only winning move, not necessarily the best

@JeanLucCoulon 6 лет назад

I can remember, in 1964, we had a publication called "L’Album des jeunes" from the Readers Digest Selection. There was a similar game. It was an "Hexapion". It was very easy to play even for a kid (like me) and a few (not a lot) of matchboxes. The principle was the same and it was very frustating (for me) to failed to win against a couple of matchboxes. At this time I was thinking that the way the "machine" was punished was not a real punition because this punition was helping the machine to win... And on my side I had nothing to help...

@MikeOxolong 6 лет назад

I thought, that the best way is to start with a corner.

@kalebbruwer 6 лет назад

Tazer Of you do it right, but it is very unlikely to stumble across it by chance. Watch 3blue1brown's videos on the topic.

@SuperGarryGamer 5 лет назад

I actually discovered it :D

@owez08 6 лет назад

I'm glad a video about machine learning was finally able to tell me how it is programmed to learn, at least at a basic level. (I know I could have googled it but I couldn't be bothered most of the time that it came up)

@gregorymaynard3089 6 лет назад

tic tacs learning tic tac toe tactics, has science gone too far?

@41-Haiku 6 лет назад

I say it hasn't gone too far enough!

@squeakybunny2776 6 лет назад

Tic tac toe tactics😄😄 oh man I love that

@benjamins2683 6 лет назад

Hey Matt I wrote a programm in c# which simulates your matchbox MENACE. Its mostly a replica of the matchboxes but I made some adjustments like a lower bound on how many different beed from each color stay in the boxes so that it cant die. I also added an auto-learn function where MENACE playes against himself and learns that way.

@AlexiLaiho227 6 лет назад

really disappointed neither katie nor matt said "link in the dooblydoo"

@serhancinar5218 6 лет назад

I feel quite strange that how much I find this video very entertaining. Excellent work!..

@robertnorth5725 6 лет назад

SOOOOOOOOOO, ....... at the end of the match,the inanimate match wins the match!!?!?!?!?!?!?! That's MATCHLESS!!!!!!!!!!!!!!!!!!!!! (& menace says ; "YOU'VE MET YOUR MATCH!!!!!!!") hahaahaaaaa

@sebastianrodriguezcolina634 6 лет назад

It is an interesting that they did it in a museum and maybe some kids and adults alike would learn a bit of how machine learning works. A simple search algorithm would be much more efficient for this particular game though

@geogeo3644 6 лет назад

To be honest this made me truly grasp neural networks. Thanks

@Marconius6 6 лет назад

This isn't really neural networks, I'm sorry to say. It's just basic learning where the machine is aware of all the possible states ahead of time, and just assigns values to them based on past experiences. Neural networks are kinda based on this idea, but a bit more abstracted; they don't look at individual game states, and there are multiple 'layers' that each process information in a different way, influenced by their previous layer.

@standupmaths 6 лет назад

+Geogeo 3 Glad I could help! Remember this is only a first-order approximation and actual neural networks are much more complicated. But nothing a lot of matchboxes couldn’t do.

@TakeoFR 6 лет назад

That's close to Q-learning (with discount factor equal zero). A Neural network would be different.

@lachlanscott7494 6 лет назад

I always going in the corner was the best move, since unless they go in the centre you will be able to force a victory. Also, when they do go in the centre going in the opposite corner forces them into a non-corner move (otherwise you can also force a victory). It is these subtleties that allows one to beat a human in tic-tac-toe, so I think calling the blue fellas trivial is a mistake. Otherwise great job, very enlightening!!

@AashishNehete 6 лет назад

Matt Parker for Doctor Who anyone?

@romainbornes22 6 лет назад

Aashish Nehete yeeeeeeeeeeeeeesss.

@RoderickEtheria 5 лет назад

Corner is better than center to start. Corner yields 7 wins and 1 possible draw on opponent's first turn. Center yields 4 won games and 4 possible drawn games on opponent's first move.

@ShinySwalot 6 лет назад

Why are Matt and Katie always together?

@computerfis 6 лет назад

They work together. "Katie works for Think Maths with Matt Parker, giving talks in schools around the country about engaging off-curriculum mathematics. She also does admin and project management for Think Maths".... source: www.katiesteckles.co.uk/

@Richard_is_cool 6 лет назад

They are Parker married. Katie even Parker took his surname (which means she didn't).

@joeshoesmith 6 лет назад

Shiny Swalot I hear their subjects are similar somehow but I have no idea how.

@standupmaths 6 лет назад

+Shiny Swalot We’re maths buddies!

@alphamikeomega5728 6 лет назад

To learn what procreating feels like.

@HunterJE 2 года назад

Re: the many people in the comments correctly observing that corner is actually optimal starting move - it is the optimal first player starting move IF you continue to play optimally, but with MENACE's initially random play I suspect center is more likely than corner to at least blunder in to at least a draw (since it blocks off the most possible opponent lines of any move), and since MENACE is going to be losing a lot of games before it starts reliably winning (seen in the initial precipitous drop in the chart) those early phases are going to select for the start that's least likely to result in a loss...