
Scrabble GM vs. AI -- the Rematch! Game #26 

Mack Meller
Subscribers: 4.5K
Views: 1.6K

The Scrabble AI BestBot got the best of me in my 100-game Human vs. AI Ultimate Scrabble Battle, but I'm not ready to cede to our AI overlords! Introducing... the GM vs. AI rematch!
This 100-game series, running every Monday and Wednesday at 5pm ET for 50 weeks, will feature 20-minute games against BestBot with post-game analysis. Hope you guys enjoy, and wish me luck!
BestBot is the upcoming ultimate Scrabble AI from Woogles.io, to be launched in 2024. For questions, please email woogles@woogles.io.
Want personalized help taking your game to the next level or a fun gift for a friend? Check out www.mackmeller...! for more info or email me at mackmeller@gmail.com!

Published: 12 Sep 2024

Comments: 57
@sulerelus 22 days ago
at this point, mack just needs to beat the white sox' record through 100 games this season (27-73)
@whitesoxMLB 22 days ago
Apparently even Scrabble videos aren't safe :'(
@ohtani2024 22 days ago
Mack is on pace for 23-77. The White Sox have a better record!😅😅
@sulerelus 21 days ago
@whitesoxMLB im rooting for reinsdorf out the door, my heart is in the right place
@mackmeller 21 days ago
Hahaha I'll admit that's even better than I thought they were doing, I would've guessed like 18-82 but I don't follow super closely (though closely enough to know they were bad lol)
@ohtani2024 21 days ago
@mackmeller they are now 31-97, which is like 24-76 in a 100 game season
@5cr4bb13 22 days ago
one thing that's really stuck with me as I've improved at Scrabble over the past few years is that every heuristic has an exception. I think OR really exemplifies that - opening a huge bingo lane while up 30 with a generally-ok leave is not something that most top experts would do. Looking at that position, I don't think I would have had the guts to make that play either. But this is the great part about computer analysis tools: it shows us points where our intuition fails. There are definitely reasons why BestBot would overvalue OR, given how its algorithms work, but I think it is still a very interesting position to learn from.
@CyberchaoX 22 days ago
The interesting thing is, the idea of "distracting the opponent" that Mack talked about, drawing the attention to the bottom to get the top to stay open, is a very human behavior.
@mackmeller 21 days ago
For sure, and that to me is the biggest difference with computers -- they never get emotional or have the natural risk-aversion that most people do, and as such aren't afraid to make plays like OR that I agree almost all humans would scoff at
@domino14 20 days ago
TROGON and OR are within 0.01-0.05% after many many iterations (roughly 77.3% to win). It was basically a coin flip from the bot's perspective. I still think it's a bit of a crazy play as well. From its perspective, it's giving up 6 more points by playing OR vs TROGON. However, it says it bingoes 58.4% of the time after OR, vs 22.8% after DRONGO. That's not enough for me to justify opening the board like that. Still, it's actually not ahead by that much. If it blows up its leave and you X bomb maybe it sees that as a problem?
@AugustusMatthias 22 days ago
Definitions of Interesting Words Played in Game 26:
JOLE (22 pts) [noun] - variant spelling of "jowl"; cheek, the boneless cheek meat of a hog [from Old English]
FEZ (34 pts) [noun] - a brimless cone-shaped hat that has a flat crown usually with a long tassel attached, is usually made of red felt, is worn by men in eastern Mediterranean countries (as Turkey), and has been adapted for women's hats in Europe and America [French, from a Moroccan geographical name]
DEIFICAL (76 pts) [adjective] - variant of "deific"; divine, godlike
MANTLING (74 pts) [noun] - a heraldic representation of a mantle behind and around a coat of arms [made up of French and English combining forms]
CASERNS (84 pts) [noun] - plural of "casern"; a military barracks in a garrison town [French, from Provençal (a Romance language of Southern France), ultimately from Latin]
UNLAY (35 pts) [transitive verb] - to untwist the strands of (as a rope) [made up of originally English parts]
WHANG (32 pts) [intransitive verb] - to make or produce a resonant noise [alteration of another English word]
DITZ (42 pts) [noun] - a dizzy person [alteration of an English word of unknown origin]
DRONGOES (86 pts) [noun] - a plural of "drongo"; a bird of the family Dicruridae native to Asia, Africa, and Australia [from Malagasy, an Austronesian language of Madagascar]
BOITE (40 pts) [noun] - variant spelling of "boîte"; a nightclub [from French]
KAE (21 pts) [noun] - (chiefly Scottish) jackdaw [from Middle English (northern dialect)]
BENE (24 pts) [noun] - variant spelling of "benne"; another term for sesame [perhaps of West African origin]
LIRI (6 pts) [noun] - plural of "lira"; the basic monetary unit of Italy until 2002 [from Italian]
@whitesoxMLB 22 days ago
You play OR to distract from bingo defense given the vowel-heavy pool. I play OR because I don't know that TROGON or DRONGO are words. We are not the same.
@mackmeller 21 days ago
Haha gotta start birdwatching! (both TROGON and DRONGO are birds)
@ScrabbleKenji 22 days ago
I'd like to think I'm capable of playing OR. I do think it's definitely the best play here. Against a lot of humans I'd have to bail out with TROGON or something, because the risk is very high and I'd feel so dumb when this backfired, but it certainly looks like the best play to me, honestly by quite a lot. For me, one of the reasons I avoid playing as open as other top players is because I've always felt like this sort of aggression is absolutely optimal once you're on these sorts of boards a lot of the time, and playing stuff like OR does make me a little sick to my stomach, but once here I think you kinda have to do it. The pool smashes the *** out of your range, you're not ahead by enough to outlast bad draws or opponent good draws, and you can always fish again if you miss.
@mackmeller 21 days ago
Yep, can't argue with any of this -- I feel like I've adapted a lot from seeing BestBot play, probably not enough yet to feel confident I'd make a play like OR in this kind of situation, but maybe I'll get to that point eventually
@AlexDings 21 days ago
One thing about OR is that it keeps five different tiles, which is nice insurance against the pool (full of duplicated vowels, plus Q and V). It risks the counterbingo from you but the upside is that it ensures the bot is fine in almost all other scenarios. I like making those kinds of plays (although whether they are sound when I do them is another question 😉)
@squadxzo 22 days ago
I'm so used to content creators saying they 'privated' videos instead of saying they set the videos to private that I was near certain it was a word. I guess there are always lessons in these videos regardless of the outcome.
@mackmeller 21 days ago
Ah, never heard that but I guess I should know given I'm a pretty well-established content creator by now! Or maybe it's better I not know so I don't accidentally phony 😂
@ManyNestedTree 17 days ago
Similarly burnt for trying to play FAVORITED# once
@EmmsterGD 22 days ago
Spoiler cinderblock. I love me some good ol fashioned concrete
@AmaranthRBY 21 days ago
Never seen Mack straight up stop talking to react to a tile draw lol. Crazy that the 5 Es are a sidenote of this game because OR happened
@humbertocruz6214 21 days ago
Being allowed to play phonies would make the bot even more formidable and better simulate an over-the-board game
@morrisgreenberg5223 20 days ago
After OR, my immediate instinct was playing PA(C)T given the range. If you assume after OR that the Bot will bingo on either the top or the bottom more than 50% of the time, then VIPER is basically an insta-loss that often. If you assume the Bot's range is less good to the point where its optimal strategy is to open a second lane to long-term give itself options (as it seems like the OR play is really to minimize the scenarios where you have a strong enough range to bingo from the D and it is drawing dead to your newfound lead), then opening a third lane could flip the concept towards your advantage in terms of tempo. PA(C)T makes the rest of the game a lot more random which seems advantageous in a situation where you're down 30 with the opponent either having a great range or needing the volatility to outrun high-point bingos in the short term.
@thomasdawson2257 22 days ago
5 E's, lol, actually hilarious. And your reaction, haha
@mackmeller 21 days ago
Eeeeek
@thatspsychotic 20 days ago
This made me laugh so hard
@blueboybob 22 days ago
Do you know how often the bot is updated? Is it possible it's just gotten that much better since the last series?
@mackmeller 21 days ago
I don't know exactly, and I don't think there's a regular schedule per se, but Cesar could probably provide more info
@domino14 20 days ago
I actually did update it in the last week or so, and made it significantly (to a relative degree, of course) better. It used to beat HastyBot around 61% of the time, now it's 63%. I fixed a few bugs around the cutoff algorithm mostly, and now it also will not slow-roll to nearly the same degree (the second fix is to lower user frustration, not necessarily to make it better).
@sebastiangrahamchavez8412 22 days ago
Spoiler block 🧱
@Phaaee_ 21 days ago
what a fun episode!
@mackmeller 21 days ago
Thanks!
@jacksonsmylie279 20 days ago
I like OR a lot. Can’t say I would always think of it over the board or have the cojones to pull the trigger, but it’s great. While BB has a lead, it’s still pretty precarious after something like TROGON or DRONGO with lots of potential bad draws, bingoes you could hit, X bomb, etc. Turning off the jets there seems premature; OR seals up the game immediately a solid 40% of the time
@Dashie- 22 days ago
“PAW, MAW, it’s close” You’ve just summoned the furries, Mack!
@miskee11 22 days ago
WOOF
@zVersee 22 days ago
Rawr >:3
@klawiehr 21 days ago
Yeeeowww
@bro91wn 21 days ago
I would not play OR. I get the idea of distracting, and opening up two lanes on a board that has only one, but it seems too aggressive to me. I'm not quite a top player, though. On the UNLAY turn, I liked 9C AYIN for 28, setting up the L front hook, and also setting up the possibility of LUT(Z) with a T draw. Also, UNLAY seems to close off the bottom on a board that is already dying, as there are no Cs or blanks left for MYC. It is quite a big equity sacrifice, but I like the board after AYIN much more than after UNLAY at a ~30pt deficit.
@cbauermusic 22 days ago
ENTERTAINING TANGENT MACK
@craiglarimer1173 14 days ago
Nice game. I think OR was a distraction.
@mikewarner3597 22 days ago
Was "or" a godlike mind game?
@kewich3729 21 days ago
ENDGAME SPOILER Wouldn't the chance of drawing 5 E's in the endgame be much rarer than 15 choose 5 (1/3003), since your opponent is more likely to have E's since they are a good tile, thus making getting an E rarer?
@mackmeller 21 days ago
Maybe, though this wasn't really the endgame, so their rack feels a bit more random
@kewich3729 21 days ago
@mackmeller Ok!
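The 1/3003 figure in this thread can be checked with a short calculation. This is a minimal Python sketch, assuming 15 unseen tiles of which exactly 5 are E's, with every 5-tile draw equally likely. The helper name `prob_all_target` is hypothetical, and the equal-likelihood assumption deliberately ignores the point raised above that the opponent's rack may not be random.

```python
from math import comb


def prob_all_target(unseen: int, target: int, draw: int) -> float:
    """Chance that all `draw` tiles, drawn without replacement from
    `unseen` tiles, come from the `target` tiles of interest.

    Assumes every combination of `draw` tiles is equally likely.
    """
    return comb(target, draw) / comb(unseen, draw)


# Assumed scenario from the thread: 15 unseen tiles, 5 of them E's, 5 drawn.
ways = comb(15, 5)               # number of distinct 5-tile draws
p = prob_all_target(15, 5, 5)    # only one of those draws is all E's

print(ways)  # 3003
print(p)
```

If the opponent is more likely than chance to be holding E's, as suggested above, the true probability would be lower than this uniform estimate.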
@asdfasdf4924 22 days ago
UU = -18.8 WU = 13.0 It sounded like you had some kind of noise in the background there, it would be nice if you could solve that for next one
@mackmeller 21 days ago
Yeah apologies for the noise, I'm definitely hearing it too on rewatch -- think it was computer fans, I'll make sure to not have other tasks going next time
@coolnath99 21 days ago
i think the bot played OR just to flex 💪
@ScrapFatherScrapSon 21 days ago
Is teenie valid?
@mackmeller 21 days ago
Nope, unfortunately
@almightyhydra 22 days ago
The bot draws four Ss and both blanks while Mack draws 5 Es on the same rack. (Another "I'd like to sell a vowel" game!) And then the bot comes up with a risky play of OR - because it knows Mack's rack and that he can't exploit it? (The bot is in control of tile draws, so it knows the entire bag, and therefore what's on Mack's rack.)
@ryanlind5239 21 days ago
No, this has been debunked countless times. Also Mack didn't draw 5 Es, he drew 4. Still absurd but it happens.
@cukka99 21 days ago
Enough of the inane conspiracy theory peddling already
@domino14 20 days ago
shut your face
@arcieplays9040 21 days ago
Hello! Tough loss. I've analyzed both your and BestBot's moves with macondo (ply 5) and here is my analysis: POUC(H) sims 5th at 39.61% behind POW (40.27%), CAMPO (40.64%), MAW (40.7%), and MOW (41.13%). BestBot's J(O)LE sims 2nd at 66.75% behind J(O)LLIED at 67.32%. WAVY is best, and BestBot's 5I ODD sims 4th (59.25%) behind 9G OLD (59.3%), 5I ODIC (59.38%), and 9G ODD (59.55%). FEZ, (D)eiFICAL, MANT(L)ING, cASERNS, UNLAY, WHA(N)G are all clear best moves. Exchange EEEEI is preferred (17.37% vs 17.32%) over exchanging EEEE. BestBot's ROM sims 2nd at 87.4% behind MORN at 87.46%, and DIT(Z) is best. 14E OR sims 2nd at 77.02% behind 3H (T)ROGON at 77.37%. Still, a very interesting play that totally paid off for the bot. VIPER sims best, even using macondo's inference feature. Simming with perfect inference, plays that block the bingo rows up top sim only fractionally better than VIPER. After (D)RONGOES, all other plays before the endgame were best, and all chances for Mack slipped away. All the best in the next game, Mack.
@mackmeller 21 days ago
Thanks for all the detailed sims!