Тёмный

AlphaZero: An Introduction 

Aaron Davis
Подписаться 1,1 тыс.
Просмотров 42 тыс.
50% 1

We look at a project that I did trying to create a version of AlphaZero for the game Pente. AlphaZero uses Monte Carlo tree search in combination with machine learning and neural networks to estimate which moves are the best for the current player. We use convolutional neural networks to attempt to estimate move values for every possible move.
#some2 #3blue1brown
Primary Sources:
arxiv 1902.10565 - Accelerating Self-Play Learning in Go
Music:
Credit to Ludwig and Schlatt's Musical Emporium
(Everything below is me trying to beat the algorithm)
This implementation doesn't use the attention mechanism. Also, the original implementation by DeepMind could beat even the best chess players like Magnus Carlsen. There's also a subtle reference to the controversy around Hans Niemann hidden in the video, if you can notice it. Transformers could be used here, like in Leela Chess Zero, but we don't do that.

Опубликовано:

 

19 сен 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 94   
Далее
Alpha Zero and Monte Carlo Tree Search
23:35
Просмотров 42 тыс.
The moment we stopped understanding AI [AlexNet]
17:38
AI can't cross this line and we don't know why.
24:07
Просмотров 639 тыс.
I Beat MrBeast With Just a King and a Queen
30:54
Просмотров 17 млн
Watching Neural Networks Learn
25:28
Просмотров 1,3 млн
How is This Possible? | AlphaZero Shows Us the Way
14:04
Alpha Zero's Top 5 Moves Of All Time!!!
12:10
Просмотров 291 тыс.
How AlphaZero Completely CRUSHED Stockfish
33:48
Просмотров 4,3 млн