A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

One program to rule them all Computers can beat humans at increasingly complex games, including chess and Go. However, these programs are typically constructed for a particular game, exploiting its properties, such as the symmetries of the board on which it is played. Silver et al. developed a program called AlphaZero, which taught itself to …

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play Read More »