Player of Games: All the games, one algorithm! (w/ author Martin Schmid)
#playerofgames #deepmind #alphazero
Special Guest: First author Martin Schmid
Games have long been used in research as testbeds for AI algorithms, such as reinforcement learning agents. However, different types of games usually require different solution approaches, such as AlphaZero for Go or Chess, and Counterfactual Regret Minimization (CFR) for Poker. Player of Games bridges the gap between perfect and imperfect information games and delivers a single algorithm that uses tree search over public information states and is trained via self-play. The resulting algorithm can play Go, Chess, Poker, Scotland Yard, and many more games, as well as non-game environments.
OUTLINE:
0:00 - Introduction
2:50 - What games can Player of Games be trained on?
4:00 - Tree search algorithms (AlphaZero)
8:00 - What is different in imperfect information games?
15:40 - Counterfactual Value- and Policy-Networks
18:50 - The Player of Games search procedure
28:30 - How to train the network?
34