![]() | Up a level |
Srinivasan, Sriram, Lanctot, Marc, Zambaldi, Vinicius, Perolat, Julien, Tuyls, Karl, Munos, Remi and Bowling, Michael
(2018)
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 31.
pp. 3422-3435.
Lanctot, Marc, Zambaldi, Vinicius, Gruslys, Audrunas, Lazaridou, Angeliki, Tuyls, Karl, Perolat, Julien, Silver, David and Graepel, Thore
(2017)
A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning.
The Thirty-first Annual Conference on Neural Information Processing Systems (NIPS), 30.
pp. 4191-4204.
Perolat, Julien, Leibo, Joel Z, Zambaldi, Vinicius, Beattie, Charles, Tuyls, Karl and Graepel, Thore
(2017)
A multi-agent reinforcement learning model of common-pool resource
appropriation.
The Thirty-first Annual Conference on Neural Information Processing Systems (NIPS), 30.
pp. 3644-3653.