Peng, Bei ORCID: 0000-0003-0152-3180, Rashid, Tabish, de Witt, Christian A Schroeder, Kamienny, Pierre-Alexandre, Torr, Philip HS and Bohmer, Wendelin
(2021)
FACMAC: Factored Multi-Agent Centralised Policy Gradients.
In: Thirty-fifth Conference on Neural Information Processing Systems, 2021-12-06 - 2021-12-14, Online.
Pan, Ling, Rashid, Tabish, Peng, Bei ORCID: 0000-0003-0152-3180, Huang, Longbo and Whiteson, Shimon
(2021)
Regularized Softmax Deep Multi-Agent *Q*-Learning.
In: Thirty-fifth Conference on Neural Information Processing Systems, 2021-12-06 - 2021-12-14, Online.