Up a level |
Number of items: 1.
Pan, Ling, Rashid, Tabish, Peng, Bei ORCID: 0000-0003-0152-3180, Huang, Longbo and Whiteson, Shimon
(2021)
Regularized Softmax Deep Multi-Agent Q-Learning.
In: Thirty-fifth Conference on Neural Information Processing Systems, 2021-12-6 - 2021-12-14, Online.