Up a level |
Dippel, Oliver ORCID: 0000-0002-6252-2248, Lisitsa, Alexei ORCID: 0000-0002-3820-643X and Peng, Bei ORCID: 0000-0003-0152-3180
(2023)
Deep Reinforcement Learning for Continuous Control of Material Thickness.
In: The Forty-third SGAI International Conference on Artificial Intelligence, 2023-12-12 - 2023-12-14, Cambridge, United Kingdom.
Huang, Xiaowei ORCID: 0000-0001-6267-0366, Peng, Bei ORCID: 0000-0003-0152-3180 and Zhao, Xingyu ORCID: 0000-0002-3474-349X
(2022)
Dependable learning-enabled multiagent systems.
AI COMMUNICATIONS, 35 (4).
pp. 407-420.
Peng, Bei ORCID: 0000-0003-0152-3180, Rashid, Tabish, de Witt, Christian A Schroeder, Kamienny, Pierre-Alexandre, Torr, Philip HS and Bohmer, Wendelin
(2021)
FACMAC: Factored Multi-Agent Centralised Policy Gradients.
In: Thirty-fifth Conference on Neural Information Processing Systems, 2021-12-6 - 2021-12-14, Online.
Zhang, Tianhui, Bollegala, Danushka ORCID: 0000-0003-4476-7003 and Peng, Bei ORCID: 0000-0003-0152-3180
(2023)
Learning to Predict Concept Ordering for Common Sense Generation.
In: Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 2: Short Papers), 2023-11 - 2023-11, Bali, Indonesia.
Pan, Ling, Rashid, Tabish, Peng, Bei ORCID: 0000-0003-0152-3180, Huang, Longbo and Whiteson, Shimon
(2021)
Regularized Softmax Deep Multi-Agent <i>Q-</i>Learning.
In: Thirty-fifth Conference on Neural Information Processing Systems, 2021-12-6 - 2021-12-14, Online.