Browse by People


Number of items: 5.


Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831 and Whiteson, Shimon
(2021) Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning. Autonomous Agents and Multi-Agent Systems, 35 (2). 25.


Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831
(2020) Difference Rewards Policy Gradients. [Internet Publication]


Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831
(2022) Difference rewards policy gradients.


Castellini, Jacopo
(2022) Improved Representations for Cooperative Multi-Agent Reinforcement Learning. Doctor of Philosophy thesis, University of Liverpool.


Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul and Whiteson, Shimon
(2019) The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning. [Internet Publication]

This list was generated on Sat Oct 14 11:57:36 2023 BST.