Skip navigation

The University of Liverpool Repository

Browse by People

Number of items: 5.

Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831 and Whiteson, Shimon (2021) Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 35 (2). 25-.

Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831 (2020) Difference Rewards Policy Gradients. [Internet Publication]

Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831 (2022) Difference rewards policy gradients. .

Castellini, Jacopo (2022) Improved Representations for Cooperative Multi-Agent Reinforcement Learning. Doctor of Philosophy thesis, University of Liverpool.

Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul and Whiteson, Shimon (2019) The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning. [Internet Publication]

This list was generated on Sat Oct 14 11:57:36 2023 BST.

Repository Staff Access

Research Support, University of Liverpool
Sydney Jones Library, Abercromby Square Liverpool L69 3DA, UK
+44 (0)151 794 0000