Up a level |
Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831 and Whiteson, Shimon
(2021)
Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning.
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 35 (2).
25-.
Katt, Sammie, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2019)
Bayesian Reinforcement Learning in Factored POMDPs.
Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems, abs/18.
pp. 7-15.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2014)
Best Response Bayesian Reinforcement Learning for Multiagent Systems with State Uncertainty.
.
Çelikok, Mustafa Mert, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Kaski, Samuel
(2022)
Best-Response Bayesian Reinforcement Learning with Bayes-adaptive POMDPs
for Centaurs.
[Preprint]
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831, Gallego, Jose, van der Pol, Elise and Gross, Roderich
(2019)
Beyond Local Nash Equilibria for Adversarial Networks.
Springer International Publishing.
Roijers, Diederik M, Scharpff, Joris, Spaan, Matthijs TJ, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, de Weerdt, Mathijs and Whiteson, Shimon
(2014)
Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty.
.
Roijers, Diederik, Scharpff, Joris, Spaan, Matthijs, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Weerdt, Mathijs De and Whiteson, Shimon
(2014)
Bounded Approximations for Linear Multi-Objective Planning under Uncertainty (Extended Abstract).
.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2014)
Dec-POMDPs as Non-Observable MDPs.
[Report]
Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831
(2020)
Difference Rewards Policy Gradients.
[Internet Publication]
Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831
(2022)
Difference rewards policy gradients.
.
Claes, Daniel, Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Tuyls, Karl, Hennes, Daniel and van der Hoek, Wiebe
(2015)
Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks.
In: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, Istanbul, Turkey.
Claes, Daniel, Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Hennes, Daniel, Tuyls, Karl and Van der Hoek, Wiebe
(2015)
Effective Approximations for Spatial Task Allocation Problems.
.
Kanters, Timon V, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Kaisers, Michael, van den Bosch, Stan R, Grispen, Joep and Hermans, Jeroen
(2016)
Energy- and Cost-Efficient Pumping Station Control.
.
Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Kochenderfer, Mykel J
(2015)
Exploiting Anonymity in Approximate Linear Programming: Scaling to Large
Multiagent MDPs (Extended Version).
In: AAAI Fall Symposium on Sequential Decision Making in Intelligent Agents, Westin Arlington Gateway in Arlington, Virginia adjacent to Washington, DC.
Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Kochenderfer, Mykel J
(2016)
Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs.
.
Satsangi, Yash, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015)
Exploiting Submodular Value Functions for Faster Dynamic Sensor Selection.
In: 2015.
Satsangi, Yash, Whiteson, Shimon, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Spaan, Matthijs TJ
(2018)
Exploiting submodular value functions for scaling up active perception.
AUTONOMOUS ROBOTS, 42 (2).
pp. 209-233.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan J
(2015)
Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions.
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015-J.
pp. 1645-1651.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831, Gallego-Posada, Jose, Pol, Elise van der, Jong, Edwin D de and Gross, Roderich
(2017)
GANGs: Generative Adversarial Network Games.
.
(Unpublished)
Castellini, Jacopo
(2022)
Improved Representations for Cooperative Multi-Agent Reinforcement Learning.
Doctor of Philosophy thesis, University of Liverpool.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan
(2015)
Influence-Optimistic Local Values for Multiagent Planning.
.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan J
(2015)
Influence-Optimistic Local Values for Multiagent Planning.
In: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, Istanbul, Turkey.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan
(2015)
Influence-Optimistic Local Values for Multiagent Planning --- Extended
Version.
CoRR, abs/15.
Suau, Miguel, He, Jinke, Congeduti, Elena, Starre, Rolf AN, Czechowski, Aleksander and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2022)
Influence-aware memory architectures for deep reinforcement learning in POMDPs.
NEURAL COMPUTING & APPLICATIONS.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2018)
Interactive Learning and Decision Making: Foundations, Insights & Challenges.
In: Twenty-Seventh International Joint Conference on Artificial Intelligence {IJCAI-18}, 2018-7-13 - 2018-7-19, Stockholm.
Behbahani, Feryal, Shiarlis, Kyriacos, Chen, Xi, Kurin, Vitaly, Kasewa, Sudhanshu, Stirbu, Ciprian, Gomes, Joao, Paul, Supratik, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Messias, Joao et al (show 1 more authors)
(2019)
Learning from Demonstration in the Wild.
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019-M.
pp. 775-781.
Katt, Sammie, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2017)
Learning in POMDPs with Monte Carlo Tree Search.
, 34th International Conference on Machine Learning (ICML 2017).
Roijers, Diederik M, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2014)
Linear Support for Multi-Objective Coordination Graphs.
.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ, Terwijn, Bas, Robbel, Philipp and Messias, João V
(2017)
The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems.
Journal of Machine Learning Research, 18.
89:1-89:1.
Kedege, Vibhav Inna, Czechowski, Aleksander, Stellingwerff, Ludo and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2022)
Multi Robot Surveillance and Planning in Limited Communication Environments.
In: 14th International Conference on Agents and Artificial Intelligence, 2022-2-3 - 2022-2-5.
Efremova, Julia, Ranjbar-Sahraei, Bijan, Rahmani, Hossein, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Calders, Toon, Tuyls, Karl and Weiss, Gerhard
(2015)
Multi-Source Entity Resolution for Genealogical Data.
In:
Population Reconstruction.
Springer International Publishing, pp. 129-154.
ISBN 9783319198835
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Vlassis, Nikos
(2008)
Optimal and approximate Q-value functions for decentralized POMDPs.
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 32.
pp. 289-353.
Irissappane, Athirai A, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Zhang, Jie
(2014)
A POMDP Based Approach to Optimally Select Sellers in Electronic Marketplaces.
.
Roijers, Diederik M, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015)
Point-Based Planning for Multi-Objective POMDPs.
In: Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence.
Satsangi, Yash, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2016)
Probably Approximately Correct Greedy Maximization.
AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, abs/16.
pp. 1387-1388.
Satsangi, Yash, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2016)
Probably Approximately Correct Greedy Maximization: (Extended Abstract).
.
Satsangi, Yash, Whiteson, Shimon, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Bouma, Henri
(2017)
Real-Time Resource Allocation for Tracking Systems.
In: Conference on Uncertainty in Artificial Intelligence.
Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul and Whiteson, Shimon
(2019)
The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning.
[Internet Publication]
Irissappane, Athirai A, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Zhang, Jie
(2016)
A Scalable Framework to Choose Sellers in E-Marketplaces Using POMDPs.
.
Amato, Christopher and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015)
Scalable Planning and Learning for Multiagent POMDPs.
In: 2015.
Amato, Christopher and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015)
Scalable Planning and Learning for Multiagent POMDPs.
.
Amato, Christopher and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2014)
Scalable Planning and Learning for Multiagent POMDPs: Extended Version.
ArXiv e-prints, arXiv:.
Irissappane, Athirai A, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Zhang, Jie
(2015)
Scaling POMDPs For Selecting Sellers in E-markets-Extended Version.
CoRR, abs/15.
Irissappane, Athirai A, Zhang, Jie, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Dutta, Partha S
(2015)
Secure Routing in Wireless Sensor Networks via POMDPs.
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015-J.
pp. 2617-2623.
Alves, Flavia, Gairing, Martin, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Do, Thanh-Toan
(2020)
Sensor Data for Human Activity Recognition: Feature Representation and Benchmarking.
In: 2020 International Joint Conference on Neural Networks (IJCNN), 2020-7-19 - 2020-7-24, Glasgow.
Scharpff, Joris, Roijers, Diederik M, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Weerdt, Mathijs de
(2015)
Solving Multi-agent MDPs Optimally with Conditional Return Graphs.
.
Scharpff, Joris, Roijers, Diederik M, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and de Weerdt, Mathijs M
(2016)
Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions.
.
Scharpff, Joris, Roijers, Diederik M, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Weerdt, Mathijs M de
(2015)
Solving Transition-Independent Multi-agent MDPs with Sparse Interactions
(Extended version).
CoRR, abs/15.
Wiggers, Auke J, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Roijers, Diederik M
(2016)
Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information.
In: The Tenth AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), Istanbul, Turkey.
Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Witwicki, Stefan and Kaelbling, Leslie P
(2021)
A Sufficient Statistic for Influence in Structured Multiagent Environments.
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 70.
pp. 789-870.
Roijers, Diederik M, Whiteson, Shimon, Ihler, Alex and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015)
Variational Multi-Objective Coordination.
.