Browse by People


Up a level
Export as [feed] RSS [feed] RSS 2.0 Short Author List
Number of items: 51.


Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831 and Whiteson, Shimon
(2021) Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning. AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 35 (2). 25-.


Katt, Sammie, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2019) Bayesian Reinforcement Learning in Factored POMDPs. Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems, abs/18. pp. 7-15.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2014) Best Response Bayesian Reinforcement Learning for Multiagent Systems with State Uncertainty. .


Çelikok, Mustafa Mert, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Kaski, Samuel
(2022) Best-Response Bayesian Reinforcement Learning with Bayes-adaptive POMDPs for Centaurs. [Preprint]


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831, Gallego, Jose, van der Pol, Elise and Gross, Roderich
(2019) Beyond Local Nash Equilibria for Adversarial Networks. Springer International Publishing.


Roijers, Diederik M, Scharpff, Joris, Spaan, Matthijs TJ, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, de Weerdt, Mathijs and Whiteson, Shimon
(2014) Bounded Approximations for Linear Multi-Objective Planning Under Uncertainty. .


Roijers, Diederik, Scharpff, Joris, Spaan, Matthijs, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Weerdt, Mathijs De and Whiteson, Shimon
(2014) Bounded Approximations for Linear Multi-Objective Planning under Uncertainty (Extended Abstract). .


Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2014) Dec-POMDPs as Non-Observable MDPs. [Report]


Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831
(2020) Difference Rewards Policy Gradients. [Internet Publication]


Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Savani, Rahul ORCID: 0000-0003-1262-7831
(2022) Difference rewards policy gradients. .


Claes, Daniel, Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Tuyls, Karl, Hennes, Daniel and van der Hoek, Wiebe
(2015) Effective Approximations for Multi-Robot Coordination in Spatially Distributed Tasks. In: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, Istanbul, Turkey.


Claes, Daniel, Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Hennes, Daniel, Tuyls, Karl and Van der Hoek, Wiebe
(2015) Effective Approximations for Spatial Task Allocation Problems. .


Kanters, Timon V, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Kaisers, Michael, van den Bosch, Stan R, Grispen, Joep and Hermans, Jeroen
(2016) Energy- and Cost-Efficient Pumping Station Control. .


Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Kochenderfer, Mykel J
(2015) Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs (Extended Version). In: AAAI Fall Symposium on Sequential Decision Making in Intelligent Agents, Westin Arlington Gateway in Arlington, Virginia adjacent to Washington, DC.


Robbel, Philipp, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Kochenderfer, Mykel J
(2016) Exploiting Anonymity in Approximate Linear Programming: Scaling to Large Multiagent MDPs. .


Satsangi, Yash, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015) Exploiting Submodular Value Functions for Faster Dynamic Sensor Selection. In: 2015.


Satsangi, Yash, Whiteson, Shimon, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Spaan, Matthijs TJ
(2018) Exploiting submodular value functions for scaling up active perception. AUTONOMOUS ROBOTS, 42 (2). pp. 209-233.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan J
(2015) Factored Upper Bounds for Multiagent Planning Problems under Uncertainty with Non-Factored Value Functions. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015-J. pp. 1645-1651.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul ORCID: 0000-0003-1262-7831, Gallego-Posada, Jose, Pol, Elise van der, Jong, Edwin D de and Gross, Roderich
(2017) GANGs: Generative Adversarial Network Games. . (Unpublished)


Castellini, Jacopo
(2022) Improved Representations for Cooperative Multi-Agent Reinforcement Learning. Doctor of Philosophy thesis, University of Liverpool.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan
(2015) Influence-Optimistic Local Values for Multiagent Planning. .


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan J
(2015) Influence-Optimistic Local Values for Multiagent Planning. In: Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, Istanbul, Turkey.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Witwicki, Stefan
(2015) Influence-Optimistic Local Values for Multiagent Planning --- Extended Version. CoRR, abs/15.


Suau, Miguel, He, Jinke, Congeduti, Elena, Starre, Rolf AN, Czechowski, Aleksander and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2022) Influence-aware memory architectures for deep reinforcement learning in POMDPs. NEURAL COMPUTING & APPLICATIONS.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2018) Interactive Learning and Decision Making: Foundations, Insights & Challenges. In: Twenty-Seventh International Joint Conference on Artificial Intelligence {IJCAI-18}, 2018-7-13 - 2018-7-19, Stockholm.


Behbahani, Feryal, Shiarlis, Kyriacos, Chen, Xi, Kurin, Vitaly, Kasewa, Sudhanshu, Stirbu, Ciprian, Gomes, Joao, Paul, Supratik, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Messias, Joao
et al (show 1 more authors) (2019) Learning from Demonstration in the Wild. 2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019-M. pp. 775-781.


Katt, Sammie, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Amato, Christopher
(2017) Learning in POMDPs with Monte Carlo Tree Search. , 34th International Conference on Machine Learning (ICML 2017).


Roijers, Diederik M, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2014) Linear Support for Multi-Objective Coordination Graphs. .


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ, Terwijn, Bas, Robbel, Philipp and Messias, João V
(2017) The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems. Journal of Machine Learning Research, 18. 89:1-89:1.


Kedege, Vibhav Inna, Czechowski, Aleksander, Stellingwerff, Ludo and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2022) Multi Robot Surveillance and Planning in Limited Communication Environments. In: 14th International Conference on Agents and Artificial Intelligence, 2022-2-3 - 2022-2-5.


Efremova, Julia, Ranjbar-Sahraei, Bijan, Rahmani, Hossein, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Calders, Toon, Tuyls, Karl and Weiss, Gerhard
(2015) Multi-Source Entity Resolution for Genealogical Data. In: Population Reconstruction. Springer International Publishing, pp. 129-154. ISBN 9783319198835


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Vlassis, Nikos
(2008) Optimal and approximate Q-value functions for decentralized POMDPs. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 32. pp. 289-353.


Irissappane, Athirai A, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Zhang, Jie
(2014) A POMDP Based Approach to Optimally Select Sellers in Electronic Marketplaces. .


Roijers, Diederik M, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015) Point-Based Planning for Multi-Objective POMDPs. In: Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence.


Satsangi, Yash, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2016) Probably Approximately Correct Greedy Maximization. AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, abs/16. pp. 1387-1388.


Satsangi, Yash, Whiteson, Shimon and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2016) Probably Approximately Correct Greedy Maximization: (Extended Abstract). .


Satsangi, Yash, Whiteson, Shimon, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Bouma, Henri
(2017) Real-Time Resource Allocation for Tracking Systems. In: Conference on Uncertainty in Artificial Intelligence.


Castellini, Jacopo, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Savani, Rahul and Whiteson, Shimon
(2019) The Representational Capacity of Action-Value Networks for Multi-Agent Reinforcement Learning. [Internet Publication]


Irissappane, Athirai A, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Zhang, Jie
(2016) A Scalable Framework to Choose Sellers in E-Marketplaces Using POMDPs. .


Amato, Christopher and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015) Scalable Planning and Learning for Multiagent POMDPs. In: 2015.


Amato, Christopher and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015) Scalable Planning and Learning for Multiagent POMDPs. .


Amato, Christopher and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2014) Scalable Planning and Learning for Multiagent POMDPs: Extended Version. ArXiv e-prints, arXiv:.


Irissappane, Athirai A, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Zhang, Jie
(2015) Scaling POMDPs For Selecting Sellers in E-markets-Extended Version. CoRR, abs/15.


Irissappane, Athirai A, Zhang, Jie, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Dutta, Partha S
(2015) Secure Routing in Wireless Sensor Networks via POMDPs. PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015-J. pp. 2617-2623.


Alves, Flavia, Gairing, Martin, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Do, Thanh-Toan
(2020) Sensor Data for Human Activity Recognition: Feature Representation and Benchmarking. In: 2020 International Joint Conference on Neural Networks (IJCNN), 2020-7-19 - 2020-7-24, Glasgow.


Scharpff, Joris, Roijers, Diederik M, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Weerdt, Mathijs de
(2015) Solving Multi-agent MDPs Optimally with Conditional Return Graphs. .


Scharpff, Joris, Roijers, Diederik M, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and de Weerdt, Mathijs M
(2016) Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions. .


Scharpff, Joris, Roijers, Diederik M, Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Spaan, Matthijs TJ and Weerdt, Mathijs M de
(2015) Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version). CoRR, abs/15.


Wiggers, Auke J, Oliehoek, Frans A ORCID: 0000-0003-4372-5055 and Roijers, Diederik M
(2016) Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information. In: The Tenth AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM), Istanbul, Turkey.


Oliehoek, Frans A ORCID: 0000-0003-4372-5055, Witwicki, Stefan and Kaelbling, Leslie P
(2021) A Sufficient Statistic for Influence in Structured Multiagent Environments. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 70. pp. 789-870.


Roijers, Diederik M, Whiteson, Shimon, Ihler, Alex and Oliehoek, Frans A ORCID: 0000-0003-4372-5055
(2015) Variational Multi-Objective Coordination. .

This list was generated on Wed Jan 24 10:35:34 2024 GMT.