Point-based planning for multi-objective POMDPs


Roijers, DM, Whiteson, S and Oliehoek, FA (2015) Point-based planning for multi-objective POMDPs. In: UNSPECIFIED, ? - ?.

[img] Text
Roijers15IJCAI.pdf - Accepted Version

Download (777kB)

Abstract

Many sequential decision-making problems require an agent to reason about both multiple objectives and uncertainty regarding the environment's state. Such problems can be naturally modelled as multi-objective partially observable Markov decision processes (MOPOMDPs). We propose optimistic linear support with alpha reuse (OLSAR), which computes a bounded approximation of the optimal solution set for all possible weightings of the objectives. The main idea is to solve a series of scalarized single-objective POMDPs, each corresponding to a different weighting of the objectives. A key insight underlying OLSAR is that the policies and value functions produced when solving scalarized POMDPs in earlier iterations can be reused to more quickly solve scalarized POMDPs in later iterations. We show experimentally that OLSAR outperforms, both in terms of runtime and approximation quality, alternative methods and a variant of OLSAR that does not leverage reuse.

Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Depositing User: Symplectic Admin
Date Deposited: 20 Oct 2015 07:59
Last Modified: 18 May 2018 06:10
URI: http://livrepository.liverpool.ac.uk/id/eprint/2032383

Actions (Repository Staff Only)

Edit Item Edit Item