Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik
(2022)
Reinforcement Learning with Guarantees That Hold for Ever.
.
Text
paper_inv-2.pdf - Author Accepted Manuscript Download (665kB) | Preview |
Abstract
Reinforcement learning is a successful explore-and-exploit approach, where a controller tries to learn how to navigate an unknown environment. The principle approach is for an intelligent agent to learn how to maximise expected rewards. But what happens if the objective refers to non-terminating systems? We can obviously not wait until an infinite amount of time has passed, assess the success, and update. But what can we do? This talk will tell.
Item Type: | Conference or Workshop Item (Unspecified) |
---|---|
Uncontrolled Keywords: | 46 Information and Computing Sciences, 4602 Artificial Intelligence, 4611 Machine Learning |
Divisions: | Faculty of Science and Engineering > School of Electrical Engineering, Electronics and Computer Science |
Depositing User: | Symplectic Admin |
Date Deposited: | 05 Aug 2022 11:24 |
Last Modified: | 20 Jun 2024 20:24 |
DOI: | 10.1007/978-3-031-15008-1_1 |
Related URLs: | |
URI: | https://livrepository.liverpool.ac.uk/id/eprint/3160444 |