Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2022)
Alternating Good-for-MDP Automata.
[Preprint]
Bhave, Devendra, Jha, Sagar, Krishna, Shankara Narayanan, Schewe, Sven ORCID: 0000-0002-9093-9518 and Trivedi, Ashutosh
(2015)
Bounded-rate multi-mode systems based motion planning.
In: Proceedings of the 18th International Conference on Hybrid Systems Computation and Control - HSCC '15.
ACM, pp. 41-50.
ISBN 9781450334334
Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2020)
Faithful and Effective Reward Schemes for Model-Free Reinforcement Learning of Omega-Regular Objectives.
In: ATVA, 2020-10-19 - 2020-10-23.
Hahn, Ernst Moritz ORCID: 0000-0002-9348-7684, Perez, Mateo ORCID: 0000-0003-4220-3212, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio ORCID: 0000-0002-2085-2003, Trivedi, Ashutosh ORCID: 0000-0001-9346-0126 and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2020)
Good-for-MDPs Automata for Probabilistic Analysis and Reinforcement Learning.
In: TACAS, 2020-4-25 - 2020-4-30, Dublin.
Hahn, Ernst Moritz, Trivedi, Ashutosh, Perez, Mateo, Somenzi, Fabio, Schewe, Sven ORCID: 0000-0002-9093-9518 and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2022)
An Impossibility Result in Automata-Theoretic Reinforcement Learning.
In: ATVA22.
Gupta, Anshul, Schewe, Sven ORCID: 0000-0002-9093-9518, Trivedi, Ashutosh, Deepak, Maram Sai Krishna and Padarthi, Bharath Kumar
(2016)
Incentive Stackelberg Mean-Payoff Games.
Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2021)
Model-Free Reinforcement Learning for Branching Markov Decision Processes.
Computer Aided Verification, Pt II, CAV 2021, 12760.
pp. 651-673.
Hahn, Ernst-Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2021)
Model-Free Reinforcement Learning for Branching Markov Decision Processes.
In: International Conference on Computer Aided Verification.
Hahn, Ernst-Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2021)
Model-Free Reinforcement Learning for Lexicographic Omega-Regular Objectives.
In: International Symposium on Formal Methods.
Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2023)
Multi-objective ω-Regular Reinforcement Learning.
Formal Aspects of Computing, 35 (2).
pp. 1-24.
Hahn, Ernst Moritz ORCID: 0000-0002-9348-7684, Perez, Mateo ORCID: 0000-0003-4220-3212, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio ORCID: 0000-0002-2085-2003, Trivedi, Ashutosh ORCID: 0000-0001-9346-0126 and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2023)
Mungojerrie: Linear-Time Objectives in Model-Free Reinforcement Learning.
In: Tools and Algorithms for the Construction and Analysis of Systems.
Springer Nature Switzerland, pp. 527-545.
ISBN 9783031308222
Hahn, EM, Perez, Mateo, Schewe, S ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, DK ORCID: 0000-0001-5560-0546
(2019)
Omega-Regular Objectives in Model-Free Reinforcement Learning.
In: 25th International Conference on Tools and Algorithms for the Construction and Analysis of Systems (TACAS), 2019-4-6 - 2019-4-11, Prague, Czech Republic.
Das, Ankush, Krishna, Shankara Narayanan, Manasa, Lakshmi, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2015)
On Pure Nash Equilibria in Stochastic Games.
In: Theory and Applications of Models of Computation.
Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven ORCID: 0000-0002-9093-9518, Somenzi, Fabio, Trivedi, Ashutosh and Wojtczak, Dominik ORCID: 0000-0001-5560-0546
(2022)
Reinforcement Learning with Guarantees That Hold for Ever.
Schewe, Sven ORCID: 0000-0002-9093-9518, Trivedi, Ashutosh and Varghese, Thomas
(2015)
Symmetric Strategy Improvement.
In: Automata, Languages, and Programming.
Lecture Notes in Computer Science, 9135.
Springer, pp. 388-400.
ISBN 978-3-662-47665-9