Jerome, Joseph, Sanchez-Betancourt, Leandro, Savani, Rahul (ORCID: 0000-0003-1262-7831) and Herdegen, Martin
(2023)
Mbt-gym: Reinforcement learning for model-based limit order book trading
In: 4th ACM International Conference on AI in Finance.
Abstract
Within the mathematical finance literature there is a rich catalogue of mathematical models for studying algorithmic trading problems such as market making and optimal execution. This paper introduces mbt_gym, a Python module that provides a suite of gym environments for training reinforcement learning (RL) agents to solve such model-based trading problems in limit order books. The module is set up in an extensible way to allow the combination of different aspects of different models. It supports highly efficient implementations of vectorised environments to allow faster training of RL agents. In this paper, we motivate the use of RL to solve such model-based limit order book problems, we explain the design of our gym environment, and then demonstrate its use and resulting insights from solving standard and novel problems. Finally, we lay out a roadmap for further development and use of our module for research into limit-order-book trading.
| Item Type: | Conference Item (Unspecified) |
|---|---|
| Uncontrolled Keywords: | limit order book, market making, optimal execution, liquidity provision, inventory risk, reinforcement learning |
| Divisions: | Faculty of Science & Engineering > School of Electrical Engineering, Electronics and Computer Science |
| Depositing User: | Symplectic Admin |
| Date Deposited: | 27 Nov 2023 09:07 |
| Last Modified: | 23 May 2026 08:13 |
| DOI: | 10.1145/3604237.3626873 |
| Open Access URL: | https://dl.acm.org/doi/10.1145/3604237.3626873 |
| URI: | https://livrepository.liverpool.ac.uk/id/eprint/3177015 |