Reinforcement Learning for Jump-Diffusions, with Financial Applications (2405.16449v2)

Published 26 May 2024 in cs.LG, math.OC, and q-fin.MF

Abstract: We study continuous-time reinforcement learning (RL) for stochastic control in which system dynamics are governed by jump-diffusion processes. We formulate an entropy-regularized exploratory control problem with stochastic policies to capture the exploration--exploitation balance essential for RL. Unlike the pure diffusion case initially studied by Wang et al. (2020), the derivation of the exploratory dynamics under jump-diffusions calls for a careful formulation of the jump part. Through a theoretical analysis, we find that one can simply use the same policy evaluation and $q$-learning algorithms in Jia and Zhou (2022a, 2023), originally developed for controlled diffusions, without needing to check a priori whether the underlying data come from a pure diffusion or a jump-diffusion. However, we show that the presence of jumps ought to affect parameterizations of actors and critics in general. We investigate as an application the mean--variance portfolio selection problem with stock price modelled as a jump-diffusion, and show that both RL algorithms and parameterizations are invariant with respect to jumps. Finally, we present a detailed study on applying the general theory to option hedging.

References (42)

Authors (3)

Xuefeng Gao (28 papers)
Lingfei Li (10 papers)
Xun Yu Zhou (33 papers)

Citations (2)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/QFinancePapers/status/1821830409165947140

https://twitter.com/QFinancePapers/status/1877008315911135611

https://twitter.com/QFinancePapers/status/1795341052481610213

Reinforcement Learning for Jump-Diffusions, with Financial Applications (2405.16449v2)

Summary

Related Papers

Tweets