Papers
Topics
Authors
Recent
2000 character limit reached

MV Singular Dividend Control

Updated 13 November 2025
  • The paper introduces a mean–variance dividend control model that incorporates a variance penalty, creating a time-inconsistent framework resolved using equilibrium strategies.
  • It formulates the surplus process via drifted Brownian motion and derives explicit analytical and numerical solutions for both high and low risk-aversion regimes.
  • The study establishes an extended verification theorem within an HJB-variational system, emphasizing barrier strategies and the critical role of risk aversion in dividend optimization.

The MV singular dividend control problem refers to the singular stochastic control problem in which one seeks to maximize the mean-variance (MV) functional of discounted dividend payments until ruin, for a surplus process governed by a drifted Brownian motion subject to singular dividend payout controls. This formulation extends the classical de Finetti optimal dividend problem by introducing aversion to variability in dividend payouts, and thereby incorporates a risk/variance penalty in the objective. Unlike its classical counterpart, the MV criterion renders the problem time-inconsistent, requiring equilibrium (rather than optimal) strategies established via an intrapersonal game-theoretic framework. The mathematical mechanisms, equilibrium classification, verification theorem, and numerical evaluation are described below.

1. Model Formulation and Surplus Dynamics

The MV singular dividend control problem is set on a filtered probability space (Ω,F,{Ft},P)(\Omega, \mathcal{F}, \{\mathcal{F}_t\}, \mathbb{P}) supporting a standard Brownian motion BB. The uncontrolled surplus process evolves according to

dXt=adt+bdBt,dX_t = a\,dt + b\,dB_t,

for fixed drift a>0a>0 and volatility b>0b>0. The dividend control is represented by a nondecreasing, Ft\mathcal{F}_t-adapted, càdlàg process D=(Dt)t0D = (D_t)_{t \geq 0} with the technical constraint ΔDtXt\Delta D_t \leq X_{t^-} to ensure that dividend payments do not exceed available surplus.

Under an admissible strategy DD, the controlled surplus is given by the singular stochastic differential equation (SDE)

dXtD=adt+bdBtdDt,X0=x>0,dX_t^D = a\,dt + b\,dB_t - dD_t,\quad X_{0^-} = x > 0,

with ruin time defined by

τD=inf{t0:XtD0}.\tau^D = \inf\{ t \geq 0 : X_t^D \leq 0 \}.

Dividend payments are frozen upon ruin: Dt=DτDD_t = D_{\tau^D} for all tτDt \geq \tau^D.

2. Mean-Variance Dividend Objective

Let ρ>0\rho > 0 denote the discount rate. The total discounted future dividends, for t<τDt < \tau^D, are prescribed by

YtD=tτDeρ(st)dDs,Y_t^D = \int_t^{\tau^D} e^{-\rho(s-t)}\,dD_s,

with YtD=0Y_t^D = 0 for tτDt \geq \tau^D. The MV criterion—incorporating both expected value and a variance penalty—is parametrized by a risk-aversion coefficient γ>0\gamma > 0: J(x,t;D)=Ex,t[YtD]γ2Varx,t[YtD],J(x, t; D) = \mathbb{E}_{x, t}[Y_t^D] - \frac{\gamma}{2} \operatorname{Var}_{x, t}[Y_t^D], where the expectation and variance are conditional on XtD=xX_{t^-}^D = x. Equivalently, letting G(x,t)=Ex,t[YtD]G(x, t) = \mathbb{E}_{x, t}[Y_t^D] and H(x,t)=Ex,t[(YtD)2]H(x, t) = \mathbb{E}_{x, t}[(Y_t^D)^2], the criterion can be expressed as

J(x,t;D)=G(x,t)γ2(H(x,t)G(x,t)2).J(x, t; D) = G(x, t) - \frac{\gamma}{2}(H(x, t) - G(x, t)^2).

This objective balances maximizing the expected dividend payouts with minimizing their variability before ruin.

3. Time-Inconsistency and Game-Theoretic Equilibrium Definition

The inclusion of a variance penalty in the objective destroys the Bellman optimality principle, making the control problem dynamically inconsistent—an optimal policy at tt need not remain optimal at future times s>ts > t. To address this, the problem is reframed as an intrapersonal dynamic game: strategies must constitute weak equilibria in the sense of Björk–Murgoci (2010) and Dai–Jin–Jia (2024). The equilibrium requirement is as follows: given a candidate strategy DD^*, perturbed over a small interval [t,t+ε)[t, t + \varepsilon) by lump-sum payout dxd \leq x, the first-order change in JJ must be nonpositive as ε0\varepsilon \to 0: lim infε0+J(x,t;D)J(x,t;Dε)ε0,\liminf_{\varepsilon \to 0^+} \frac{J(x, t; D^*) - J(x, t; D^\varepsilon)}{\varepsilon} \geq 0, for every x>0x > 0, t0t \geq 0, d[0,x]d \in [0, x], where DεD^\varepsilon is the perturbed strategy. The equilibrium dividend strategy DD^* yields the equilibrium value function V(x,t)=J(x,t;D)V(x, t) = J(x, t; D^*).

4. Verification Theorem and HJB-Variational System

The extended verification theorem is central to the solution methodology. Define the second-order differential operator

Mϕ(x,t)=tϕ+axϕ+12b2xx2ϕ.\mathcal{M}\phi(x, t) = \partial_t \phi + a\,\partial_x \phi + \frac{1}{2}b^2\,\partial_{xx}^2 \phi.

The state-time space R+2\mathbb{R}_+^2 is decomposed into “pay” and “no-transaction” regions based on the gradient of the candidate value function: Pay:={(x,t):xV(x,t)=1},NT:=R+2Pay.\operatorname{Pay} := \{ (x, t) : \partial_x V(x, t) = 1 \}, \qquad \operatorname{NT} := \mathbb{R}_+^2 \setminus \operatorname{Pay}. Theorem 3.1 stipulates that if smooth functions V,G,HV, G, H satisfy: (i) boundary conditions V(0,t)=G(0,t)=H(0,t)=0V(0, t) = G(0, t) = H(0, t) = 0; (ii) the extended HJB-variational system

{max{MVγ2M(G2)+γGMGρG+γρ(HG2),  1xV}=0, (MGρG)1NT+(1xG)1Pay=0, (MH2ρH)1NT+(2GxH)1Pay=0,\begin{cases} \max\left\{ \mathcal{M}V - \frac{\gamma}{2} \mathcal{M}(G^2) + \gamma G\, \mathcal{M}G - \rho G + \gamma\rho (H - G^2),\; 1 - \partial_x V \right\} = 0,\ (\mathcal{M}G - \rho G)\,1_{\operatorname{NT}} + (1 - \partial_x G)\,1_{\operatorname{Pay}} = 0,\ (\mathcal{M}H - 2\rho H)\,1_{\operatorname{NT}} + (2G - \partial_x H)\,1_{\operatorname{Pay}} = 0, \end{cases}

then the minimal reflection strategy (lump-sum payout in Pay\operatorname{Pay}, local-time reflection at the region boundary) is a time-consistent equilibrium. The equilibrium V(x,t)V(x, t) is realized, with

G(x,t)=Ex,t[YtD],H(x,t)=Ex,t[(YtD)2].G(x, t) = \mathbb{E}_{x, t}[Y_t^{D^*}],\quad H(x, t) = \mathbb{E}_{x, t}[(Y_t^{D^*})^2].

This verification theorem enables constructive identification of equilibrium solutions in analytic and semi-explicit form under suitable smoothness and concavity conditions.

5. Analytical Equilibria: Large and Small Risk Aversion

The paper characterizes two principal equilibrium regimes depending on the risk aversion parameter γ\gamma:

Case A: High Risk Aversion (γ2a/b2\gamma \geq 2a/b^2)

The unique equilibrium is immediate liquidation: all surplus is paid out at t=0t = 0, triggering instant ruin. Explicitly,

G(x,t)=x,H(x,t)=x2,V(x,t)=x.G(x, t) = x,\quad H(x, t) = x^2,\quad V(x, t) = x.

The value function is linear in surplus xx; the criterion’s gradient always meets the payout region condition xV=1\partial_x V = 1.

Case B: Low Risk Aversion (0<γ<ε2a/b20 < \gamma < \varepsilon \leq 2a/b^2)

Here, the equilibrium takes the form of a constant-barrier strategy: pay all surplus down to barrier x~\tilde{x} when above it, otherwise wait. On the no-pay region [0,x~)[0, \tilde{x}), the relevant ODEs are

ρG+aG+12b2G=0;2ρH+aH+12b2H=0,- \rho G + a G' + \frac{1}{2}b^2 G'' = 0;\quad -2\rho H + a H' + \frac{1}{2}b^2 H'' = 0,

with solutions

G(x)=C1(er1xer2x),H(x)=C3(er3xer4x),G(x) = C_1(e^{r_1 x} - e^{r_2 x}),\quad H(x) = C_3(e^{r_3 x} - e^{r_4 x}),

where

r1,2=a±a2+2ρb2b2,r3,4=a±a2+4ρb2b2.r_{1,2} = \frac{-a \pm \sqrt{a^2 + 2\rho b^2}}{b^2},\quad r_{3,4} = \frac{-a \pm \sqrt{a^2 + 4\rho b^2}}{b^2}.

On the pay region [x~,)[ \tilde{x}, \infty ), continuity and matching requirements yield

G(x)=G(x~)+(xx~),H(x)=H(x~)+2G(x~)(xx~)+(xx~)2.G(x) = G(\tilde{x}) + (x - \tilde{x}),\quad H(x) = H(\tilde{x}) + 2G(\tilde{x})(x - \tilde{x}) + (x - \tilde{x})^2.

Smooth pasting fixes the constants, and the barrier x~\tilde{x} solves an explicitly derived nonlinear equation f(x~,γ)=0f(\tilde{x}, \gamma) = 0, with concavity requirement V(x)<0V''(x) < 0 domainwise.

6. Numerical Illustration and Parametric Regimes

For parameters a=1a = 1, b=0.25b = 0.25, ρ=0.2\rho = 0.2, the critical risk aversion is numerically 2ab2=32\frac{2a}{b^2} = 32. For γ=0\gamma = 0, the classical de Finetti result is recovered with barrier x~0.3141\tilde{x} \approx 0.3141. The function f(x,γ)f(x, \gamma) admits a unique positive root x~(γ)\tilde{x}(\gamma) increasing with γ\gamma up to approximately γˉ0.139732\bar{\gamma} \approx 0.1397 \ll 32, beyond which concavity fails. For instance, at γ=0.13\gamma = 0.13, the barrier is x~(0.13)0.3232\tilde{x}(0.13) \approx 0.3232, and the corresponding value function is strictly concave below the barrier.

Key parametric findings include:

  • The equilibrium barrier x~(γ)\tilde{x}(\gamma) is strictly increasing in γ\gamma for γ[0,γˉ]\gamma \in [0, \bar{\gamma}].
  • The equilibrium payoff V(x;γ)V(x; \gamma) decreases with increasing γ\gamma, as expected from stronger penalties on variance.
  • Increasing impatience (ρ\rho) lowers the barrier, while higher drift (aa) or reduced volatility (bb) modify admissible risk aversion and the barrier in line with economic intuition.

The MV singular dividend control problem thus presents a distinct dichotomy: high risk aversion leads to immediate payout (liquidation), while lower risk tolerance generates an explicit barrier equilibrium up to a numerically determined threshold for γ\gamma. This threshold is strictly less than 2a/b22a/b^2 except in the degenerate liquidation regime.

7. Relation to Classical and Multidimensional Dividend Control

The MV singular dividend problem generalizes the de Finetti classical dividend control by integrating a reward-risk tradeoff directly into the objective. The time-inconsistency induced by the MV criterion necessitates equilibrium methods not present in the classical Bellman HJB approach. This framework is conceptually distinct from multidimensional singular dividend/switching problems such as those treated in Azcue–Muler (Azcue et al., 2018), which concern n-dimensional compound Poisson surplus models with optimal switching and penalty at ruin, addressed via viscosity solution theory and monotone finite-difference fixed-point schemes. Nonetheless, both lines of research rigorously elaborate the interplay between singular controls, dynamic programming, and non-standard objectives, with the MV singular control problem providing a mean–variance-driven paradigm for solvency and payout optimization.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Topic to Video (Beta)

Follow Topic

Get notified by email when new papers are published related to MV Singular Dividend Control Problem.