MV Singular Dividend Control
- The paper introduces a mean–variance dividend control model that incorporates a variance penalty, creating a time-inconsistent framework resolved using equilibrium strategies.
- It formulates the surplus process via drifted Brownian motion and derives explicit analytical and numerical solutions for both high and low risk-aversion regimes.
- The study establishes an extended verification theorem within an HJB-variational system, emphasizing barrier strategies and the critical role of risk aversion in dividend optimization.
The MV singular dividend control problem refers to the singular stochastic control problem in which one seeks to maximize the mean-variance (MV) functional of discounted dividend payments until ruin, for a surplus process governed by a drifted Brownian motion subject to singular dividend payout controls. This formulation extends the classical de Finetti optimal dividend problem by introducing aversion to variability in dividend payouts, and thereby incorporates a risk/variance penalty in the objective. Unlike its classical counterpart, the MV criterion renders the problem time-inconsistent, requiring equilibrium (rather than optimal) strategies established via an intrapersonal game-theoretic framework. The mathematical mechanisms, equilibrium classification, verification theorem, and numerical evaluation are described below.
1. Model Formulation and Surplus Dynamics
The MV singular dividend control problem is set on a filtered probability space supporting a standard Brownian motion . The uncontrolled surplus process evolves according to
for fixed drift and volatility . The dividend control is represented by a nondecreasing, -adapted, càdlàg process with the technical constraint to ensure that dividend payments do not exceed available surplus.
Under an admissible strategy , the controlled surplus is given by the singular stochastic differential equation (SDE)
with ruin time defined by
Dividend payments are frozen upon ruin: for all .
2. Mean-Variance Dividend Objective
Let denote the discount rate. The total discounted future dividends, for , are prescribed by
with for . The MV criterion—incorporating both expected value and a variance penalty—is parametrized by a risk-aversion coefficient : where the expectation and variance are conditional on . Equivalently, letting and , the criterion can be expressed as
This objective balances maximizing the expected dividend payouts with minimizing their variability before ruin.
3. Time-Inconsistency and Game-Theoretic Equilibrium Definition
The inclusion of a variance penalty in the objective destroys the Bellman optimality principle, making the control problem dynamically inconsistent—an optimal policy at need not remain optimal at future times . To address this, the problem is reframed as an intrapersonal dynamic game: strategies must constitute weak equilibria in the sense of Björk–Murgoci (2010) and Dai–Jin–Jia (2024). The equilibrium requirement is as follows: given a candidate strategy , perturbed over a small interval by lump-sum payout , the first-order change in must be nonpositive as : for every , , , where is the perturbed strategy. The equilibrium dividend strategy yields the equilibrium value function .
4. Verification Theorem and HJB-Variational System
The extended verification theorem is central to the solution methodology. Define the second-order differential operator
The state-time space is decomposed into “pay” and “no-transaction” regions based on the gradient of the candidate value function: Theorem 3.1 stipulates that if smooth functions satisfy: (i) boundary conditions ; (ii) the extended HJB-variational system
then the minimal reflection strategy (lump-sum payout in , local-time reflection at the region boundary) is a time-consistent equilibrium. The equilibrium is realized, with
This verification theorem enables constructive identification of equilibrium solutions in analytic and semi-explicit form under suitable smoothness and concavity conditions.
5. Analytical Equilibria: Large and Small Risk Aversion
The paper characterizes two principal equilibrium regimes depending on the risk aversion parameter :
Case A: High Risk Aversion ()
The unique equilibrium is immediate liquidation: all surplus is paid out at , triggering instant ruin. Explicitly,
The value function is linear in surplus ; the criterion’s gradient always meets the payout region condition .
Case B: Low Risk Aversion ()
Here, the equilibrium takes the form of a constant-barrier strategy: pay all surplus down to barrier when above it, otherwise wait. On the no-pay region , the relevant ODEs are
with solutions
where
On the pay region , continuity and matching requirements yield
Smooth pasting fixes the constants, and the barrier solves an explicitly derived nonlinear equation , with concavity requirement domainwise.
6. Numerical Illustration and Parametric Regimes
For parameters , , , the critical risk aversion is numerically . For , the classical de Finetti result is recovered with barrier . The function admits a unique positive root increasing with up to approximately , beyond which concavity fails. For instance, at , the barrier is , and the corresponding value function is strictly concave below the barrier.
Key parametric findings include:
- The equilibrium barrier is strictly increasing in for .
- The equilibrium payoff decreases with increasing , as expected from stronger penalties on variance.
- Increasing impatience () lowers the barrier, while higher drift () or reduced volatility () modify admissible risk aversion and the barrier in line with economic intuition.
The MV singular dividend control problem thus presents a distinct dichotomy: high risk aversion leads to immediate payout (liquidation), while lower risk tolerance generates an explicit barrier equilibrium up to a numerically determined threshold for . This threshold is strictly less than except in the degenerate liquidation regime.
7. Relation to Classical and Multidimensional Dividend Control
The MV singular dividend problem generalizes the de Finetti classical dividend control by integrating a reward-risk tradeoff directly into the objective. The time-inconsistency induced by the MV criterion necessitates equilibrium methods not present in the classical Bellman HJB approach. This framework is conceptually distinct from multidimensional singular dividend/switching problems such as those treated in Azcue–Muler (Azcue et al., 2018), which concern n-dimensional compound Poisson surplus models with optimal switching and penalty at ruin, addressed via viscosity solution theory and monotone finite-difference fixed-point schemes. Nonetheless, both lines of research rigorously elaborate the interplay between singular controls, dynamic programming, and non-standard objectives, with the MV singular control problem providing a mean–variance-driven paradigm for solvency and payout optimization.