1. Introduction
The jump diffusion process has come to play an important role in many branches of science and industry. In their book [1], Øksendal and Sulem have studied the optimal control, optimal stopping, and impulse control for jump diffusion processes. In mathematical finance theory, many researchers have developed option pricing theory, for example, Merton [2] was the first to use the jump processes to describe the stock dynamics, and Bardhan and Chao [3] were amongst the first authors to consider market completeness in a discontinuous model. The jump diffusion models have been discussed by Chan [4], Föllmer and Schweizer [5], El Karoui and Quenez [6], Henderson and Hobson [7], and Merculio and Runggaldier [8], to name a few.
On the other hand, regime-switching models have been widely used for price processes of risky assets. For example, in [9] the optimal stopping problem for the perpetual American put has been considered, and the finite expiry American put and barrier options have been priced. The asset allocation has been discussed in [10], and Elliott et al. [11] have investigated volatility problems. Regime-switching models with a Markov-modulated asset have already been applied to option pricing in [12–14] and references therein. Moreover, Markowitz′s mean-variance portfolio selection with regime switching has been studied by Yin and Zhou [15], Zhou and Yin [16], and Zhou and Li [17].
Portfolio selection is an important topic in finance; multiperiod mean-variance portfolio selection has been studied by, for example, Samuelson [18], Hakansson [19], and Pliska [20] among others. Continuous-time mean-variance hedging problems were attacked by Duffie and Richardson [21] and Schweizer [22] where optimal dynamic strategies were derived, based on the projection theorem, to hedge contingent claims in incomplete markets.
In this paper, we will extend the results of Yin and Zhou [15] to SDEs with jumps under regime switching. After dealing with the difficulty from the jump processes, we obtain similar results to those of Yin and Zhou [15].
2. SDEs under Regime Switching with Jumps
Throughout this paper, let (Ω,
ℱ,
P) be a fixed complete probability space on which it is defined a standard
d-dimensional Brownian motion
W(
t)≡(
W1(
t), …,
Wd(
t))
′ and a continuous-time stationary Markov chain
α(
t) taking value in a finite state space
𝕊 = {1,2, …,
l}. Let
N(
t,
z) be as
n-dimensional Poisson process and denote the compensated Poisson process by
()
where
Nj,
j = 1, …,
n, are independent 1-dimensional Poisson random measures with characteristic measure
νj,
j = 1, …,
n, coming from
n independent 1-dimensional Poisson point processes. We assume that
W(
t),
α(
t), and
N(
dt,
dz) are independent. The Markov chain
α(
t) has a generator
given by
()
where Δ > 0. Here
qij ≥ 0 is the transition rate from
i to
j if
i ≠
j while
()
and stationary transition probabilities
()
Define
ℱt =
σ{
W(
s),
α(
s),
N(
s, ·)
:
0 ≤
s ≤
t}. Let |·| denote the Euclidean norm as well as the matrix trace norm and
M′ the transpose of any vector or matrix
M. We denote by
the set of all
ℝm-valued, measurable stochastic processes
f(
t) adapted to
, such that
.
Consider a market in which
d + 1 assets are traded continuously. One of the assets is a bank account whose price
P0(
t) is subject to the following stochastic ordinary differential equation:
()
where
r(
t,
i) ≥ 0,
i = 1,2, …,
l, are given as the interest rate process corresponding to different market modes. The other
d assets are stocks whose price processes
Pm(
t),
m = 1,2, …,
d, satisfy the following system of stochastic differential equations (SDEs):
()
where for each
i = 1,2, …,
l,
b : [0,
T] ×
𝕊 →
ℝd×1,
σ : [0,
T] ×
𝕊 →
ℝd×d,
ρ : [0,
T] ×
𝕊 ×
ℝn →
ℝd×n is the appreciation rate process, and
σm(
t,
i): = (
σm1(
t,
i), …,
σmd(
t,
i)) are adapted processes such that the integrals exist. And each column
ρ(k) of the
d ×
n matrix
ρ = [
ρij] depends on
z only through the
kth coordinate
zk, that is,
()
Remark 2.1. Generally speaking, one uses noncompensated Poisson processes in a jump diffusion model (see Kushner [23]). However, we use compensated Poisson processes in (2.6) instead of using noncompensated Poisson processes, this is because firstly, using relationship (2.1) we can easily transform a jump diffusion model driven by noncompensated Poisson processes into a jump diffusion model driven by compensated Poisson processes; secondly, using compensated Poisson processes we can keep the Riccati Equation (4.2) similar to that of a diffusion model without jump processes, and then H(t, i) in (4.3) has a financial interpretation.
Define the volatility matrix, for each
i = 1, …,
l,
()
where
()
We assume throughout this paper that the following nondegeneracy condition:
()
is satisfied for some
δ > 0. We also assume that all the functions
r(
t,
i),
bm(
t,
i), and
σmn(
t,
i),
ρmn(
t,
i,
z) are measurable and uniformly bounded in
t.
Suppose that the initial market mode
α(0) =
i0. Consider an agent with an initial wealth
x0 > 0. These initial conditions are fixed throughout the paper. Denote by
x(
t) the total wealth of the agent at time
t ≥ 0. Assume that the trading of shares takes place continuously and that transaction cost and consumptions are not considered. Suppose the right portfolio (
π0(
t),
π1(
t), …,
πd(
t)) exists, where
π0(
t) is the money invested in the bond, and
πi(
t) is the money invested in the
ith stock. Then
()
where
η0(
t) is the number of bond units bought by the investor, and
ηi(
t) is the amount of units for the
ith stock. We call
x(
t) the wealth process for this investor in the market. Now let us derive intuitively the stochastic differential equation (SDE) for the wealth process as follows. Suppose the portfolio is self-financed, that is, in a short time
dt the investor does not put in or withdraw any money from the market. Let the money
x(
t) change in the market due to the market own performance, that is, self-finance produces
()
Now substituting (
2.5) and (
2.6) into the above equation, after a simple calculation we arrive at
()
where
π(
t) = (
π1(
t), …,
πd(
t))
′ which we call a portfolio of the agent. And
πm(
t) is the total market value of the agent′s wealth in the
mth asset,
m = 0,1, …,
d, at time
t.
Setting
()
we can rewrite the wealth equation (
2.13) as
()
Definition 2.2. A portfolio π(·) is said to be admissible if and the SDE (2.15) has a unique solution x(·) corresponding to π(·). In this case, we refer to (x(·), π(·)) as an admissible (wealth, portfolio) pair.
Remark 2.3. Most works in the literature define a portfolio, say π(·), as the fractions of wealth allocated to different stocks. That is,
()
With this definition, (
2.15) can be rewritten as
()
It is well known that this equation has a unique solution (see [
1, page 10, Theorem 1.19]. We can use the same method in [18, Example 1.15, page 8] to show positivity of the solution of (
2.17) if the initial wealth
x0 is positive and
u(
t)
′ρ(
t,
i,
z)>−1. A wealth process with possible zero or negative values is sensible at least for some circumstances. The nonnegativity of wealth process is better imposed as an additional constraint, rather than as a built-in feature. In our formulation, a portfolio is well defined even if the wealth is zero or negative, and the nonnegativity of the wealth could be a constraint.
The agent′s objective is to find an admissible portfolio
π(·) among all the admissible portfolios whose expected terminal wealth is
𝔼x(
T) =
ζ for some given
ζ ∈
ℝ1, so that the risk measured by the variance of the terminal wealth
()
is minimized. Finding such a portfolio
π(·) is referred to as the mean-variance portfolio selection problem. Specifically, we have the following formulation.
Definition 2.4. The mean-variance portfolio selection is a constrained stochastic optimization problem, parameterized by ζ ∈ ℝ1:
()
Moreover, the problem is called feasible if there is at least one portfolio satisfying all the constraints. The problem is called finite if it is feasible and the infimum of JMV(x0, i0, π(·)) is finite. Finally, an optimal portfolio to the above problem, if it ever exists, is called an efficient portfolio corresponding to ζ; the corresponding (Var x(T), ζ) ∈ ℝ2 and (σx(T), ζ) ∈ ℝ2 are interchangeably called an efficient point, where σx(T) denotes the standard deviation of x(T). The set of all the efficient points is called the efficient frontier.
For more details of mean-variance portfolio selection see [
15,
16], We need more notations; let Δ
ij be consecutive, left closed, right open intervals of the real line each having length
γij such that
()
For future use, we cite the generalized Itô lemma (see [
1,
24,
25]) as the following lemma.
Lemma 2.5. Given a d-dimensional process y(·) satisfying
()
where
f,
g, and
γ satisfy Lipschitz condition with appropriate dimensions, each column
γ(k) of the matrix
γ = [
γij] depends on
z only through the
kth coordinate
zk. Let
φ(
t,
x,
i) ∈
C1,2([0,
T] ×
ℝn ×
S;
ℝ), one then has
()
where
()
where
μ is a martingale measure,
()
and
is a martingale measure. And
γ(
dt,
dy) is a Poisson random measure with intensity
dt ×
μ(
dy), in which
μ is the Lebesgue measure on
ℝ.
3. Feasibility
Since the problem (2.19) involves a terminal constraint 𝔼x(T) = ζ, in this section, we derive conditions under which the problem is at least feasible. First of all, the following generalized Itô lemma [25] for Markov-modulated processes is useful.
The associated wealth process
x0(·) satisfies
()
with its expected terminal wealth
()
Lemma 3.1. Let ψ(·, i), i = 1,2, …, l, be the solutions to the following system of linear ordinary differential equations (ODEs):
()
Then the mean-variance problem (
2.19) is feasible for every
ζ ∈
ℝ1 if and only if
()
Proof. To prove the “if" part, construct a family of admissible portfolios πβ(·) = βπ(·) for β ∈ ℝ1 where
()
Assume that
xβ(
t) is the solution of (
2.15). Let
xβ(
t) =
x0(
t) +
βy(
t), where
x0(·) satisfies (
3.1), and
y(·) is the solution to the following equation:
()
Therefore, problem (
2.19) is feasible for every
ζ ∈
ℝ1 if there exists
β ∈
ℝ such that
ζ =
𝔼xβ(
T) ≡
𝔼x0(
T) +
β𝔼y(
T). Equivalently, (
2.19) is feasible for every
ζ ∈
ℝ if
𝔼y(
T) ≠ 0. Applying the generalized Itô formula (Lemma
2.5) to
φ(
t,
x,
i) =
ψ(
t,
i)
x, we have
()
Integrating from 0 to
T, taking expectation, and using (
3.5), we obtain
()
Consequently,
𝔼y(
T) ≠ 0 if (
3.4) holds.
Conversely, suppose that problem (2.19) is feasible for every ζ ∈ ℝ1. Then for each ζ ∈ ℝ, there is an admissible portfolio π(·) so that 𝔼x(T) = ζ. However, we can always decompose x(t) = x0(t) + y(t) where y(·) satisfies (3.6). This leads to 𝔼x0(T) + 𝔼y(T) = ζ. However, 𝔼x0(T) ≡ ζ0 is independent of π(·); thus it is necessary that there is a π(·) with 𝔼y(T) ≠ 0. It follows then from (3.8) that (3.4) is valid.
Theorem 3.2. The mean-variance problem (2.19) is feasible for every ζ ∈ ℝ if and only if
()
Proof. By virtue of Lemma (3.1), it suffices to prove that ψ(t, i) > 0 ∀ t ∈ [0, T], i = 1,2, …, l. To this end, note that (3.3) can be rewritten as
()
Treating this as a system of terminal-valued ODEs, a variation-of-constant formula yields
()
Construct a sequence
ψ(k)(·,
i) (known as the Picard sequence) as follows:
()
Noting that
qij ≥ 0 for all
j ≠
i, we have
()
On the other hand, it is well known that
ψ(
t,
i) is the limit of the Picard sequence
ψ(k)(
t,
i) as
k →
∞. Thus
ψ(
t,
i) > 0. This proves the desired result.
Corollary 3.3. If (3.9) holds, then for any ζ ∈ ℝ, an admissible portfolio that satisfies 𝔼x(T) = ζ is given by
()
where
x0 and
ϱ are given by (
3.2) and (
3.4), respectively.
Proof. This is immediate from the proof of the “if" part of Lemma (3.1)
()
Then one has
()
Corollary 3.4. If , then any admissible portfolio π(·) results in 𝔼x(T) = ζ0.
Proof. This is seen from the proof of the “only if" part of Lemma (3.1)
()
since
.
Remark 3.5. Condition (3.9) is very mild. For example, (3.9) holds as long as there is one stock whose appreciation-rate process is different from the interest-rate process at any market mode, which is obviously a practically reasonable assumption. On the other hand, if (3.9) fails, then Corollary (3.4) implies that the mean-variance problem (2.19) is feasible only if ζ = ζ0. This is pathological and trivial case that does not warrant further consideration. Therefore, from this point on we will assume that (3.9) holds or, equivalently, the mean-variance problem (2.19) is feasible for any ζ.
Having addressed the issue of feasibility, we proceed with the study of optimality. The mean-variance problem (
2.19) under consideration is a dynamic optimization problem with a constraint
𝔼x(
T) =
ζ. To handle this constraint, we apply the Lagrange multiplier technique. Define
()
Our first goal is to solve the following unconstrained problem parameterized by the Lagrange multiplier
λ:
()
This turns out to be a Markov-modulated stochastic linear-quadratic optimal control problem, which will be solved in the next section.
4. Solution to the Unconstrained Problem
In this section we solve the unconstrained problem (
3.19). Firstly define
()
Consider the following two systems of ODEs:
()
()
The existence and uniqueness of solutions to the above two systems of equations are evident as both are linear with uniformly bounded coefficients.
Proposition 4.1. The solutions of (4.2) and (4.3) must satisfy P(t, i) > 0 and 0 < H(t, i) ≤ 1, ∀t ∈ [0, T], i = 1,2, …, l. Moreover, if for a fixed i, r(t, i) > 0, a.e., t ∈ [0, T], then H(t, i) < 1, ∀t ∈ [0, T).
Proof. The assertion P(t, i) > 0 can be proved in exactly the same way as that of ψ(t, i) > 0; see the proof of Theorem 3.2. Having proved the positivity of P(t, i), one can then show that H(t, i) > 0 using the same argument because now P(t, j)/P(t, i) > 0.
To prove that H(t, i) ≤ 1, first note that the following system of ODEs:
()
has the only solutions
,
i = 1,2, …,
l, due to the uniqueness of solutions. Set
()
which solves the following equations:
()
A variation-of-constant formula leads to
()
A similar trick using the construction of Picard′s sequence yields that
. In addition,
, ∀
t ∈ [0,
T), if
r(
t,
i) > 0, a.e.,
t ∈ [0,
T]. The desired result then follows from the fact that
.
Remark 4.2. Equation (4.2) is a Riccati type equation that arises naturally in studying the stochastic LQ control problem (3.19) whereas (4.3) is used to handle the nonhomogeneous terms involved in (3.19); see the proof of Theorem 4.3. On the other hand, H(t, i) has a financial interpretation: for fixed (t, i), H(t, i) is a deterministic quantity representing the risk-adjusted discount factor at time t when the market mode is i (note that the interest rate itself is random).
Theorem 4.3. Problem (3.19) has an optimal feedback control
()
Moreover, the corresponding optimal value is
()
where
()
with the transition probabilities
given by (
2.4).
Proof. Let π(·) be any admissible control and x(·) the corresponding state trajectory of (2.15). Applying the generalized Itô formula (Lemma 2.5) to
()
we obtain
()
where
π*(
t,
x,
i) is defined as the right-hand side of (
4.8). Integrating the above from 0 to
T and taking expectations, we obtain
()
Consequently,
()
Since
P(
t,
α(
t)) > 0 by Proposition (
4.1), it follows immediately that the optimal feedback control is given by (
4.8) and the optimal value is given by (
4.9), provided that the corresponding equation (
2.15) under the feedback control (
4.8) has a solution. But under (
4.8), the system (
2.15) is a nonhomogeneous linear SDE with coefficients modulated by
α(
t). Since all the coefficients of this linear equation are uniformly bounded and
α(
t) is independent of
W(
t), the existence and uniqueness of the solution to the equation are straightforward based on a standard successive approximation scheme.
Finally, since
()
and
qij ≥ 0 for all
i ≠
j, we must have
θ ≥ 0. This completes the proof.
5. Efficient Frontier
In this section we proceed to derive the efficient frontier for the original mean-variance problem (2.19).
Theorem 5.1 (efficient portfolios and efficient frontier). Assume that (3.9) holds. Then one has
()
Moreover, the efficient portfolio corresponding to
z, as a function of the time
t, the wealth level
x, and the market mode
i, is
()
where
()
Furthermore, the optimal value of Var
x(
T), among all the wealth processes
x(·) satisfying
𝔼x(
T) =
ζ, is
()
Proof. By assumption (3.9) and Theorem 3.2, the mean-variance problem (2.19) is feasible for any ζ ∈ ℝ1. Moreover, using exactly the same approach in the proof of Theorem 4.3, one can show that problem (2.19) without the constraint 𝔼x(T) = ζ must have a finite optimal value, hence so does the problem (2.19). Therefore, (2.19) is finite for any ζ ∈ ℝ1. Now we need to prove that JMV(x0, i0, π(·)) is strictly convex in π(·). We can easily get
()
where
κ ∈ [0,1]. So, we obtain
()
which proves
JMV(
x0,
i0,
π(·)) is strictly convex in
π(·). that Affine space means the complement of points at infinity. It can also be viewed as a vector space whose operations are limited to those linear combinations whose coefficients sum to one. Since
JMV(
x0,
i0,
π(·)) is strictly convex in
π(·) and the constraint function
𝔼x(
T) −
ζ is affine in
π(·), we can apply the well-known duality theorem (see [
26, page 224, Theorem 1]) to conclude that for any
ζ ∈
ℝ1, the optimal value of (
2.19) is
()
By Theorem
4.3, inf
π(·)admissibleJ(
x0,
i0,
π(·),
λ) is a quadratic function (
4.9) in
λ −
ζ. It follows from the finiteness of the supremum value of this quadratic function that
()
Now if
()
then again by Theorem
4.3 and (
5.7) we must have
()
for every
ζ ∈
ℝ1, which is a contradiction. This proves (
5.1). On the other hand, in view of (
5.7), we maximize the quadratic function (
4.9) over
λ −
ζ and conclude that the maximizer is given by (
5.3) whereas the maximum value is given by the right-hand side of (
5.4). Finally, the optimal control (
5.2) is obtained by (
4.8) with
λ =
λ*.The efficient frontier (
5.4) reveals explicitly the tradeoff between the mean (return) and variance (risk) at the terminal. Quite contrary to the case without Markovian jumps [
17], the efficient frontier in the present case is no longer a perfect square (or, equivalently, the efficient frontier in the mean-standard deviation diagram is no more a straight line). As a consequence, one is not able to achieve a risk-free investment. This, certainly, is expected since now the interest rate process is modulated by the Markov chain, and the interest rate risk cannot be perfectly hedged by any portfolio consisting of the bank account and stocks [
27], because the Markov chain is independent of the Brownian motion. Nevertheless, expression (
5.4) does disclose the minimum variance, namely, the minimum possible terminal variance achievable by an admissible portfolio, along with the portfolio that attains this minimum variance.
Theorem 5.2 (minimum variance). The minimum terminal variance is
()
with the corresponding expected terminal wealth
()
and the corresponding Lagrange multiplier
. Moreover, the portfolio that achieves the above minimum variance, as a function of the time
t, the wealth level
x, and the market mode
i, is
()
Proof. The conclusions regarding (5.11) and (5.12) are evident in view of the efficient frontier (5.4). The assertion can be verified via (5.3) and (5.12). Finally, (5.13) follows from (5.2).
Remark 5.3. As a consequence of the above theorem, the parameter s can be restricted to ζ ≥ ζmin when one defines the efficient frontier for the mean-variance problem (2.19).
Theorem 5.4 (mutual fund theorem). Suppose that an efficient portfolio is given by (5.2) corresponding to ζ = ζ1 > ζmin . Then a portfolio π*(·) is efficient if and only if there is a μ ≥ 0 such that
()
where
is the minimum variance portfolio defined in Theorem
5.2.
Proof. We first prove the “if" part. Since both and are efficient, by the explicit expression of any efficient portfolio given by (5.2), must be in the form of (5.2) corresponding to ζ = (1 − μ)ζmin + μζ1 (also noting that x*(·) is linear in π*(·)). Hence π*(t) must be efficient.
Conversely, suppose that π*(·) is efficient corresponding to a certain ζ ≥ ζmin . Write ζ = (1 − μ)ζmin + μζ1 with some μ ≥ 0. Multiplying
()
by (1 −
μ), multiplying
()
by
μ, and summing them up, we obtain that
is represented by (
5.2) with
and
ζ = (1 −
μ)
ζmin +
μζ1. This leads to (
5.14).
Remark 5.5. The above mutual fund theorem implies that any investor needs only to invest in the minimum variance portfolio and another prespecified efficient portfolio in order to achieve the efficiency. Note that in the case where all the market parameters are deterministic [17], the corresponding mutual fund theorem becomes the one-fund theorem, which yields that any efficient portfolio is a combination of the bank account and a given efficient risky portfolio (known as the tangent fund). This is equivalent to the fact that the fractions of wealth among the stocks are the same among all efficient portfolios. However, in the present Markov-modulated case this feature is no longer available.
Since the wealth processes
x(·) are with jumps, it is more complicated when we solve the unconstrained problem (
3.19). Firstly, we aim to derive conditions of feasibility. It is not hard to prove feasibility of the constrained stochastic optimization problem (
2.19), which we get the unconstrained problem (
3.19) from. Then we solve the unconstrained problem (
3.19). If we assume that
()
we have
()
where
()
So, we added one item
in optimal feedback control
π*(
t,
x,
i) (see (
3.19)) to simplify the calculation.
Acknowledgments
The author would like to thank Dr. C. Yuan for his helpful comments and discussions. He would like to thank the referees for the careful reading of the first version of this paper.