Optimal Portfolios in Lévy Markets under State-Dependent Bounded Utility Functions
Abstract
Motivated by the so-called shortfall risk minimization problem, we consider Merton′s portfolio optimization problem in a non-Markovian market driven by a Lévy process, with a bounded state-dependent utility function. Following the usual dual variational approach, we show that the domain of the dual problem enjoys an explicit “parametrization,” built on a multiplicative optional decomposition for nonnegative supermartingales due to Föllmer and Kramkov (1997). As a key step we prove a closure property for integrals with respect to a fixed Poisson random measure, extending a result by Mémin (1980). In the case where either the Lévy measure ν of Z has finite number of atoms or for a process ζ and a deterministic function ϑ, we characterize explicitly the admissible trading strategies and show that the dual solution is a risk-neutral local martingale.
1. Introduction
The task of determining good trading strategies is a fundamental problem in mathematical finance. A typical approach to this problem aims at finding the trading strategy that maximizes, for example, the final expected utility, which is defined as a concave and increasing function U : ℝ → ℝ ∪ {−∞} of the final wealth. There are, however, many applications where a utility function varies with the underlying securities, or even random. For example, if the market is incomplete, it is often more beneficial to allow certain degree of “shortfall” in order to reduce the “super-hedging cost” (see, e.g., [1, 2] for more details). Mathematically, such a shortfall risk is often quantified by the expected loss
The above shortfall minimizing problem can be easily recast as a utility maximization problem with a bounded state-dependent utility of the form
The existence and essential uniqueness of the solution to the problem (1.3) in its various special forms have been studied by many authors see, for example, Cvitanić [4], Föllmer and Leukert [3], Xu [5], and Karatzas and Žitković [6], to mention a few. However, while the convex duality approach in [3] succeeds in dealing with the non-Markovian nature of the model, it does not seem to shed any light on how to compute, in a feasible manner, the optimal trading strategy, partly due to the generality of the model considered there. In this paper we will consider a specific but popular model driven by a Lévy process. Our goal is to narrow down the domain of dual problem so that the convex duality method holds true. Furthermore, we will try to give an explicit construction of the dual domain that contains the dual optimizer. Although at this point our results are still rather general, and at a theoretical level, we believe that this is a necessary step towards a feasible computational implementation of the convex duality method.
While the utility maximization problem of this kind can be traced back to Merton [7, 8], in this paper we shall follow the convex duality method, suggested by Karatzas et al. [9], and later extended by Kunita [10] to general Lévy market models. However, we note that in [9] the utility function was required to be unbounded, strictly increasing and concave, continuously differentiable, and other technical assumptions including the so-called Inada conditions. On the other hand, since one of the key tools in [10] is an exponential representation for positive local supermartingales (see, e.g., [11, Lemma 4.2]), it is required that the utility function satisfies the same conditions as in [9] (in particular, unboundedness), plus that the dual domain Γ contains all positive “risk-neutral” local supermartingales. The boundedness and potential nondifferentiability of the utility function in our case thus cause some technical subtleties. For example, the dual optimal process can be 0 with positive probability, thus the representation theorem of Kunita [11, Lemma 4.2] does not apply anymore.
A key element that we use to overcome these technical difficulties is an exponential representation theorem for nonnegative supermartingales by Föllmer and Kramkov [12]. This result leads to an explicit construction of the dual domain, based on those nonnegative supermartingales that can be written as stochastic exponentials ξ = ξ0ℰ(X − A), with A being an increasing process and X belonging to a class of semimartingales 𝒮 that is closed under Émery′s topology. To validate this approach we prove a closure property for integrals with respect to a fixed compensated Poisson random measures, a result of interest on its own, which extends the analog property for integrals with respect to a fixed semimartingale due to Mémin [13]. Finally, unlike some previous works on the subject (see, e.g., Föllmer and Leukert [3] and Xu [5]), we do not use the so-called bipolar theorem of Kramkov and Schachermayer [14] to argue the attainability of the optimal final wealth. Instead, we shall rely on the fundamental characterization of contingent claims that are super replicable [1, 2], reducing the problem of finding the optimal primal solution to a super-eplication problem.
We believe that the dual problem proposed in this paper offers several advantages. For example, since the dual class enjoys a fairly “explicit” description and “parametrization,” our results could be considered as a first step towards a feasible computational implementation of the covex duality method. Furthermore, the specific results we obtained for the Lévy market can be used to characterize the elements of the dual domain and the admissible trading strategies. In particular, if either (i) the jumps of the price process S are driven by the superposition of finitely many shot-noise Poisson processes, or (ii) for a process ζ and a deterministic function ϑ, we show that the dual solution is a risk-neutral local martingale.
We would like to remark that some of our results are related to those in Xu [5], but there are essential differences. For example, the model in [5] exhibits only finite-jump activity and allows only downward price jumps (in fact, this assumption seems to be important for the approach there), while our model allows for general jump dynamics, and our approach is also valid for general additive processes, including the time-inhomogeneous cases considered in [5] (see (ii) of Section 6).
The rest of the paper is organized as follows. In Section 2 we introduce the financial model, along with some basic terminology that will be used throughout the paper. The convex duality method is revised in Section 3, where a potential optimal final wealth is constructed. An explicit description of a dual class and a characterizations of the dual optimum and admissible trading strategies are presented in Section 4. In Section 5 we show that the potential optimal final wealth is attained by an admissible trading strategy, completing the proof of the existence of optimal portfolio. In Section 6 we give some concluding remarks. Some necessary fundamental theoretical results, such as the exponential representation for nonnegative supermartingales of Föllmer and Kramkov [12] and the closure property for integrals with respect to Poisson random measures, are collected in Appendix A. Finally, Appendix B briefly outlines the proofs of the convex duality results used in the paper.
2. Notation and Problem Formulation
Throughout this paper we assume that all the randomness comes from a complete probability space (Ω, ℱ, ℙ), on which there is defined a Lévy process Z with Lévy triplet (σ2, ν, 0) (see Sato [15] for the terminology). By the Lévy-Itô decomposition, there exist a standard Brownian motion W and an independent Poisson random measure N on ℝ+ × ℝ∖{0} with mean measure 𝔼N(dt, dz) = ν(dz)dt, such that
2.1. The Market Model
We assume that there are two assets in the market: a risk free bond (or money market account), and a risky asset, say, a stock. The case of multiple stocks, such as the one studied in [10], can be treated in a similar way without substantial difficulties (see Section 6 for more details). As it is customary all the processes are taken to be discounted to the present value so that the value Bt of the risk-free asset can be assumed to be identically equal to 1. The (discounted) price of the stock follows the stochastic differential equation
2.2. Admissible Trading Strategies and the Utility Maximization Problem
A trading strategy is determined by a predictable locally bounded process β representing the proportion of total wealth invested in the stock. Then, the resulting wealth process is governed by the stochastic differential equation
Definition 2.1. The process Vw,β : = V solving (2.4) is called the value process corresponding to the self-financing portfolio with initial endowment w and trading strategy β. We say that a value process Vw,β is “admissible” or that the process β is “admissible” for w if
For a given initial endowment w, we denote the set all admissible strategies for w by , and the set of all admissible value processes by . In light of the Doléans-Dade stochastic exponential of semimartingales (see, e.g., [17, Section I.4f]), one can easily obtain necessary and sufficient conditions for admissibility.
Proposition 2.2. A predictable locally bounded process β is admissible if and only if
To define our utility maximization problem, we begin by introducing the bounded state-dependent utility function.
Definition 2.3. A random function U : ℝ+ × Ω ↦ ℝ+ is called a “bounded and state-dependent utility function” if
- (1)
U(·, ω) is nonnegative, nondecreasing, and continuous on [0, ∞);
- (2)
for each fixed w, the mapping ω ↦ U(w, ω) is ℱT-measurable;
- (3)
there is an ℱT-measurable, positive random variable H such that for all ω ∈ Ω, U(·, ω) is a strictly concave differentiable function on (0, H(ω)), and it holds that
Notice that the ℱT-measurability of the random variable ω → U(VT(ω), ω) is automatic because U(w, ω) is ℬ([0, ∞)) × ℱT-measurable in light of the above conditions (1) and (2). We remark that while assumption (2.7) is merely technical, assumption (2.6) is motivated by the shortfall risk measure (1.2). Our utility optimization problem is thus defined as
Our main objectives in the rest of the paper are the following: (1) Define the dual problem and identify the relation between the value functions of the primal and the dual problems; (2) By suitably defining the dual domain, prove the attainability of the associated dual problem; (3) Show that the potential optimum final wealth induced by the minimizer of the dual problem can be realized by an admissible portfolio. We shall carry out these tasks in the remaining sections.
3. The Convex Duality Method and the Dual Problems
In this section we introduce the dual problems corresponding to the primal problem (1.3) and revise some standard results of convex duality that are needed in the sequel. Throughout, represents the convex dual function of U(·; ω), defined as
Remark 3.1. We point out that the random fields defined in (3.1) and (3.2) are ℬ([0, ∞)) × ℱT-measurable. For instance, in the case of , we can write
Next, we introduce the traditional dual class (cf. [14]).
Definition 3.2. Let be the class of nonnegative supermartingales ξ such that
- (i)
0 ≤ ξ(0) ≤ 1,
- (ii)
for each locally bounded admissible trading strategy β, is a supermartingale.
To motivate the construction of the dual problems below we note that if and V is the value process of a self-financing admissible portfolio with initial endowment V0 ≤ z, then 𝔼[ξ(T)(VT⋀H)] ≤ z, and it follows that
Definition 3.3. Given a subclass , the minimization problem
Notice that, by (3.6) and (3.7), we have the following weak duality relation between the primal and dual value functions:
Proposition 3.4. The dual value function vΓ corresponding to a subclass Γ of satisfies the following properties
- (1)
vΓ is nonincreasing on (0, ∞) and 𝔼[U(0; ·)] ≤ vΓ(y) ≤ 𝔼[U(H; ·)].
- (2)
If
()then vΓ is uniformly continuous on (0, ∞), and() - (3)
There exists a process such that
- (4)
If Γ is a convex set, then (i)vΓ is convex, and with (ii) there exists a attaining the minimum vΓ(y). Furthermore, the optimum ξ* can be “approximated” by elements of Γ in the sense that there exists a sequence {ξn} n ⊂ Γ for which ξn(T) → ξ*(T), a.s.
We now give a result that is crucial for proving the strong duality in (3.8). The result follows from arguments quite similar to those in [9, Theorem 9.3]. For the sake of completeness, we outline the proof in Appendix B.
Theorem 3.5. Suppose that (3.9) is satisfied and Γ is convex, then, for any z ∈ (0, wΓ), there exist y(z) > 0 and such that
- (i)
;
- (ii)
, where
() - (iii)
.
We note that Theorem 3.5 provides essentially an upper bound for the optimal final utility of the form , for certain “reduced” contingent claim . By suitably choosing the dual class Γ, we shall prove in the next two sections that this reduced contingent claim is (super-) replicable with an initial endowment z.
4. Characterization of the Optimal Dual
We now give a full description of a dual class Γ for which strong duality, that is, u(z) = vΓ(y) + zy, holds. Denote 𝒱+ to be the class of all real-valued càdlàg, nondecreasing, adapted processes A null at zero. We will call such a process “increasing.” In what follows we let ℰ(X) be the Doléans-Dade stochastic exponential of the semimartingale X (see, e.g., [17] for their properties). Let
Theorem 4.1. The class Γ is convex, and if (3.9) is satisfied, the dual optimum of Theorem 3.5 belongs to Γ, for any 0 < z < wΓ.
Proof. Let us check that 𝒮 meets with the conditions in Theorem A.1. Indeed, each X in 𝒮 is locally bounded from below since, defining τn : = inf {t ≥ 0 : Xt < −n},
In the rest of this section, we present some properties of the elements in Γ and of the dual optimum ξ* ∈ Γ. In particular, conditions on the “parameters” (G, F, A) so that ξ ∈ Γ(𝒮) is in are established. First, we note that without loss of generality, A can be assumed predictable.
Lemma 4.2. Let
Proof. Let . Since F ∈ Gloc (N), there are stopping times such that
The following result gives necessary conditions for a process ξ ∈ Γ(𝒮) to belong to . Recall that a predictable increasing process A can be uniquely decomposed as the sum of three predictable increasing processes,
Proposition 4.3. Let ξ : = ξ0ℰ(X − A) ≥ 0, where ξ0 > 0,
- (i)
There exist stopping times τn↗τ such that
() - (ii)
For ℙ-a.e. ω ∈ Ω,
()
Proof. Recall that ξ and S satisfy the SDE’s
We now turn to the sufficiency of conditions (i) and (ii). Since is locally bounded,
The following result gives sufficient and necessary conditions for ξ ∈ Γ(𝒮) to belong to . Its proof is similar to that of Proposition 4.3.
Proposition 4.4. Under the setting and notation of Proposition 4.3, ξ ∈ Γ(𝒮) belongs to if and only if condition (i) in Proposition 4.3 holds and, for any locally bounded admissible trading strategies β,
The previous result can actually be made more explicit under additional information on the structure of the jumps exhibited by the stock price process. We consider two cases: when the jumps come from the superposition of shot-noise Poisson processes, and when the random field v exhibit a multiplicative structure. Let us first extend Proposition 2.2 in these two cases.
Proposition 4.5. (i) Suppose that ν is atomic with finitely many atoms then, a predictable locally bounded strategy β is admissible if and only if ℙ × dt-a.e.
(ii) Suppose that v(t, z) = ζtϑ(z), for a predictable locally bounded process ζ such that ℙ × dt-a.e. ζt(ω) ≠ 0 and is locally bounded, and a deterministic function ϑ such that ν({z : ϑ(z) = 0}) = 0, then, a predictable locally bounded strategy β is admissible if and only if ℙ × dt-a.e.
Proof. From Proposition 2.2, recall that ℙ-a.s.
Example 4.6. It is worth pointing out some consequences
- (a)
In the time homogeneous case, where v(t, z) = z, the extreme points of the support of ν (or what accounts to the same, the infimum and supremum of all possible jump sizes) determine completely the admissible strategies. For instance, if the Lévy process can exhibit arbitrarily large or arbitrarily close to −1 jump sizes, then
()a constraint that can be interpreted as absence of shortselling and bank borrowing (this fact was already pointed out by Hurd [19]). - (b)
In the case that , the admissibility condition takes the form If in addition ζ· < 0 (such that the stock prices exhibit only downward sudden movements), then and β· ≡ −c, with c > 0 arbitrary, is admissible. In particular, from Proposition 4.4, if ξ ∈ Γ(𝒮) belongs to , then a.s. htβt ≤ at, for a.e. t ≤ τ. This means that if and only if condition (i) in Proposition 4.3 holds and ℙ−a.s. ht ≥ 0, for a.e. t ≤ τ. For a general ζ and still assuming that , it follows that β is admissible and satisfy that ℙ-a.s.
()for a.e. t ≥ 0.
We now extend Proposition 4.4 in the two cases introduced in Proposition 4.5. Its proof follows from Propositions 4.4 and 4.5.
Proposition 4.7. Suppose that either (i) or (ii) in Proposition 4.5 is satisfied, in which case, define
We remark that the cases and do not lead to any absurd in the definition of above as we are using the convention that 0 · ∞ = 0. Indeed, for instance, if , it was seeing that , for a.e. t ≤ τ, and thus, we set the second term in the definition of to be zero.
Now we can give a more explicit characterization of the dual solution ξ* = ℰ(X* − A*) to problem (3.7), whose existence was established in Theorem 4.1. For instance, we will see that A* is absolutely continuous up to a predictable stopping time. Below, we refer to Proposition 4.3 for the notation.
Proposition 4.8. Let ξ : = ξ0ℰ(X − A) ∈ Γ(𝒮), τA : = inf {t : ΔAt = 1}, and . The followings two statements hold true.
- (1)
. Furthermore, if and only if .
- (2)
Suppose that either of the two conditions in Proposition 4.7 are satisfied and denote
()where is defined accordingly to the assumed case. Then, , and furthermore, the process belongs to if .
Proof. Let Ac, As, Ad denote the increasing predictable processes in decomposition (4.10) of A. Since A is predictable, there is no common jump times between X and A. Then,
We remark that part (2) in Proposition 4.8 remains true if we take . The following result is similar to Proposition 3.4 in Xu [5] and implies, in particular, that the optimum dual ξ* can be taken to be a local martingale.
Proposition 4.9. Suppose that either (i) or (ii) of Proposition 4.5 is satisfied. Moreover, in the case of condition (ii), assume additionally that
Proof. Let us prove the case when condition (i) in Proposition 4.5 is in force. In light of Proposition 4.8, we assume without loss of generality that with Assume that min iv(t, zi) < 0 < max iv(t, zi). Otherwise if, for instance, max iv(t, zi) ≤ 0, then it can be shown that ht ≥ 0, a.s. (similarly to case (b) in Example 4.6), and the first term of is 0 under our convention that ∞ · 0 = 0. Notice that, in any case, one can find a predictable process z taking values on , such that
5. Replicability of the Upper Bound
We now show that the tentative optimum final wealth , suggested by the inequality (iii) in Theorem 3.5, is (super-) replicable. We will combine the dual optimality of ξ* with the super-hedging theorem, which states that given a contingent claim satisfying , one can find for any fixed an admissible trading strategy β* (depending on z) such that almost surely (see Kramkov [2], and also Delbaen and Schachermayer [1]). Recall that ℳ denotes the class of all equivalent risk neutral probability measures.
Proposition 5.1. Under the setting and conditions of Theorem 3.5, for any 0 < z < wΓ, there is an admissible trading strategy β* for z such that
Proof. For simplicity, we write , y = y(z), and
6. Concluding Remarks
We conclude the paper with the following remarks.
(i) The dual class Γ The dual domain of the dual problem can be taken to be the more familiar class of equivalent risk-neutral probability measures ℳ. To be more precise, define
In particular we conclude that condition (6.2) is sufficient for both the existence of the solution to the primal problem and its characterization in terms of the dual solution of the dual problem induced by . We now further know that ξ* belongs to the class defined in (4.3), and hence, enjoys an explicit parametrization of the form
(ii) Market driven by general additive models Our analysis can be extended to more general multidimensional models driven by additive processes (i.e., processes with independent, possibly nonstationary increments; cf. Sato [15] and Kallenberg [20]). For instance, let (Ω, ℱ, ℙ) be a complete probability space on which is defined a ddimensional additive process Z with Lévy-Itô decomposition:
(iii) Optimal wealth-consumption problem Another classical portfolio optimization in the literature is that of optimal wealth-consumption strategies under a budget constraint. Namely, we allow the agent to spend money outside the market, while maintaining “solvency” throughout [0, T]. In that case the agent aims to maximize the cost functional that contains a “running cost”:
Acknowledgments
The first author is partially supported by the NSF Grant no. DMS 0906919. This author would like to thank the Department of Statistics and Applied Probability at the University of California at Santa Barbara for its hospitality and support while this paper was in preparation. The second author is supported in part by the NSF Grant no. DMS 0806017.
Appendices
A. Convex Classes of Exponential Supermartingales
The goal of this part is to establish the theoretical foundations behind Theorem 4.1. We begin by recalling an important optional decomposition theorem due to Föllmer and Kramkov [12]. Given a family of supermartingales 𝒮 satisfying suitable conditions, the result characterizes the nonnegative exponential local supermartingales ξ : = ξ0ℰ(X − A), where X ∈ 𝒮 and A ∈ 𝒱+, in terms of the so-called upper variation process for 𝒮. Concretely, let 𝒫(𝒮) be the class of probability measures ℚ ~ ℙ for which there is an increasing predictable process (depending on ℚ and 𝒮) such that {Xt − At} t≥0 is a local supermartingale under ℚ, for all X ∈ 𝒮. The smallest of such processes A is denoted by A𝒮(ℚ) and is called the upper variation process for 𝒮 corresponding to ℚ. For easy reference, we state Föllmer and Kramkov’s result (see [12] for a proof).
Theorem A.1. Let 𝒮 be a family of semimartingales that are null at zero, and that are locally bounded from below. Assume that 0 ∈ 𝒮, and that the following conditions hold:
- (i)
𝒮 is predictably convex,
- (ii)
𝒮 is closed under the Émery distance,
- (iii)
𝒫(𝒮) ≠ ∅,
- (1)
ξ is of the form ξ = ξ0ℰ(X − A), for some X ∈ 𝒮 and an increasing process A ∈ 𝒱+;
- (2)
ξ/ℰ(A𝒮(ℚ)) is a supermartingale under ℚ for each ℚ ∈ 𝒫(𝒮).
The next result is a direct consequence of the previous representation. Recall that a sequence of processes {ξn} n≥1 is said to be “Fatou convergent on π” to a process ξ if {ξn} n≥1 is uniformly bounded from below and it holds that
Proposition A.2. If 𝒮 is a class of semimartingales satisfying the conditions in Theorem A.1, then
Proof. The convexity of Γ(𝒮) is a direct consequence of Theorem A.1, since the convex combination of supermartingales remains a supermartingale. Let us prove the closure property. Fix a ℚ ∈ 𝒫(𝒮) and denote Ct : = ℰ(A𝒮(ℚ)). Notice that Ct > 0 because A𝒮(ℚ) t is increasing and hence, its jumps are nonnegative. Since ξn ∈ Γ(𝒮), is a supermartingale under ℚ. Then, for 0 < s′ < t′,
The most technical condition in Theorem A.1 is the closure property under Émery distance. The following result is useful to deal with this condition. It shows that the class of integrals with respect to a Poisson random measure is closed with respect to Émery distance, thus extending the analog property for integrals with respect to a fixed semimartingale due to Mémin [13].
Theorem A.3. Let Θ be a closed convex subset of ℝ2 containing the origin. Let Π be the set of all predictable processes (F, G), F ∈ Gloc (N), and , such that F(t, ·) = G(t) = 0, for all t ≥ T, and (F(ω, t, z), G(ω, t)) ∈ Θ, for ℙ × dt × ν(dz)-a.e. (ω, t, z) ∈ Ω × ℝ+ × ℝ0. Then, the class
Proof. Consider a sequence of semimartingales
For some ℚ ~ ℙ, we denote ℳ2(ℚ) to be the Banach space of all ℚ-square integrable martingales on [0, T], endowed with the norm , and 𝒜(ℚ) to be the Banach space of all predictable processes on [0, T] that have ℚ-integrable total variations, endowed with the norm ∥A∥𝒜(ℚ) : = 𝔼ℚVar (A). Below, stands for the localized class of increasing process in 𝒜(ℚ). By [13, Theorem II.3], one can extract a subsequence from {Xn}, still denote it by {Xn}, for which one can construct a probability measure ℚ, defined on ℱT and equivalent to ℙT (the restriction of ℙ on ℱT), such that the following assertions hold:
- (i)
ξ : = dℚ/(dℙT) is bounded by a constant;
- (ii)
, t ≤ T, for Cauchy sequences {Mn} n≥1 and {An} n≥1 in ℳ2(ℚ) and 𝒜(ℚ), respectively.
Let us extend Mn and An to [0, ∞) by setting and for all t ≥ 0. Also, we extend ℚ for A ∈ ℱ by setting ℚ(A): = ∫Aξdℙ, so that ℚ ~ ℙ (on ℱ). In that case, it can be proved that This follows essentially from [17, Proposition III.3.5] and Doob′s Theorem. Now, let denote the density process. Since ξ is bounded, both {ξt} t and {|Δξt | } t are bounded. By [17, Lemma III.3.14 and Theorem III.3.11], the ℙ-quadratic covariation [Xn, ξ] has ℙ-locally integrable variation and the unique canonical decomposition Mn + An of Xn relative to ℚ is given by
B. Proofs of Some Standard Convex Duality Results
This appendix sketches the proofs of the results in Section 3. The proofs are standard in convex duality and are given only for the sake of completeness.
Proof of Proposition 3.4. For simplicity, we write v(y) = vΓ(y). The monotonicity and range of values of v are straightforward. To prove (2), notice that since is convex, nonincreasing, and , we have
Part (i) of (4) is well known. Let us turn out to prove (3) and part (ii) in (4). Let be such that
Proof of Theorem 3.5. We follow the arguments in [9, Theorem 9.3]. For simplicity let us write v(y) instead of vΓ(y). Recall that wΓ : = sup ξ∈Γ𝔼[ξ(T)H] and define v(0): = 𝔼[U(H; ω)]. In light of Proposition 3.4, the continuous function fz(y): = v(y) + zy satisfies