Stability and Probability 1 Convergence for Queueing Networks via Lyapunov Optimization
Abstract
Lyapunov drift is a powerful tool for optimizing stochastic queueing networks subject to stability. However, the most convenient drift conditions often provide results in terms of a time average expectation, rather than a pure time average. This paper provides an extended drift-plus-penalty result that ensures stability with desired time averages with probability 1. The analysis uses the law of large numbers for martingale differences. This is applied to quadratic and subquadratic Lyapunov methods for minimizing the time average of a network penalty function subject to stability and to additional time average constraints. Similar to known results for time average expectations, this paper shows that pure time average penalties can be pushed arbitrarily close to optimality, with a corresponding tradeoff in average queue size. Further, in the special case of quadratic Lyapunov functions, the basic drift condition is shown to imply all major forms of queue stability.
1. Introduction
This paper considers Lyapunov methods for analysis and control of queueing networks. Landmark work in [1, 2] uses the drift of a quadratic Lyapunov function to design general max-weight algorithms for network stability. Work in [3, 4] extends this result using a drift-plus-penalty algorithm that leads to joint stability and penalty optimization. The results in [1, 2] are developed for systems that evolve on ergodic Markov chains with countably infinite-state space. The results in [3, 4] apply to more general systems but are stated in terms of time average expectations, rather than pure time averages. Time average expectations can often be translated into pure time averages when the system has a Markov structure and some additional assumptions are satisfied, such as when the system regularly returns to a renewal state. However, these additional assumptions often fail and can be difficult to verify for particular systems of interest. This paper seeks to provide simple conditions that ensure desirable queue sample paths and desirable time average penalties with probability 1.
First, a basic drift condition for a general class of Lyapunov functions is shown to imply two weak forms of queue stability, called rate stability and mean rate stability. Next, this paper focuses on quadratic and subquadratic Lyapunov functions. Using the law of large numbers for martingale differences, a simple drift-plus-penalty condition is shown to ensure queue stability together with desirable time averages of an associated network penalty function. In the special case of quadratic Lyapunov functions, it is shown that the basic drift condition implies all major forms of queue stability, including rate and mean rate stability, as well as four stronger versions. These results require only mild bounded fourth moment conditions on queue difference processes and bounded second-moment conditions on penalty processes and do not require a Markov structure or renewal assumptions. Two simple examples are given in Appendix A to show what can go wrong if these bounded moment conditions are violated.
Finally, these results are used to design and analyze a class of algorithms for minimizing the time average of a network penalty subject to queue stability constraints and additional time average constraints. It is shown that the time average penalty can be pushed arbitrarily close to optimality, with a corresponding tradeoff in queue size. In the special case of quadratic functions, the tradeoff results of [3, 4] are recovered in a stronger form. They are shown to hold for time averages with probability 1 (rather than time average expectations). The results for quadratic functions in this paper assume the problem is feasible and that the constraints can be achieved with “ϵ-slackness” (similar to a Slater-type condition of static optimization theory [5]). However, this paper also obtains results for subquadratics. Algorithms based on subquadratics are shown to provide desired results whenever the problem is feasible, without requiring the additional slackness assumption. This analysis is useful for control of queueing networks, and for other stochastic problems that seek to optimize time averages subject to time average constraints.
1.1. On Relationships between Time Average Expectations and Time Averages
It is known by Fatou′s Lemma that if a random process is deterministically lower bounded (such as being nonnegative) and has time averages that converge to a constant with probability 1, then this constant must be less than or equal to the liminf time average expectation (see, e.g., [6]). This can be used to translate the existing bounds in [4], which hold for time average expectations, into sample path bounds that hold with probability 1. However, this translation is only justified in the special case when the processes are suitably lower-bounded and have convergent time averages. Time average convergence can often be shown when the system is defined on a Markov chain with suitable renewal properties [7, 8]. Further, queue stability is often defined in terms of positive Harris recurrence [9, 10] although systems with positive Harris recurrence do not always have finite average queue size. For example, consider a standard M/G/1 queue with service times that have finite means 𝔼[X] but infinite second moments, and with arrival rate λ < 1/𝔼[X]. The system spends a positive fraction of time in the 0 state but has infinite time average queue size.
Networks with flow control often have a structure that yields deterministically bounded queues [11, 12]. While this is perhaps the strongest form of stability, it requires special structure. Primal-dual algorithms are considered for scheduling in wireless systems with “infinite backlog” in [13, 14] and shown to converge to a utility-optimal operating point although this work does not consider queueing or time average constraints. Related primal-dual algorithms are treated in [15, 16] for systems with queues, where [15] shows utility optimality for a fluid version of the system, and [16] proves that the actual network exhibits rate stability with near-optimal utility. Related analysis is considered in [17] for systems with countably infinite-state ergodic Markov chains, and utility-optimal scheduling is treated for deterministic systems in [18]. The approaches in [13–18] do not obtain the desired tradeoffs between network utility and queue size.
1.2. Prior Work on Quadratics and Other Lyapunov Functions
Much prior work on queue stability uses quadratic Lyapunov functions, including [1, 2, 19–21] for ergodic systems defined on countably infinite Markov chains, and [22] for more general systems but where stability is defined in terms of a time average expectation. Stability with exponential Lyapunov functions is treated in [23, 24], and stability with more general Lyapunov functions is in [25]. The works [26–29] develop algorithms based on subquadratic Lyapunov functions and suggest that these often have better delay in comparison to algorithms derived from quadratic Lyapunov functions. This motivates recent work in [30] that has combined subquadratic Lyapunov functions with joint stability and penalty minimization, using time average expectations similar to [4].
2. Lyapunov Functions and Drift
Let Q(t) = (Q1(t), …, QK(t)) be a real-valued random vector that evolves over time slots t ∈ {0,1, 2, …}. The distribution of Q(0) is given, and the values of Q(t) for t ∈ {1,2, 3, …} are determined by events on a probability space. Specifically, assume that outcomes on the probability space have the structure {η(−1), η(0), η(1), η(2), …}, where η(−1) = Q(0) is the initial vector state, and η(t) for t ∈ {0,1, 2, …} represents an event that occurs on slot t. For each slot t ∈ {0,1, 2, …}, the value of Q(t) is assumed to be determined by the history {Q(0), η(0), η(1), …, η(t − 1)}. Let ℋ(t) represent this history up to (but not including) slot t. Formally, ℋ(t) is a filtration of the infinite horizon probability space onto the finite set of slots {−1,0, 1, …, t − 1}. For any slot t ∈ {0,1, 2, …}, a given ℋ(t) includes information that specifies the value of Q(t).
2.1. Lyapunov Functions
2.2. Performance for Time Average Expectations
The above arithmetic did not require Markov chain assumptions, renewal assumptions, or assumptions on the higher-order moments of the processes. We desire to obtain performance bounds similar to (2.7) and (2.8), but with the above limiting time average expectations replaced, with probability 1, by limiting time averages over a sample path. This cannot be done without imposing more structural assumptions on the problem (see Appendix A for simple counterexamples). The goal is to find simple and easily verifiable additional assumptions that preserve the elegance of the method while strengthening the results.
3. Rate Stability and Mean Rate Stability
Proposition 3.1. Suppose that L(Q) is nonnegative and satisfies (3.2), and that there is a constant B ≥ 0 such that 𝔼[Δ(t)] ≤ B for all t ∈ {0,1, 2, …}. Suppose that 𝔼[L(Q(0))] < ∞. Then we have the following.
(a) For all queues k ∈ {1, …, K}, we have
(b) If additionally assumption (3.1) holds for some constants D > 0, θ > 0, then for all queues k ∈ {1, …, K} we have
The condition (3.3) is called mean rate stability, and the condition (3.4) is called rate stability. Both are weak forms of stability. Parts (a) and (b) of the proposition are proven separately.
Proof (Proposition 3.1 Part (a)). By definition of Δ(τ), the statement 𝔼[Δ(τ)] ≤ B is equivalent to
The proof of part (b) follows by noting that the condition (3.10), together with condition (3.1), satisfies the requirements of the following lemma concerning queues with higher-order moments that grow at most linearly.
Lemma 3.2. Let Q(t) be a real-valued stochastic process that evolves over slots t ∈ {0,1, 2, …}, and suppose there are constants b > 0, θ > 0, C > 0, D > 0 such that
Proof. See Appendix B.
4. Convergence of Time Averages
This section considers drift conditions of the type (2.3).
4.1. The Law of Large Numbers for Martingale Differences
Let X(t) be a real-valued stochastic process defined over slots t ∈ {0,1, 2, …}. For integers t > 0, let ℋ(t) be a filtration that includes the information {X(0), …, X(t − 1)}. The following theorem is from [35] and concerns processes that satisfy 𝔼[X(t)∣ℋ(t)] = 0. Such processes are called martingale differences.
Theorem 4.1 (from [35]). Suppose that X(t) is a real-valued random process over t ∈ {0,1, 2, …} such that 𝔼[X(t)∣ℋ(t)] = 0 for all t ∈ {1,2, …} and all ℋ(t). Further suppose that the second moments 𝔼[X(t) 2] are finite for all t and satisfy
Corollary 4.2. Let X(t) be a real-valued random process over slots t ∈ {0,1, 2, …}, and suppose there is a finite constant B such that 𝔼[X(t)∣ℋ(t)] ≤ B for all t ∈ {1,2, …} and all ℋ(t). Further suppose the second moments 𝔼[X(t) 2] are finite for all t and satisfy
Proof. Define . Then . Further, using (4.3), it is not difficult to verify that is finite (note that , and the latter term can be bounded by 2𝔼[X(t) 2] via Jensen′s inequality). Thus, the conditions required for Theorem 4.1 hold, so we conclude that
4.2. A Drift-Plus-Penalty Result for General Lyapunov Functions
Proposition 4.3. Suppose L(Q(0)) is finite with probability 1, that 𝔼[Δ(t) 2], 𝔼[f(Q(t)) 2], 𝔼[p(t) 2] are finite for all t, that (4.7) and (4.8) hold, and that
Proof. Define X(t) = Δ(t) + Vp(t) + ϵf(Q(t)), and note by algebra that
The condition (4.10) together with 𝔼[p(t)∣ℋ(t)] ≥ pmin also implies that
The result of Proposition 4.3 is almost the result we desire, with the exception that the condition (4.9) may be difficult to verify. Note that (4.9) often trivially holds in special cases when all queues are deterministically bounded. Hence, in the flow control algorithms [11, 12] designed from drift-plus-penalty theory, Proposition 4.3 proves that time average penalties satisfy (4.11) with probability 1. The next section shows that this condition follows from the drift assumption when L(Q) is a subquadratic Lyapunov function. Section 6 treats quadratic Lyapunov functions.
5. Subquadratic Lyapunov Functions
5.1. The Drift Structure
5.2. Drift-Plus-Penalty for Subquadratics
Proposition 5.1. Let L(Q) be a subquadratic Lyapunov function of the form (5.1), with 1 < β < 2. Assume that 𝔼[L(Q(0))] < ∞, (5.2) holds for the queue difference process δk(t), and (4.7)-(4.8) hold for the penalty process p(t). Suppose that the drift-plus-penalty condition (5.5) holds for some constants p*, V ≥ 0, B ≥ 0, ϵ ≥ 0. Then for all k ∈ {1, …, K}
Proof. The drift-plus-penalty expression (5.5) implies that
Define . If we can show that
Recall from (5.9) that 𝔼[Δ(τ)] ≤ F for all τ, and so
Note that the final line of the proof requires O(1/t2/β) to be summable, which is true for 1 < β < 2 but not true for β = 2. This is one reason why quadratic Lyapunov functions must be considered separately. Another distinction is that the above proposition holds even for ϵ = 0, whereas we require ϵ > 0 for the quadratic analysis in the next section.
6. Quadratic Lyapunov Functions
Proposition 6.1. Suppose that L(Q) has the quadratic form (6.1), and 𝔼[|Qk(0)|3] < ∞ for all k ∈ {1, …, K}. Assume that the bounded moment conditions (6.2), (4.7), and (4.8) hold. If the drift-plus-penalty expression (6.3) holds for some constants p*, B ≥ 0, V ≥ 0, ϵ > 0, wk > 0 for k ∈ {1, …, K}, then
The proposition is proven with the assistance of the following lemma.
Lemma 6.2. Suppose that the same assumptions of Proposition 6.1 hold, and that the following additional assumption holds. For all k ∈ {1, …, K}, we have
Proof (see Lemma 6.2.)Using , by Proposition 4.3, it suffices to show that
Proposition 6.3. Suppose that L(Q) has the quadratic form (6.1), and that 𝔼[|Qk(0)|3] < ∞ for all k ∈ {1, …, K}. Suppose that (6.2) holds, and that there are constants F ≥ 0, ϵ > 0, and wk > 0 for k ∈ {1, …, K} such that for all t and all ℋ(t)
(a) For all k ∈ {1, …, K} we have
(b) For all k ∈ {1, …, K} we have
Inequalities (6.15)–(6.20) represent six major forms of queue stability. Thus, all of these forms of stability are satisfied whenever the bounded second and fourth moment conditions (6.2) hold, and when the drift condition (6.13) holds for a quadratic Lyapunov function.
Proof. Part (a) is proven in Appendix C. To prove part (b), note that second moments are bounded because (6.2) holds. Further, the quadratic Lyapunov function (6.1) satisfies the condition (3.2), and the expression (6.13) implies that
7. Stochastic Network Optimization
Here we use Propositions 5.1 and 6.1 to analyze an important class of stochastic queueing networks. This is the same scenario treated in [4, 36]. However, while the work in [4] obtains bounds on the time average expectations, here we obtain bounds on the pure time averages. Further, the works [4, 36] treat only quadratic Lyapunov functions. Here we treat both quadratics and subquadratics.
Typical penalties can represent power expenditures [11]. For example, suppose that , where pm(t) is the power incurred in component m of the network on slot t, and is a required time average power expenditure. Then ensuring ensures that , so that the desired time average power constraint is met.
7.1. Computing the Drift-Plus-Penalty Inequality
7.2. The Dynamic Algorithm
7.3. ω-Only Policies and the Slackness Condition
7.4. Performance Bounds for Subquadratics
7.5. Performance Bounds for Quadratics
8. Conclusions
This work derives sample path stability and time average penalty results for stochastic networks. It uses the law of large numbers for martingale differences to strengthen existing performance guarantees from time average expectations to pure time averages, with probability 1. This requires only mild-bounded second and fourth moment assumptions, and these assumptions were shown to be necessary via two simple counterexamples in Appendix A. The analysis considers both quadratic and subquadratic Lyapunov functions and uses both types of functions to develop and analyze algorithms for minimizing a time average penalty subject to network stability constraints and additional time average constraints. These results are useful for control of queueing networks, and for other stochastic problems of optimizing time averages subject to time average constraints.
Acknowledgment
This material is supported in part by one or more of the following: the NSF Career Grant CCF-0747525, NSF grants 0964479 and 1049541, the Network Science Collaborative Technology Alliance sponsored by the U.S. Army Research Laboratory W911NF-09-2-0053.
Appendices
A. Counterexamples
This appendix provides two examples of systems that satisfy the drift-plus-penalty condition (2.3), so that the desired time average expectations (2.7)-(2.8) are satisfied. However, the corresponding sample path time averages do not hold because one of the bounded moment assumptions (6.2), (4.7), and (4.8) is violated.
A.1. The Penalty Process p(t)
A.2. The Queue Process Q(t)
B. Proof of Lemma 3.2
Lemma B.1. If (B.2) holds for b > 1, then lim t→∞ Q(t)/t = 0 with probability 1.
Proof (Fix ϵ > 0). From the Markov inequality, we have for all t ∈ {1,2, 3, …}
Now suppose that 0 < b ≤ 1, and define α = 2/b (so that α ≥ 2). For k ∈ {1,2, 3, …}, define the increasing subsequence of times tk = ⌈kα⌉.
Lemma B.2. If (B.2) holds for 0 < b ≤ 1, then lim k→∞ Q(tk)/tk = 0 with probability 1.
Proof (Lemma B.2, Fix ϵ > 0) . From (B.3), we have for t = tk
To complete the proof that Q(t)/t → 0 with probability 1 for the case 0 < b ≤ 1, we require the following simple lemma, which is also useful in other parts of this paper.
Lemma B.3. Let {X1, …, XM} be a collection of random variables, let γ, a0, a1, …, aM be real numbers, and assume that γ ≥ 1. Then we have
Proof. We have
C. Proof of Proposition 6.3 Part (a)
Lemma C.1. For the quadratic Lyapunov function (C.1), suppose that there are constants F > 0, ϵ > 0, wk > 0 for k ∈ {1, …, K} such that
Proof. From (C.3), we have
The above lemma is related to Proposition 2.5 in [25] and to results in [37]. The results in [25, 37] show that if queue differences δk(t) have bounded exponential moments (such being deterministically bounded), and if some additional assumptions hold, then drift statements of the type (C.3) imply that queue processes have bounded exponential moments. The next result uses only the bounded second and fourth moment conditions (6.2) for δk(t), and so does not require bounded exponential moments.
Lemma C.2. Suppose that (C.3) holds for the quadratic Lyapunov function (C.1) and for some constants ϵ > 0, F > 0. Assume that for all k ∈ {1, …, K} we have 𝔼[|Qk(0)|3] < ∞ and that the queue difference processes δk(t) satisfy (6.2). Then for all k ∈ {1, …, K} we have
Proof. We have Q(τ + 1) = Q(τ) + δ(τ), where δ(τ)≜(δ1(τ), …, δK(τ)). Define γ(τ)≜∥Q(τ + 1)∥ − ∥Q(τ)∥. Then |γ(τ)| ≤ ∥δ(τ)∥ (by the triangle inequality), and we have
Using the fact that 𝔼[δk(t) 2∣ℋ(t)] ≤ D, and , there exists a real number G > 0 for which the above inequality simplifies to