A Strong Limit Theorem for Functions of Continuous Random Variables and an Extension of the Shannon-McMillan Theorem
Abstract
By means of the notion of likelihood ratio, the limit properties of the sequences of arbitrary-dependent continuous random variables are studied, and a kind of strong limit theorems represented by inequalities with random bounds for functions of continuous random variables is established. The Shannon-McMillan theorem is extended to the case of arbitrary continuous information sources. In the proof, an analytic technique, the tools of Laplace transform, and moment generating functions to study the strong limit theorems are applied.
1. Introduction
Definition 1.1. Let {Xn, n ≥ 1} be a sequence of random variables with joint distribution (1.1), and let gk(xk) (k = 1, 2, …,n) be defined by (1.2). Let
Although r(ω) is not a proper metric between probability measures, we nevertheless think of it as a measure of “dissimilarity” between their joint distribution fn(x1, … ,xn) and the product πn(x1, … ,xn) of their marginals.
Obviously, r(ω) = 0, a.s. if and only if {Xn, n ≥ 1} are independent.
A stochastic process of fundamental importance in the theory of testing hypotheses is the sequence of likelihood ratio. In view of the above discussion of the asymptotic logarithmic likelihood ratio, it is natural to think of r(ω) as a measure how far (the random deviation of) Xn is from being independent, how dependent they are. The smaller r(ω) is, the smaller the deviation is (cf. [3–5]).
In [3], the strong deviation theorems for discrete random variables were discussed by using the generating function method. Later, the approach of Laplace transform to study the strong limit theorems was first proposed by Liu [4]. Yang [6] further studied the limit properties for Markov chains indexed by a homogeneous tree through the analytic technique. Many comprehensive works may be found in Liu [7]. The purpose of this paper is to establish a kind of strong deviation theorems represented by inequalities with random bounds for functions of arbitrary continuous random variables, by combining the analytic technique with the method of Laplace transform, and to extend the strong deviation theorems to the differential entropy for arbitrary-dependent continuous information sources in more general settings.
Definition 1.2. Let {hn(xn), n ≥ 1} be a sequence of nonnegative ℬ orel measurable functions defined on ℛ, the Laplace transform of random variables hn(Xn) on the probability space (Ω,ℱ,Q) is defined by
- (1)
Assume that there exists s0 ∈ (0, ∞) such that
(1.6) - (2)
Assume M > 0 is a constant, satisfying
(1.7)
In order to prove our main results, we first give a lemma, and it will be shown that it plays a central role in the proofs.
Lemma 1.3. Let fn(x1, … ,xn), gn(x1, … ,xn) be two probability functions on (Ω,ℱ,P), let
2. Main Results
Theorem 2.1. Let {Xn, n ≥ 1}, Zn(ω), r(ω), fn(s) be defined as before, and under the assumptions of (1) and (2), let
Remark 2.2. Let
Proof. Let s be an arbitrary real number in (−s0,s0), let
Let −s0 < s < 0 in (2.18), by (2.20) and (2.1), we have
By (2.4), (2.5), and (2.14), if x > 0, we have
Corollary 2.3. If P = Q, or {Xn, n ≥ 1} is a sequence of independent random variables, and under the assumptions of (1) and (2), then
3. An Extension of the Shannon-McMillan Theorem
In order to understand better, we first introduce some definitions in information theory in this section.
A question of importance in information theory is the study of the limit properties of the relative entropy density fn(ω). Since Shannon′s initial work was published (cf. [11]), there has been a great deal of investigation about this question (e.g., cf. [12–20]).
In this paper, a class of small deviation theorems (i.e., the strong limit theorems represented by inequalities) is established by using the analytical technique, and an extension of the Shannon-McMillan theorem to the arbitrary-dependent continuous information sources is given. Especially, an approach of applying the tool of Laplace transform to the study of the strong deviation theorems on the differential entropy is proposed.
Let hk(xk) = −ln gk(xk) (1 ≤ k ≤ n, n = 1, 2, …) in (1.5), then we give the following definitions.
Definition 3.1. The Laplace transform of −ln gk(xk) is defined by
Definition 3.2. The differential entropy for continuous random variables Xk is defined by
In the following theorem, let {Xn, n ≥ 1} be independent random variables with respect to Q, then the reference density function , and let hk(Xk) = −ln gk(Xk) (1 ≤ k ≤ n) in Theorem 2.1.
Theorem 3.3. Let {Xn, n ≥ 1}, Ln(ω), L(ω), fn(s) be given as above, and under the assumptions of (1) and (2), let
Remark 3.4. Let
Corollary 3.5. Let pn(ω) be defined by (3.2). Under the condition of Theorem 3.3, then
Corollary 3.6. If P = Q, or {Xn, n ≥ 1} are independent random variables, and there exists s0 > 0, such that (2.1) holds, then
Acknowledgments
This research is supported by the National Natural Science Foundation of China (Grants nos. 10671052 and 10571008), the Natural Science Foundation of Beijing (Grant no. 1072004), Funding Project for Academic Human Resources Development in Institutions of Higher Learning Under the Jurisdiction of Beijing Municipality, the Basic Research and Frontier Technology Foundation of Henan (Grant no. 072300410090), and the Natural Science Research Project of Henan (Grant no. 2008B110009). The authors would like to thank the editor and the referees for helpful comments, which helped to improve an earlier version of the paper.