A Refinement of the Integral Jensen Inequality Pertaining Certain Functions with Applications
Abstract
In this paper, we present a new refinement of the integral Jensen inequality by utilizing certain functions and give its applications to various means. We utilize the refinement to obtain some new refinements of the Hermite-Hadamard and Hölder’s inequalities as well. Also, we present its applications in information theory. At the end of this paper, we give a more general form of the proposed refinement of the Jensen inequality, associated to several functions.
1. Introduction
Being an important part of modern applied analysis, the field of mathematical inequalities has recorded an exponential growth with significant impact on various parts of science and technology [1–5]. These inequalities are also extended and generalized in various aspects; one can see such results in [6–15]. The Jensen weighted integral inequality is a central tool among them; its basic form is follows as [16].
Theorem 1. Assume a convex function f : I⟶ℝ and g, h : [θ1, θ2]⟶ℝ are measurable functions such that g(θ) ∈ I and h(θ) ≥ 0∀θ ∈ [θ1, θ2]. Also, suppose that h, gh, (f∘g).h are all integrable functions on [θ1, θ2] and , then
The Jensen inequality is one of the fundamental inequalities in modern applied analysis. This inequality is of pivotal importance because various other classical inequalities, for example, the Beckenbach-Dresher, Minkowski’s, the Hermite-Hadamard, Ky-Fan’s, Hölder’s, the arithmetic-geometric, and Levinson’s and Young’s inequalities, can be deduced from this inequality. Also, this inequality can be treated as a problem solving oriented tool in different areas of science and technology, and an extensive literature is dedicated to this inequality regarding its counterparts, generalizations, improvements, and converse results (see, for instance, [17–21]) and the references therein.
The Hermite-Hadamard inequality is presented as follows ([22], page 10 in [23]).
Theorem 2. Assume a convex function f : [θ1, θ2]⟶ℝ, then the following double inequalities hold:
The Hölder inequality in its integral form is presented as follows [23].
Theorem 3. Let 1 < p, q be such that 1/p + 1/q = 1 and assume two measurable functions say h, g : [θ1, θ2]⟶ℝ such that the functions |h(θ)|p and |g(θ)|q are integrable on [θ1, θ2]. Then
The remaining paper is organized in the following manner: Section 2 proposes a new refinement of the integral Jensen inequality, associated to four functions whose sum is equal to unity in pairs. Utilizing this refinement, we derive some new refinements of the Hölder and Hermite-Hadamard inequalities and some new inequalities for power and quasi-arithmetic means. In Section 3, we focus to deduce inequalities for the Csiszâr divergence, variational distance, Shannon entropy, and Kullback-Leibler divergence. In the last section, we present a more general form of the proposed refinement concerning several certain functions.
2. Main Results
We give the following refinement of the integral Jensen inequality associated to four functions.
Theorem 4. Let f be a convex function defined on the interval I. Also, let h, u, v, w, z, g : [θ1, θ2]⟶ℝ be some integrable functions with the following conditions such as g(θ) ∈ I, h(θ), u(θ), v(θ), w(θ), z(θ) ∈ ℝ+ with u(θ) + w(θ) = 1, v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2] and . Then, for any nonempty subinterval of [θ1, θ2] with the following inequalities hold
For a concave function f, the reverse inequalities hold in (5).
Proof. Since u(θ) + w(θ) = 1, v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2], therefore for the subinterval of [θ1, θ2] with , we have
Multiplying equation (6) by and assigning it to the function f, then by convexity of f, we have
Also, by making use of the integral Jensen inequality, one has
From Theorem 4, we obtain a new refinement of the H-H inequality as follows.
Corollary 6. Let f be a convex function defined on [θ1, θ2]. Also, let u, v, w, z : [θ1, θ2]⟶ℝ be integrable functions such that u(θ), v(θ), w(θ), z(θ) ∈ ℝ+ with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Then, for a subinterval of [θ1, θ2] with , the following inequalities hold
The direction of the inequalities reverses in (11), when the function f becomes concave.
From Theorem 4, we deduce the following refinement of Hölder’s inequality.
Corollary 7. If p, q ∈ ℝ and the functions u, v, w, z, h1, g1, and g2 defined on [θ1, θ2] are nonnegative such that the functions and wh1g1g2, vh1g1g2, zh1g1g2, h1g1g2 are integrable with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Also, if is a subinterval of [θ1, θ2] with , then
- (A)
For p, q > 1 such that (1/p) + (1/q) = 1, the following inequalities hold
- (B)
For 0 < p < 1 and q = p/(p − 1) with , the following inequalities hold
- (C)
For p < 0 and q = p/(p − 1) with , the inequalities in (13) hold
Proof.
- (A)
In the case when p, q > 1, let . Then, by using Theorem 4 for f(θ) = θp, θ > 0, , and , we obtain (12). Also, let ; then, applying the same procedure as above and replacing p, q, g1, g2 by q, p, g2, g1, respectively, we obtain (12)
For and , the inequalities in (12) also hold. This can be proved as follows, since we know that
Taking integral, then with the proposed conditions, we obtain , which concludes the result.
The Hölder inequality is refined by the following corollary.
Corollary 8. Let p, q ∈ ℝ and u, v, w, z, h1, g1, g2 be nonnegative functions defined on [θ1, θ2] such that and are integrable functions with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Also, assume that is a subinterval of [θ1, θ2] with , then
- (A)
For p > 1, q = p/(p − 1) and , the following inequalities hold
- (B)
For 0 < p < 1 and q = p/(p − 1) with , the following inequalities hold
- (C)
For p < 0 and q = p/(p − 1) with , the result in (16) holds
Proof.
- (A)
Let f(θ) = θ1/p, θ > 0 which is clearly a concave function for p > 1. Thus, by using Theorem 4 for f(θ) = θ1/p, , and , we obtain (15). If , then adopting the same procedure and replacing p, q, g1, g2 by q, p, g2, g1, respectively, we obtain (15)
For and , the inequalities in (12) also hold. This can be proved as follows, since we know that
Hence, taking integral and using the proposed conditions, we get , which verifies the result.
- (B)
In the case when 0 < p < 1, applying (15) for p⟶1/p > 1, q⟶(1 − p)−1, , and , we obtain (16)
- (C)
In the case when 0 > p, we have q ∈ (0, 1), which shows that this case reflects case B; therefore, applying the arguments of case B with replacing p, q, g1(θ), g2(θ) by q, p, g2(θ), g1(θ), respectively, we get (16)
Corollary 9. Assume some positive integrable functions h, u, v, w, z and g defined on [θ1, θ2] with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for θ ∈ [θ1, θ2]. Also, assume that α, β ∈ ℝ such that α ≤ β, then
- (A)
For α/β ∈ ℝ − {(−1, 0)}, β ≠ 0, the following inequalities hold
- (B)
For β/α ∈ ℝ − {(−1, 0)}, α ≠ 0, the following inequalities hold
Proof.
- (A)
Let f(θ) = θα/β for θ > 0, then the following possible cases can be discussed:
Case 1. If α, β ∈ ℝ− with α ≤ β, then α/β ≥ 1, and the function f(θ) is convex. Therefore, utilizing (5) for f(θ) and g(θ)⟶gβ(θ), after that taking power 1/α, we obtain (20)
Case 2. If α, β ∈ ℝ+ with α < β, then 0 < α/β < 1, and the function f(θ) is concave. Therefore, utilizing (5) for f(θ) and g(θ)⟶gβ(θ) and then taking power 1/α, we obtain (20)
Case 3. If β ∈ ℝ+, α ∈ ℝ− with α ≤ β, then α/β ≤ 0, and the function f(θ) is convex. Therefore, utilizing (5) for f(θ) and g(θ)⟶gβ(θ) and then taking power 1/α, we obtain (20)
For the case when α = 0, taking of (20), we get (21).
- (B)
Let f(θ) = θβ/α for θ > 0, then the following possible cases can be discussed:
Case 1. If α, β ∈ ℝ+ with α ≤ β, then β/α ≥ 1, and the function f(θ) is convex. Hence, using (5) for f(θ) and g(θ)⟶gα(θ) and then taking power 1/β, we obtain (22)
Case 2. If α, β ∈ ℝ− with α ≤ β, then 0 < β/α < 1, and the function f(θ) is concave. Hence, using (5) for f(θ) and g(θ)⟶gα(θ) and then taking power 1/β, we obtain (22)
Case 3. Similarly, if α ∈ ℝ−, β ∈ ℝ+ with α ≤ β, then β/α ≤ −1, and the function f(θ) is convex function. Hence, using (5) for f(θ) and g(θ)⟶gα(θ) and taking power 1/β, we obtain (22)
For the case when β = 0, taking limβ⟶0 in (22), we get (23).
Some inequalities are given for the quasi-arithmetic mean as follows.
Corollary 10. Assume some positive integrable functions h, u, v, w, z defined on [θ1, θ2] such that u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for θ ∈ [θ1, θ2] and further assume that g is an arbitrary integrable function defined on [θ1, θ2]. Also, suppose that p is a strictly monotone continuous function whose domain is the image of g. Then, for (Ψ∘p−1)(θ) as a convex function, the following inequalities hold
The direction of the inequalities reverses in (26), when the function (Ψ∘p−1)(θ), becomes concave.
Proof. The desired inequalities can be calculated by utilizing (5) for g⟶p∘g and f⟶Ψ∘p−1.
3. Applications in Information Theory
In this section, we use the main result to obtain some new and interesting estimates for various divergences and Shannon entropy in information theory. A literature about the inequalities related to these divergences can be found in [24].
Definition 11 (Csiszâr divergence [25]). Assume that Ψ : I+⟶ℝ is a function defined on a positive interval I+. Also, assume that p, q : [θ1, θ2]⟶(0, ∞) are two integrable functions such that q(θ)/p(θ) ∈ I+ for all θ ∈ [θ1, θ2], then the integral form of Csiszâr-divergence is defined by
Theorem 12. Let Ψ : I+⟶ℝ be a convex function defined on a positive interval I+ and assume that u, v, w, z, p, q : [θ1, θ2]⟶ℝ+ are integrable functions such that q(θ)/p(θ) ∈ I+, u(θ) + w(θ) = 1, and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Then, for a subinterval of [θ1, θ2] with , the following inequalities hold
Definition 13 (Shannon entropy). The Shannon entropy for a positive probability density function p(θ) defined on [θ1, θ2] is given by
Corollary 14. Let u, v, w, z : [θ1, θ2]⟶ℝ+ be integrable functions such that u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Also, let p, q be two positive probability density functions defined on [θ1, θ2], then for a subinterval of [θ1, θ2] with , the following inequalities hold
Remark 15. For q(θ) = 1, the result (30) becomes
Definition 16 (Kullback-Leibler divergence). The Kullback-Leibler divergence for two positive probability densities p(θ) and q(θ) defined on [θ1, θ2] is given by
Corollary 17. Let u, v, w, z, p, q : [θ1, θ2]⟶ℝ+ be integrable functions such that p(θ) and q(θ) are positive probability density functions and u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Then, for a subinterval of [θ1, θ2] with the following inequalities hold
Definition 18 (variational distance). The variational distance for two positive probability densities p(θ) and q(θ) defined on [θ1, θ2] is given by
Corollary 19. Let u, v, w, z, p, q and be interpreted as in Corollary 17, then
Definition 20 (Jeffrey’s distance). Jeffrey’s distance for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by
Corollary 21. Let u, v, w, z, p, q and be interpreted as in Corollary 17, then
Definition 22 (Bhattacharyya coefficient). The Bhattacharyya coefficient for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by
Corollary 23. Let u, v, w, z, p, q and be interpreted as in Corollary 17, then
Definition 24 (Hellinger distance). The Hellinger distance for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by
Corollary 25. Let u, v, w, z, p, q and be defined as in Corollary 17, then
Definition 26 (triangular discrimination). The triangular discrimination for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by
Corollary 27. Let u, v, w, z, p, q and be as stated in Corollary 17, then
3.1. Further Generalization
In this section, we give more general form of the proposed refinement of the integral Jensen inequality concerning several certain functions.
Theorem 28. Let f : I⟶ℝ be a convex function defined on the interval I. Also, let be integrable functions for each i = 1, 2, ⋯, s where for all θ ∈ [θ1, θ2](ℓ = 1, 2, ⋯, n, i = 1, 2, ⋯, s, s ∈ N) and , for each i. Suppose that L1, L2, ⋯, Ls be some nonempty subsets of {1, 2, ⋯, n} such that Lk∩Lt = ∅ for k ≠ t and . Furthermore, if are some nonempty subintervals of [θ1, θ2] such that for k ≠ t and , then the following inequalities hold:
If the function f is concave, then the reverse inequalities hold in (44).
Proof. Since for all θ ∈ [θ1, θ2] and each i = 1, 2, ⋯, s, therefore for the subintervals of [θ1, θ2], we have
Applying the integral Jensen inequality to all terms on the right hand side of (45), we obtain
4. Conclusion
Jensen’s inequality and its refinements can help in various aspects; for example, it helps to authenticate the positivity of Kullback-Leibler divergence, it can be used to obtain some useful estimates for Shannon and Zipf-Mandelbrot entropies and for various divergences in information theory, its gap can be utilized to obtain error bounds in the estimation of certain parameters, and it is useful in the stability analysis of discrete and continuous-time systems with time-varying delay. In this paper, a new refinement of the integral Jensen inequality is proposed with the help of four special type of functions. The refinement is utilized to obtain improved inequalities for various means. Also, new refinements of the Hölder and the Hermite-Hadamard inequalities are obtained. New estimates for several divergences and Shannon entropy are presented. At the end of this paper, more general form of the proposed refinement of the Jensen inequality associated to several special type of functions is established.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Authors’ Contributions
All authors contributed equally to writing of this paper. All authors read and approved the final manuscript.
Open Research
Data Availability
No data were used to support this study.