Volume 2022, Issue 1 8396644
Research Article
Open Access

A Refinement of the Integral Jensen Inequality Pertaining Certain Functions with Applications

Zaid Mohammed Mohammed Mahdi Sayed

Corresponding Author

Zaid Mohammed Mohammed Mahdi Sayed

Department of Mathematics, University of Sa’adah, Sa’adah 1872, Yemen

Department of Mathematics, University of Peshawar, Peshawar 25000, Pakistan uop.edu.pk

Search for more papers by this author
Muhammad Adil Khan

Muhammad Adil Khan

Department of Mathematics, University of Peshawar, Peshawar 25000, Pakistan uop.edu.pk

Search for more papers by this author
Shahid Khan

Shahid Khan

Department of Mathematics, University of Peshawar, Peshawar 25000, Pakistan uop.edu.pk

Search for more papers by this author
Josip Pečarić

Josip Pečarić

Department of Mathematical, Physical and Chemical Sciences, Croatian Academy of Sciences and Arts, Zrinski trg 11, 10000 Zagreb, Croatia hazu.hr

Search for more papers by this author
First published: 30 July 2022
Academic Editor: Hugo Leiva

Abstract

In this paper, we present a new refinement of the integral Jensen inequality by utilizing certain functions and give its applications to various means. We utilize the refinement to obtain some new refinements of the Hermite-Hadamard and Hölder’s inequalities as well. Also, we present its applications in information theory. At the end of this paper, we give a more general form of the proposed refinement of the Jensen inequality, associated to several functions.

1. Introduction

Being an important part of modern applied analysis, the field of mathematical inequalities has recorded an exponential growth with significant impact on various parts of science and technology [15]. These inequalities are also extended and generalized in various aspects; one can see such results in [615]. The Jensen weighted integral inequality is a central tool among them; its basic form is follows as [16].

Theorem 1. Assume a convex function f : I and g, h : [θ1, θ2]⟶ are measurable functions such that g(θ) ∈ I and h(θ) ≥ 0∀θ ∈ [θ1, θ2]. Also, suppose that h, gh, (fg).h are all integrable functions on [θ1, θ2] and , then

(1)

The Jensen inequality is one of the fundamental inequalities in modern applied analysis. This inequality is of pivotal importance because various other classical inequalities, for example, the Beckenbach-Dresher, Minkowski’s, the Hermite-Hadamard, Ky-Fan’s, Hölder’s, the arithmetic-geometric, and Levinson’s and Young’s inequalities, can be deduced from this inequality. Also, this inequality can be treated as a problem solving oriented tool in different areas of science and technology, and an extensive literature is dedicated to this inequality regarding its counterparts, generalizations, improvements, and converse results (see, for instance, [1721]) and the references therein.

The Hermite-Hadamard inequality is presented as follows ([22], page 10 in [23]).

Theorem 2. Assume a convex function f : [θ1, θ2]⟶, then the following double inequalities hold:

(2)

The Hölder inequality in its integral form is presented as follows [23].

Theorem 3. Let 1 < p, q be such that 1/p + 1/q = 1 and assume two measurable functions say h, g : [θ1, θ2]⟶ such that the functions |h(θ)|p and |g(θ)|q are integrable on [θ1, θ2]. Then

(3)

The remaining paper is organized in the following manner: Section 2 proposes a new refinement of the integral Jensen inequality, associated to four functions whose sum is equal to unity in pairs. Utilizing this refinement, we derive some new refinements of the Hölder and Hermite-Hadamard inequalities and some new inequalities for power and quasi-arithmetic means. In Section 3, we focus to deduce inequalities for the Csiszâr divergence, variational distance, Shannon entropy, and Kullback-Leibler divergence. In the last section, we present a more general form of the proposed refinement concerning several certain functions.

2. Main Results

Assuming a real valued convex function f defined on the interval I and suppose that h, u, v, w, z, g : [θ1, θ2]⟶ are some integrable functions with the following conditions g(θ) ∈ I, h(θ), u(θ), v(θ), w(θ), z(θ) ∈ + with u(θ) + w(θ) = 1, v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2] and . Also, let for a nonempty subinterval of [θ1, θ2], we put ; then, for the above facts, we can define the following functional
(4)

We give the following refinement of the integral Jensen inequality associated to four functions.

Theorem 4. Let f be a convex function defined on the interval I. Also, let h, u, v, w, z, g : [θ1, θ2]⟶ be some integrable functions with the following conditions such as g(θ) ∈ I, h(θ), u(θ), v(θ), w(θ), z(θ) ∈ + with u(θ) + w(θ) = 1, v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2] and . Then, for any nonempty subinterval of [θ1, θ2] with the following inequalities hold

(5)

For a concave function f, the reverse inequalities hold in (5).

Proof. Since u(θ) + w(θ) = 1, v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2], therefore for the subinterval of [θ1, θ2] with , we have

(6)

Multiplying equation (6) by and assigning it to the function f, then by convexity of f, we have

(7)

Also, by making use of the integral Jensen inequality, one has

(8)

From (7) and (8), we get (5).

Remark 5. The following is an equivalent form of the inequality (5)

(9)
(10)

From Theorem 4, we obtain a new refinement of the H-H inequality as follows.

Corollary 6. Let f be a convex function defined on [θ1, θ2]. Also, let u, v, w, z : [θ1, θ2]⟶ be integrable functions such that u(θ), v(θ), w(θ), z(θ) ∈ + with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Then, for a subinterval of [θ1, θ2] with , the following inequalities hold

(11)

The direction of the inequalities reverses in (11), when the function f becomes concave.

Proof. Utilizing Theorem 4 for h(θ) = 1, g(θ) = θ for all θ ∈ [θ1, θ2], we get (11).

From Theorem 4, we deduce the following refinement of Hölder’s inequality.

Corollary 7. If p, q and the functions u, v, w, z, h1, g1, and g2 defined on [θ1, θ2] are nonnegative such that the functions and wh1g1g2, vh1g1g2, zh1g1g2, h1g1g2 are integrable with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Also, if is a subinterval of [θ1, θ2] with , then

  • (A)

    For p, q > 1 such that (1/p) + (1/q) = 1, the following inequalities hold

(12)
  • (B)

    For 0 < p < 1 and q = p/(p − 1) with , the following inequalities hold

(13)
  • (C)

    For p < 0 and q = p/(p − 1) with , the inequalities in (13) hold

Proof.

  • (A)

    In the case when p, q > 1, let . Then, by using Theorem 4 for f(θ) = θp, θ > 0, , and , we obtain (12). Also, let ; then, applying the same procedure as above and replacing p, q, g1, g2 by q, p, g2, g1, respectively, we obtain (12)

For and , the inequalities in (12) also hold. This can be proved as follows, since we know that

(14)

Taking integral, then with the proposed conditions, we obtain , which concludes the result.

  • (B)

    In the case when 0 < p < 1, we have q < 0. So, applying (12) for p⟶1/p > 1, q⟶(1 − p)−1, , and , we obtain (13)

  • (C)

    In the case when p < 0, we have 0 < q < 1. So, applying the arguments of part B with replacing p, q, g1(θ), g2(θ) by q, p, g2(θ), g1(θ), respectively, we get (13)

The Hölder inequality is refined by the following corollary.

Corollary 8. Let p, q and u, v, w, z, h1, g1, g2 be nonnegative functions defined on [θ1, θ2] such that and are integrable functions with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Also, assume that is a subinterval of [θ1, θ2] with , then

  • (A)

    For p > 1, q = p/(p − 1) and , the following inequalities hold

(15)
  • (B)

    For 0 < p < 1 and q = p/(p − 1) with , the following inequalities hold

(16)
  • (C)

    For p < 0 and q = p/(p − 1) with , the result in (16) holds

Proof.

  • (A)

    Let f(θ) = θ1/p, θ > 0 which is clearly a concave function for p > 1. Thus, by using Theorem 4 for f(θ) = θ1/p, , and , we obtain (15). If , then adopting the same procedure and replacing p, q, g1, g2 by q, p, g2, g1, respectively, we obtain (15)

For and , the inequalities in (12) also hold. This can be proved as follows, since we know that

(17)

Hence, taking integral and using the proposed conditions, we get , which verifies the result.

  • (B)

    In the case when 0 < p < 1, applying (15) for p⟶1/p > 1, q⟶(1 − p)−1, , and , we obtain (16)

  • (C)

    In the case when 0 > p, we have q ∈ (0, 1), which shows that this case reflects case B; therefore, applying the arguments of case B with replacing p, q, g1(θ), g2(θ) by q, p, g2(θ), g1(θ), respectively, we get (16)

Let h and g be positive integrable functions defined on [θ1, θ2] and be any nonempty subinterval of [θ1, θ2], then the integral power means of order r are defined as
(18)
(19)

Corollary 9. Assume some positive integrable functions h, u, v, w, z and g defined on [θ1, θ2] with u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for θ ∈ [θ1, θ2]. Also, assume that α, β such that αβ, then

  • (A)

    For α/β − {(−1, 0)}, β ≠ 0, the following inequalities hold

(20)
(21)
  • (B)

    For β/α − {(−1, 0)}, α ≠ 0, the following inequalities hold

(22)
(23)

Proof.

  • (A)

    Let f(θ) = θα/β for θ > 0, then the following possible cases can be discussed:

Case 1. If α, β with αβ, then α/β ≥ 1, and the function f(θ) is convex. Therefore, utilizing (5) for f(θ) and g(θ)⟶gβ(θ), after that taking power 1/α, we obtain (20)

Case 2. If α, β+ with α < β, then 0 < α/β < 1, and the function f(θ) is concave. Therefore, utilizing (5) for f(θ) and g(θ)⟶gβ(θ) and then taking power 1/α, we obtain (20)

Case 3. If β+, α with αβ, then α/β ≤ 0, and the function f(θ) is convex. Therefore, utilizing (5) for f(θ) and g(θ)⟶gβ(θ) and then taking power 1/α, we obtain (20)

For the case when α = 0, taking of (20), we get (21).

  • (B)

    Let f(θ) = θβ/α for θ > 0, then the following possible cases can be discussed:

Case 1. If α, β+ with αβ, then β/α ≥ 1, and the function f(θ) is convex. Hence, using (5) for f(θ) and g(θ)⟶gα(θ) and then taking power 1/β, we obtain (22)

Case 2. If α, β with αβ, then 0 < β/α < 1, and the function f(θ) is concave. Hence, using (5) for f(θ) and g(θ)⟶gα(θ) and then taking power 1/β, we obtain (22)

Case 3. Similarly, if α, β+ with αβ, then β/α ≤ −1, and the function f(θ) is convex function. Hence, using (5) for f(θ) and g(θ)⟶gα(θ) and taking power 1/β, we obtain (22)

For the case when β = 0, taking limβ⟶0 in (22), we get (23).

Let h and u be some positive integrable functions and g be an arbitrary integrable function defined on [θ1, θ2]. Further, if p is considered as a strictly monotone and continuous function whose domain is the image of g, then for any nonempty subinterval of [θ1, θ2], the quasi-arithmetic mean is defined by
(24)
(25)

Some inequalities are given for the quasi-arithmetic mean as follows.

Corollary 10. Assume some positive integrable functions h, u, v, w, z defined on [θ1, θ2] such that u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for θ ∈ [θ1, θ2] and further assume that g is an arbitrary integrable function defined on [θ1, θ2]. Also, suppose that p is a strictly monotone continuous function whose domain is the image of g. Then, for (Ψp−1)(θ) as a convex function, the following inequalities hold

(26)

The direction of the inequalities reverses in (26), when the function (Ψp−1)(θ), becomes concave.

Proof. The desired inequalities can be calculated by utilizing (5) for gpg and fΨp−1.

3. Applications in Information Theory

In this section, we use the main result to obtain some new and interesting estimates for various divergences and Shannon entropy in information theory. A literature about the inequalities related to these divergences can be found in [24].

Definition 11 (Csiszâr divergence [25]). Assume that Ψ : I+ is a function defined on a positive interval I+. Also, assume that p, q : [θ1, θ2]⟶(0, ∞) are two integrable functions such that q(θ)/p(θ) ∈ I+ for all θ ∈ [θ1, θ2], then the integral form of Csiszâr-divergence is defined by

(27)

Theorem 12. Let Ψ : I+ be a convex function defined on a positive interval I+ and assume that u, v, w, z, p, q : [θ1, θ2]⟶+ are integrable functions such that q(θ)/p(θ) ∈ I+, u(θ) + w(θ) = 1, and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Then, for a subinterval of [θ1, θ2] with , the following inequalities hold

(28)

Proof. Using Theorem 4 for fΨ, g(θ) = q(θ)/p(θ), and h(θ) = p(θ), we obtain (28).

Definition 13 (Shannon entropy). The Shannon entropy for a positive probability density function p(θ) defined on [θ1, θ2] is given by

(29)

Corollary 14. Let u, v, w, z : [θ1, θ2]⟶+ be integrable functions such that u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Also, let p, q be two positive probability density functions defined on [θ1, θ2], then for a subinterval of [θ1, θ2] with , the following inequalities hold

(30)

Proof. Using (28) for the function Ψ(ζ) = −logζ, ζI+, we obtain (30).

Remark 15. For q(θ) = 1, the result (30) becomes

(31)

Definition 16 (Kullback-Leibler divergence). The Kullback-Leibler divergence for two positive probability densities p(θ) and q(θ) defined on [θ1, θ2] is given by

(32)

Corollary 17. Let u, v, w, z, p, q : [θ1, θ2]⟶+ be integrable functions such that p(θ) and q(θ) are positive probability density functions and u(θ) + w(θ) = 1 and v(θ) + z(θ) = 1 for all θ ∈ [θ1, θ2]. Then, for a subinterval of [θ1, θ2] with the following inequalities hold

(33)

Proof. Using (28) for the convex function Ψ(ζ) = ζlogζ, ζI+, we obtain (33).

Definition 18 (variational distance). The variational distance for two positive probability densities p(θ) and q(θ) defined on [θ1, θ2] is given by

(34)

Corollary 19. Let u, v, w, z, p, q and be interpreted as in Corollary 17, then

(35)

Proof. Using the convex function Ψ(ζ) = |ζ − 1| for ζI+ in (28), we obtain (35).

Definition 20 (Jeffrey’s distance). Jeffrey’s distance for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by

(36)

Corollary 21. Let u, v, w, z, p, q and be interpreted as in Corollary 17, then

(37)

Proof. Using the convex function Ψ(ζ) = (ζ − 1)logζ, ζI+ in (28), we obtain (37).

Definition 22 (Bhattacharyya coefficient). The Bhattacharyya coefficient for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by

(38)

Corollary 23. Let u, v, w, z, p, q and be interpreted as in Corollary 17, then

(39)

Proof. Using the convex function , ζI+ in (28), we obtain (39).

Definition 24 (Hellinger distance). The Hellinger distance for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by

(40)

Corollary 25. Let u, v, w, z, p, q and be defined as in Corollary 17, then

(41)

Proof. Using (28) for the convex function we obtain (41).

Definition 26 (triangular discrimination). The triangular discrimination for two positive probability density functions p(θ) and q(θ) defined on [θ1, θ2] is given by

(42)

Corollary 27. Let u, v, w, z, p, q and be as stated in Corollary 17, then

(43)

Proof. Utilizing the convex function Ψ(ζ) = (ζ − 1)2/(ζ + 1), ζI+ in (28), we obtain (43).

3.1. Further Generalization

In this section, we give more general form of the proposed refinement of the integral Jensen inequality concerning several certain functions.

Theorem 28. Let f : I be a convex function defined on the interval I. Also, let be integrable functions for each i = 1, 2, ⋯, s where for all θ ∈ [θ1, θ2]( = 1, 2, ⋯, n, i = 1, 2, ⋯, s, sN) and , for each i. Suppose that L1, L2, ⋯, Ls be some nonempty subsets of {1, 2, ⋯, n} such that LkLt = ∅ for kt and . Furthermore, if are some nonempty subintervals of [θ1, θ2] such that for kt and , then the following inequalities hold:

(44)

If the function f is concave, then the reverse inequalities hold in (44).

Proof. Since for all θ ∈ [θ1, θ2] and each i = 1, 2, ⋯, s, therefore for the subintervals of [θ1, θ2], we have

(45)

Applying the integral Jensen inequality to all terms on the right hand side of (45), we obtain

(46)
which concludes the result (44).

Remark 29. Using Theorem 28 for n = s = 2, we obtain Theorem 4. Also, one can present similar applications of Theorem 28 as in the previous sections.

4. Conclusion

Jensen’s inequality and its refinements can help in various aspects; for example, it helps to authenticate the positivity of Kullback-Leibler divergence, it can be used to obtain some useful estimates for Shannon and Zipf-Mandelbrot entropies and for various divergences in information theory, its gap can be utilized to obtain error bounds in the estimation of certain parameters, and it is useful in the stability analysis of discrete and continuous-time systems with time-varying delay. In this paper, a new refinement of the integral Jensen inequality is proposed with the help of four special type of functions. The refinement is utilized to obtain improved inequalities for various means. Also, new refinements of the Hölder and the Hermite-Hadamard inequalities are obtained. New estimates for several divergences and Shannon entropy are presented. At the end of this paper, more general form of the proposed refinement of the Jensen inequality associated to several special type of functions is established.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Authors’ Contributions

All authors contributed equally to writing of this paper. All authors read and approved the final manuscript.

Data Availability

No data were used to support this study.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.