In this paper we use large deviation theory to determine the equilibrium distribution of a basic droplet model that underlies a number of important models in material science and statistical mechanics. Given and c > b, K distinguishable particles are placed, each with equal probability 1/N, onto the N sites of a lattice, where K/N equals c. We focus on configurations for which each site is occupied by a minimum of b particles. The main result is the large deviation principle (LDP), in the limit K → ∞ and N → ∞ with K/N = c, for a sequence of random, number-density measures, which are the empirical measures of dependent random variables that count the droplet sizes. The rate function in the LDP is the relative entropy R(θ∣ρ^∗), where θ is a possible asymptotic configuration of the number-density measures and ρ^∗ is a Poisson distribution with mean c, restricted to the set of positive integers n satisfying n ≥ b. This LDP implies that ρ^∗ is the equilibrium distribution of the number-density measures, which in turn implies that ρ^∗ is the equilibrium distribution of the random variables that count the droplet sizes.

1. Introduction

This paper is motivated by a natural question for a basic model of a droplet. Given and c > b, K distinguishable particles are placed, each with equal probability 1/N, onto the N sites of a lattice Λ_N = {1,2, …, N}. Under the assumption that K/N = c and that each site is occupied by a minimum of b particles, what is the equilibrium distribution, as N → ∞, of the number of particles per site? We prove in Corollary 3 that this equilibrium distribution is a Poisson distribution, with mean c, restricted to the set of positive integers n satisfying n ≥ b. As we explain near the end of the Introduction, this equilibrium distribution has important applications to technologies using sprays and powders.

As in many other models in statistical mechanics, we can identify the equilibrium distribution by exhibiting it as the unique minimum point of a rate function in a large deviation principle (LDP). Other models for which this procedure can be implemented are discussed at the end of the Introduction.

For the droplet model we prove the LDP for a sequence of random probability measures, called number-density measures, which are the empirical measures of a sequence of dependent random variables that count the droplet sizes. This LDP is stated in Theorem 1. Our proof is self-contained and starts from first principles, using techniques that are familiar in applied mathematics and statistical mechanics. For example, the proof of the local large deviation estimate in Theorem 5, a key step in the proof of the LDP for the number-density measures, is based on combinatorics, Stirling’s formula, and Laplace asymptotics.

Our use of combinatorial methods goes back to Boltzmann in his work on the discrete ideal gas. He calculated the Maxwell-Boltzmann equilibrium distribution for this system by analyzing the asymptotic behavior of a particular multinomial coefficient [1]. Starting with Boltzmann’s work, combinatorial methods have remained an important tool in both statistical mechanics and in the theory of large deviations, offering insights into a wide variety of physical and mathematical phenomena via techniques that are elegant, powerful, and often elementary. In applications to statistical mechanics, this state of affairs is explained by the observation that “many fundamental questions … are inherently combinatorial, … including the Ising model, the Potts model, monomer-dimer systems, self-avoiding walks and percolation theory” [2]. For the two-dimensional Ising model and other exactly soluble models, [3, 4] are recommended.

A similar situation holds in the theory of large deviations. For example, Section 2.1 of [5] discusses combinatorial techniques for finite alphabets and points out that because of the concreteness of these applications the LDPs are proved under much weaker conditions than the corresponding results in the general theory, into which the finite-alphabet results give considerable insight. The text [6] devotes several early sections to large deviation results for i.i.d. random variables having a finite state space and proved by combinatorial methods, including a sophisticated, level-3 result for the empirical pair measure.

In order to formulate the LDP for the number-density measures in our droplet model, a standard probabilistic model is introduced. The configuration space is the set consisting of all ω = (ω₁, ω₂, …, ω_K), where ω_i denotes the site in Λ_N occupied by the ith particle. The cardinality of Ω_N equals N^K. Denote by P_N the uniform probability measure that assigns equal probability 1/N^K to each of the N^K configurations ω ∈ Ω_N. The asymptotic analysis of the droplet model involves the two random variables, which are functions of the configuration ω ∈ Ω_N: for , denotes the number of particles occupying the site in the configuration ω; for , N_j(ω) denotes the number of sites for which .

We focus on the subset of Ω_N consisting of all configurations ω for which every site of Λ_N is occupied by at least b particles. Because of this restriction N_j(ω) is indexed by

. It is useful to think of each particle as having one unit of mass and of the set of particles at each site

as defining a droplet. With this interpretation, for each configuration ω,

denotes the mass or size of the droplet at site

. The jth droplet class has N_j(ω) droplets and mass jN_j(ω). Because the number of sites in Λ_N equals N and the sum of the masses of all the droplet classes equals K, the following conservation laws hold for such configurations:

()

In addition, since the total number of particles is K, it follows that

. These equality constraints show that the random variables N_j and

are not independent.

In order to carry out the asymptotic analysis of the droplet model, we introduce a quantity m = m(N) that converges to ∞ sufficiently slowly with respect to N; specifically, we require that m(N) ²/N → 0 as N → ∞. In terms of b and m we define the subset Ω_N,b,m of Ω_N consisting of all configurations ω for which every site of Λ_N is occupied by at least b particles and at most m of the quantities N_j(ω) are positive. This second condition is a useful technical device that allows us to control the errors in several estimates. In Appendix D of [7] we present evidence supporting the conjecture that this condition can be eliminated. The discussion in that appendix involves a number of interesting topics including Stirling numbers of the second kind (see [8, pp. 96-97] and [9, §5.4]) and their asymptotic behavior [10, Example 5.4].

The random quantities in the droplet model for which we formulate an LDP are the number-density measures Θ_N,b. For ω ∈ Ω_N,b,m these random probability measures assign to the probability N_j(ω)/N, which is the number density of the jth droplet class. Because of the two conservation laws in (1) and because K/N = c, for ω ∈ Ω_N,b,m, Θ_N,b(ω) is a probability measure on having mean c. Thus Θ_N,b takes values in , which is defined to be the set of probability measures on having mean c.

The probability measure P_N,b,m defining the droplet model is obtained by restricting the uniform measure P_N to the set of configurations Ω_N,b,m. Thus P_N,b,m equals the conditional probability P_N(·∣Ω_N,b,m). In the language of statistical mechanics P_N,b,m defines a microcanonical ensemble that incorporates the conservation laws for number and mass expressed in (1).

A natural question is to determine two equilibrium distributions: the equilibrium distribution ρ^∗ of the number-density measures and the equilibrium distribution

of the droplet-size random variables

. These distributions are defined by the following two limits: for any ε > 0, any

, and all

()

where B(ρ^∗, ε) denotes the open ball with center ρ^∗ and radius ε defined with respect to an appropriate metric on

. As we prove, the equilibrium distributions of Θ_N,b and

coincide. As in many models in statistical mechanics, an efficient way to determine the equilibrium distribution Θ_N,b is to prove an LDP for Θ_N,b, which we carry out in Theorem 1. This theorem is the main result in the paper.

The content of Theorem 1 is the following: as N → ∞, the sequence of number-density measures Θ_N,b satisfies the LDP on with respect to the measures P_N,b,m. The rate function is the relative entropy R(θ∣ρ_b,α) of with respect to the Poisson distribution ρ_b,α on having components ρ_b,α;j = [Z_b(α)] ⁻¹ · α^j/j! for . In this formula Z_b(α) is the normalization that makes ρ_b,α a probability measure, and α equals the unique value α_b(c) for which has mean c [Theorem A.2]. Using the fact that equals 0 at the unique measure , we apply the LDP for Θ_N,b to conclude in Theorem 2 that is the equilibrium distribution of Θ_N,b. Corollary 3 then implies that is also the equilibrium distribution of .

The space is the most natural space on which to formulate the LDP for Θ_N,b in Theorem 1. Not only is the smallest convex set of probability measures containing the range of Θ_N,b for all , but also the union over of the range of Θ_N,b is dense in . As we explain in part (a) of Theorem 4, is not a complete, separable metric space, a situation that prevents us from directly applying general results in the theory of large deviations that require the setting of a complete, separable metric space.

The droplet model is defined in Section 2. Step 1 in the proof of the LDP for Θ_N,b is to derive the local large deviation estimate in part (b) of Theorem 5. This local estimate, one of the centerpieces of the paper, gives information not available in the LDP for Θ_N,b, which involves global estimates. Step 2 is to lift the local large deviation estimate to the large deviation limit for Θ_N,b lying in open balls and certain other subsets of while Step 3 is to lift the large deviation limit for open balls and certain other subsets to the LDP for Θ_N,b stated in Theorem 1. Steps 2 and 3 are explained in Section 4.

Details of Steps 2 and 3 as well as other routine proofs are omitted from the present paper. They appear in the unpublished companion paper [7], which also contains additional background material. The paper [1] explores how our work on the droplet model was inspired by the work of Ludwig Boltzmann on a simple model of a discrete ideal gas. The main connection is via the local large deviation estimate in part (b) of Theorem 5. When b = 0, the LDP for a path version of Θ_n,0 with K = tN and t > 0 varying appears in [11, 12].

The main application of the results in this paper is to technologies using sprays and powders, which are ubiquitous in many fields, including agriculture, the chemical and pharmaceutical industries, consumer products, electronics, manufacturing, material science, medicine, mining, paper making, the steel industry, and waste treatment. In this paper we focus on sprays; our theory also applies to powders with only changes in terminology [13]. The behavior of sprays might be complex depending on various parameters including evaporation, temperature, and viscosity. Our goal here is to consider the simplest model where the only assumption is made on the average size of droplets in the spray. In many situations it is important to have good control over the sizes of the droplets, which can be translated into properties of probability distributions. The size distributions are important because they determine reliability and safety in each particular application.

Interestingly, there does not seem to be a rigorous theory that predicts the equilibrium distribution of droplet sizes, analogous to the Maxwell-Boltzmann distribution of energy levels in a discrete ideal gas [14, 15]. Our goal in the present paper is to provide such a theory. We do so by focusing on one aspect of the problem related to the relative entropy, an approach that characterizes the equilibrium distribution of droplet sizes as being a Poisson distribution restricted to . We expect that this distribution will be important in experimental observations. A full understanding of droplet behavior under dynamic conditions requires treating many other aspects and is beyond the scope of this paper. We plan to apply the ideas in this paper to understand the entropy of dislocation networks [16].

The importance of predicting droplet size can be seen from the wide range of applications utilizing sprays [17, 18]. Because of the importance of this problem, novel approaches for measuring size distribution of droplet size in sprays have been developed [19–23]. What makes the problem of predicting droplet size particularly interesting is the complexity of droplet-size distribution, which is attributed to many factors such as temperature and viscosity. As [24] shows, even the nozzle plays a significant role in the outcome. Many theoretical tools used to understand the distribution of droplet size in sprays include entropy [25], which also plays a key role in the present paper.

We end the Introduction by expanding on a comment made at the beginning of this section. This comment concerns one of the main applications of large deviation theory in statistical mechanics, which is to identify the equilibrium distribution or distributions of a model as the minimum point(s) of the rate function in an LDP for the model. This procedure is also useful to study phase transitions in the model, which concern how the structure of the set of equilibrium distributions changes as the parameters defining the model change. There are numerous other models for which this procedure has been used. They include the following three lattice spin models: the Curie-Weiss spin system, the Curie-Weiss-Potts model, and the mean-field Blume-Capel model, which is also known as the mean-field BEG model. As explained in the respective Sections 6.6.1, 6.6.2, and 6.6.3 of [26], the large deviation analysis shows that each of these three models has a different phase transition structure. Details of the analysis for the three models are given in the references [6, §IV.4], [27–29]. Section 9 of [30] outlines how large deviation theory can be applied to determine equilibrium structures in statistical models of two-dimensional turbulence. Details of this analysis are given in [31].

2. Definition of Droplet Model and Main Theorem

After defining the droplet model, we state the main theorem in the paper, Theorem 1. The content of this theorem is the LDP for the sequence of random, number-density measures, which are the empirical measures of a sequence of dependent random variables that count the droplet sizes in the model. As we show in Theorem 2 and in Corollary 3, the LDP enables us to identify a Poisson distribution as the equilibrium distribution both of the number-density measures and of the droplet-size random variables. In Theorem 4 we prove a number of properties of two spaces of probability measures in terms of which the LDP for the number-density measures is formulated.

We start by fixing parameters and c ∈ (b, ∞). The droplet model is defined by a probability measure P_N,b parameterized by and the nonnegative integer b. The measure depends on two other positive integers, K and m, where 2 ≤ m ≤ N < K. Both K and m are functions of N in the large deviation limit N → ∞. In this limit we take K → ∞ and N → ∞, where K/N, the average number of particles per site, equals c. Thus K = Nc. In addition, we take m → ∞ sufficiently slowly by choosing m to be a function m(N) satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞; for example, m(N) = N^δ for some δ ∈ (0,1/2). Throughout this paper we fix such a function m(N). The parameter b and the function m = m(N) first appear in the definition of the set of configurations Ω_N,b,m in (3), where these quantities will be explained.

Because K and N are integers, c must be a rational number. This in turn imposes a restriction on the values of N and K. If c is a positive integer, then N → ∞ along the positive integers and K → ∞ along the subsequence K = cN. If c = x/y, where x and y are relatively prime, positive integers with y ≥ 2, then N → ∞ along the subsequence N = yn for and K → ∞ along the subsequence K = cN = xn. Throughout this paper, when we write or N → ∞, it is understood that N and K satisfy the restrictions discussed here.

In the droplet model K distinguishable particles are placed, each with equal probability 1/N, onto the sites of the lattice Λ_N = {1,2, …, N}. This simple description corresponds to a simple probabilistic model. The configuration space is the set consisting of all sequences ω = (ω₁, ω₂, …, ω_K), where ω_i ∈ Λ_N denotes the site in Λ_N occupied by the ith particle. Let ρ^(N) be the measure on Λ_N that assigns equal probability 1/N to each site in Λ_N, and let P_N = (ρ^(N)) ^K be the product measure on Ω_N with equal one-dimensional marginals ρ^(N). Thus P_N is the uniform probability measure that assigns equal probability 1/N^K to each of the N^K configurations ω ∈ Ω_N; for subsets A of Ω_N we have P_N(A) = card⁡(A)/N^K, where card denotes cardinality.

The asymptotic analysis of the droplet model involves two random variables. For and ω ∈ Ω_N, denotes the number of particles occupying site in the configuration ω. For and ω ∈ Ω_N, N_j(ω) denotes the number of sites for which . The dependence of and N_j(ω) on N is not indicated in the notation. Because the distributions of both random variables depend on N, both and N_j form triangular arrays.

We now specify the role played by the nonnegative integer b, first focusing on the case where b is a positive integer. The case where b = 0 is discussed later. For ω ∈ Ω_N, in general there exist sites

for which

; that is, sites that are occupied by 0 particles. The next step in the definition of the droplet model is to restrict to a subset Ω_N,b,m of configurations ω ∈ Ω_N for which every site is occupied by at least b particles and the following constraint holds: for any configuration ω ∈ Ω_N,b,m at most m of the components N_j(ω) are positive, where m = m(N) → ∞ and m(N) ²/N → 0 as N → ∞. Because for ω ∈ Ω_N,b,m every site

is occupied by at least b particles, we have

and N_j(ω) is indexed by

. We denote by N(ω) the sequence

and define

. In terms of this notation

()

The constraint restricting the number of positive components of N(ω) is a useful technical device that allows us to control the errors in several estimates. In Appendix D of [7] we give evidence supporting the conjecture that this restriction can be eliminated.

When b is a positive integer, for each ω ∈ Ω_N,b,m, each site in Λ_N is occupied by at least b particles. In this case it is useful to think of each particle as having one unit of mass and of the set of particles at each site as defining a droplet. With this interpretation, for each configuration ω, denotes the mass or the size of the droplet at site . The jth droplet class has N_j(ω) droplets and mass jN_j(ω). Because the number of sites in Λ_N equals N and the sum of the masses of all the droplet classes equals K, it follows that the quantities N_j(ω) satisfy the two conservation laws in (1) for all ω ∈ Ω_N,b,m.

We now consider the modifications that must be made in these definitions when b = 0. In this case the first constraint in the definition of Ω_N,b,m disappears because we allow sites to be occupied by 0 particles, and therefore N_j(ω) is indexed by . On the other hand, we retain the second constraint in the definition of Ω_N,0,m, which requires that for any configuration ω ∈ Ω_N,0,m at most m of the components N_j(ω) for are positive. When b = 0, the definition of Ω_N,0,m becomes Ω_N,0,m = {ω ∈ Ω_N : |N(ω)|₊ ≤ m = m(N)}. Because the choice b = 0 allows sites to be empty, we lose the interpretation of the set of particles at each site as being a droplet. However, for ω ∈ Ω_N,0,m the two conservation laws in (1) continue to hold.

For the remainder of this paper we work with any fixed nonnegative integer b. The probability measure P_N,b,m defining the droplet model is obtained by restricting the uniform measure P_N to the set Ω_N,b,m. Thus P_N,b,m equals the conditional probability P_N(·∣Ω_N,b,m). For subsets A of Ω_N,b,m, P_N,b,m(A) takes the form

()

Having defined the droplet model, we introduce the random probability measures whose large deviations we will study. For ω ∈ Ω_N,b,m these measures are the number-density measures Θ_N,b that assign to

the probability N_j(ω)/N. This ratio represents the number density of droplet class j. Thus for any subset A of

()

By the two formulas in (1)

and

. Thus Θ_N,b(ω) is a probability measure on

having mean c.

We next introduce several spaces of probability measures that arise in the large deviation analysis of the droplet model. denotes the set of probability measures on . Thus has the form , where the components θ_j satisfy θ_j ≥ 0 and . We say that a sequence of measures in converges weakly to , and write θ⁽ⁿ⁾⇒θ, if, for any bounded function f mapping into , as n → ∞. is topologized by the topology of weak convergence. There is a standard technique for introducing a metric structure on for which we quote the main facts. Because is a complete, separable metric space with metric d(x, y) = |x − y|, there exists a metric π on called the Prohorov metric with the following two properties: (1) convergence with respect to the Prohorov metric is equivalent to weak convergence [32, Thm. 3.3.1]; (2) with respect to the Prohorov metric, is a complete, separable metric space [32, Thm. 3.1.7].

We denote by the set of measures in having mean c. Thus has the form , where the components θ_j satisfy θ_j ≥ 0, , and . The number-density measures Θ_N,b defined in (5) take values in .

According to part (a) of Theorem 4, is not a closed subset of . Hence it is natural to introduce the closure of in . As we prove in part (b) of Theorem 4, the closure of in equals , which is the set of measures in having mean lying in the closed interval [b, c]. Being the closure of the relatively compact, separable metric space , is a compact, separable metric space with respect to the Prohorov metric. This space appears in the formulation of the large deviation upper bound in part (c) of Theorem 1.

We next state Theorem 1, which is the LDP for the sequence of distributions P_N,b,m(Θ_N,b ∈ dθ) on

as N → ∞. The rate function in the LDP is the relative entropy of θ with respect to the Poisson distribution

defined in (7), where each

. Thus any

is absolutely continuous with respect to

. For

the relative entropy of θ with respect to

is defined by

()

If θ_j = 0, then

. For

the components of the measure

appearing in the LDP have the form

()

where α_b(c)∈(0, ∞) is chosen so that

has mean c and Z_b(α_b(c)) is the normalization making

a probability measure; thus

and, for

. As we show in Theorem A.2, there exists a unique value of α_b(c).

As a consequence of the fact that is not closed in , the large deviation upper bound takes two forms depending on whether the subset F of is compact or whether F is closed. When F is compact, in part (b) we obtain the standard large deviation upper bound for F. When F is closed, in part (c) we obtain a variation of the standard large deviation upper bound, which, when F is compact, coincides with the upper bound in part (b). The refinement in part (c) is important. It is applied in the proof of Theorem 2 to show that is the equilibrium distribution of the number-density measures Θ_N,b. In turn, Theorem 2 is applied in the proof of Corollary 3 to show that is the equilibrium distribution of the droplet-size random variables .

In the next theorem we assume that m is the function m(N) appearing in the definition of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. The assumption that m(N) ²/N → 0 is used to control error terms in Lemmas 6 and 7 in the present paper and in Lemma B.3 in [7]. This assumption on m(N) is optimal in the sense that it is a minimal assumption guaranteeing that error terms in parts (a) and (b) of Lemma B.3 in [7] converge to 0. In the next theorem, for A a subset of or we denote by the infimum of over θ ∈ A.

Theorem 1. Fix a nonnegative integer b and a rational number c ∈ (b, ∞). Let m be the function m(N) appearing in the definition of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. Let be the distribution having the components defined in (7). Then as N → ∞, with respect to the measures P_N,b,m, the sequence Θ_N,b satisfies the LDP on with rate function in the following sense.

(a)
maps into [0, ∞] and has compact level sets in ; that is, for any M < ∞ the set is compact.
(b)
For any compact subset F of we have the large deviation upper bound
()
(c)
For any closed subset F of , let denote the closure of F in . We have the large deviation upper bound
()
(d)
For any open subset G of we have the large deviation lower bound
()

The properties of in part (a) are proved in [33, Lem. 1.4.1] and part (a) of Theorem A.1. The basic step in proving the large deviation bounds in parts (b)–(d) is the local large deviation estimate in part (b) of Theorem 5. As explained in Section 4, this local estimate is lifted to large deviation limits involving open balls stated in Theorem 8, which in turn are used to derive the bounds in parts (b)–(d) of Theorem 1.

In the next theorem we use the large deviation upper bound in part (c) of Theorem 1 to prove that the Poisson distribution is the equilibrium distribution of the number-density measures Θ_N,b. In this theorem denotes the complement in of the open ball . denotes the complement in of the open ball .

Theorem 2. One assumes the hypotheses of Theorem 1. The following results hold for any ε > 0.

(a)
The quantity is strictly positive.
(b)
For any number y in the interval (0, x^∗) and all sufficiently large N
()

This upper bound implies that, as N → ∞,

and for any bounded, continuous function g mapping

into

()

These two limits allow us to interpret the Poisson distribution

as the equilibrium distribution of the number-density measures Θ_N,b with respect to P_N,b,m.

Proof. The starting point is the large deviation upper bound in part (c) of Theorem 1 applied to the closed set , which is a subset of . We denote the closure of in by . Since , the large deviation upper bound in part (c) of Theorem 1 takes the form

()

We now prove part (a) of Theorem 2. Since

is lower semicontinuous on

and has compact level sets in

[33, Lem. 1.4.3(b)–(c)], it attains its infimum x^∗ on the closed set

. If x^∗ = 0, then there would exist

such that

. But on

attains its infimum of 0 at the unique measure

[33, Lem. 1.4.1]. This contradicts the fact that

, completing the proof of part (a). The inequality in part (b) is an immediate consequence of part (a) and the large deviation upper bound (13). This inequality yields the limit

, which in turn implies (12). The proof of Theorem 2 is complete.

We now apply Theorem 2 to prove that is also the equilibrium distribution of the random variables , which count the droplet sizes at the sites of Λ_N. This is the content of the next corollary. A fact needed in the proof is that Θ_N,b is the empirical measure of these random variables; that is, for ω ∈ Ω_N,b,m, Θ_N,b(ω) assigns to subsets A of the probability . This representation is valid because both Θ_N,b(ω) and the empirical measure assign to j ∈ Λ_N the probability N_j(ω)/N.

Corollary 3. One assumes the hypotheses of Theorem 1. Then for any site and any

()

Proof. Since the random variables are identically distributed, it suffices to prove the corollary for . For fixed , the limit (12) with g(θ) = θ_j yields

()

This completes the proof.

The last theorem in this section proves several properties of and with respect to the Prohorov metric that are needed in the paper.

Theorem 4. Fix a nonnegative integer b and a real number c ∈ (b, ∞). The metric spaces and have the following properties.

(a)
, the set of probability measures on having mean c, is a relatively compact, separable subset of . However, is not a closed subset of and thus is not a compact subset or a complete metric space.
(b)
, the set of probability measures on having mean lying in the closed interval [b, c], is the closure of in . is a compact, separable subset of .

Proof. (a) For satisfying ξ ≥ b let Ψ_ξ denote the compact subset {b, b + 1, …, ξ} of , and let [Ψ_ξ] ^c denote its complement. For any

()

It follows that

is tight; that is, for any ε > 0 there exists

such that θ([Ψ_ξ] ^c) < ε for all

. Prohorov’s theorem implies that

is relatively compact [32, Thm. 3.2.2]. The separability of

is proved in Corollary B.2 in [7].

We now prove that is not a closed subset of by exhibiting a sequence having a weak limit that does not lie in . Let θ be any measure in with mean β ∈ [b, c); thus . The sequence

()

has the property that

and that

. This completes the proof of part (a).

(b) Since is a separable subset of and is dense in , it follows that is separable. We prove that is the closure of in . Let θ⁽ⁿ⁾ be a sequence in converging weakly to . Since θ⁽ⁿ⁾⇒θ implies that for each , Fatou’s lemma implies that c = liminf_n→∞⁡〈θ⁽ⁿ⁾〉 ≥ 〈θ〉, where 〈θ⁽ⁿ⁾〉 and 〈θ〉 denote the means of θ⁽ⁿ⁾ and θ. Since for any we have 〈θ〉≥b, it follows that c ≥ 〈θ〉≥b. This shows that the closure of in is a subset of . We next prove that is a subset of the closure of in by showing that for any there exists a sequence such that θ⁽ⁿ⁾⇒θ. If 〈θ〉 = c, then we choose θ⁽ⁿ⁾ = θ for all . If 〈θ〉 = β ∈ [b, c), then we use the sequence θ⁽ⁿ⁾ in (17), which converges weakly to θ. We conclude that θ lies in the closure of and thus that is a subset of the closure of in . This completes the proof of part (b). The proof of Theorem 4 is done.

In the next section we present the local large deviation estimate that will be used in Section 4 to prove the LDP for Θ_N,b in Theorem 1.

3. Local Large Deviation Estimate Yielding Theorem 1

The main result needed to prove the LDP in Theorem 1 is the local large deviation estimate stated in part (b) of Theorem 5. The first step is to introduce a set A_N,b,m that plays a central role in this paper. Fix a nonnegative integer b and a rational number c ∈ (b, ∞). Given

define K = Nc and let m be the function appearing in the definition of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. Define

; thus

is the set of nonnegative integers. Let ν be a sequence

for which each

; thus

. We define A_N,b,m to be the set of

satisfying

()

where

. Because

, the two sums involve only finitely many terms.

For ω ∈ Ω_N,b,m the components Θ_N,b;j(ω) of the number-density measure defined in (5) are N_j(ω)/N for , where N_j(ω) denotes the number of sites in Λ_N containing j particles in the configuration ω. We denote by N(ω) the sequence . By definition, for every ω ∈ Ω_N,b,m each site is occupied by at least b particles, and |N(ω)|₊ ≤ m = m(N). It follows that A_N,b,m is the range of N(ω) for ω ∈ Ω_N,b,m; the two sums involving ν_j in (18) correspond to the two sums involving N_j(ω) in (1).

Since the range of N(ω) is A_N,b,m, for ω ∈ Ω_N,b,m the range of Θ_N,b(ω) is the set of probability measures θ_N,b,ν whose components for

have the form θ_N,b,ν;j = ν_j/N for ν ∈ A_N,b,m. By (18) θ_N,b,ν takes values in

, the set of probability measures on

having mean c. It follows that the set

()

is the range of Θ_N,b(ω) for ω ∈ Ω_N,b,m.

In part (b) of the next theorem we state the local large deviation estimate for the event {Θ_N,b = θ_N,b,ν}. In part (a) we introduce the Poisson distribution that appears in the local estimate; is defined in terms of a parameter α_b(c) guaranteeing that it has mean c.

In part (a) of Theorem C.2 in [7] we give the straightforward proof of the existence of α_b(c) for b = 1. The proof of the existence of α_b(c) for general is much more subtle than the proof for b = 1. The proof for general is given in Theorem A.2 in the present paper.

Theorem 5. (a) Fix a nonnegative integer b and a real number c ∈ (b, ∞). For α ∈ (0, ∞) let ρ_b,α be the measure on having components ρ_b,α;j = [Z_b(α)] ⁻¹ · α^j/j! for , where Z_0,α = e^α, and, for , . Then there exists a unique value α_b(c)∈(0, ∞) such that lies in the set of probability measures on having mean c. If b = 0, then α₀(c) = c. If , then α_b(c) is the unique solution in (0, ∞) of αZ_b−1(α)/Z_b(α) = c.

(b) Fix a nonnegative integer b and a rational number c ∈ (b, ∞). Let m be the function m(N) appearing in the definition of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. For any ν ∈ A_N,b,m we define to have the components θ_N,b,ν;j = ν_j/N for . Then

()

is finite because it involves only finitely many components of θ_N,b,ν, and ε_N(ν) → 0 uniformly for ν ∈ A_N,b,m as N → ∞.

We now prove the local large deviation estimate in part (b) of Theorem 5. This proof is based on a combinatorial argument that is reminiscent of and is as natural as the combinatorial argument used to prove Sanov’s theorem for empirical measures defined in terms of i.i.d. random variables having a finite state space [1, §3]. Part (b) of Theorem 5 is proved by analyzing the asymptotic behavior of the product of two multinomial coefficients that we now introduce.

Given ν ∈ A_N,b,m, our goal is to estimate the probability P_N,b,m(Θ_N,b = θ_N,b,ν), where θ_N,b,ν has the components θ_N,b,ν;j = ν_j/N for

. A basic observation is that {ω ∈ Ω_N,b,m : Θ_N,b(ω) = θ_N,b,ν} coincides with

()

It follows that

()

Our first task is to determine the asymptotic behavior of card⁡(Δ_N,b,m;ν). In determining the asymptotic behavior of card⁡(Ω_N,b,m), we will use the fact that Ω_N,b,m can be written as the disjoint union

()

Let ν ∈ A_N,b,m be given. We start by expressing the cardinality of card⁡(Δ_N,b,m;ν) as a product of two multinomial coefficients. For each configuration ω ∈ Δ_N,b,m;ν, K particles are distributed onto the N sites of the lattice Λ_N with j particles going onto ν_j sites for

. We carry this out in two stages. In stage one K particles are placed into N bins, ν_j of which have j particles for

. The number of ways of making this placement equals the multinomial coefficient

. This multinomial coefficient is well-defined since

. Given this placement of K particles into N bins, the number of ways of moving the particles from the bins onto the sites 1,2, …, N of the lattice Λ_N equals the multinomial coefficient

. This second multinomial coefficient is well-defined since

. We conclude that the cardinality of Δ_N,b,m;ν is given by the product of these two multinomial coefficients:

()

Since |ν|₊ ≤ m, at most m of the components ν_j are positive. Such a product of multinomial coefficients is well known in combinatorial analysis [8, Thm. 2.10]. A related version of this formula is derived in Example III.23 of [34]. See also [35, p. 115] and formula (2) in [36, p. 36].

The next two steps in the proof of the local estimate given in part (b) of Theorem 5 are to prove the asymptotic formula for card⁡(Δ_N,b,m;ν) in Lemma 6 and the asymptotic formula for card⁡(Ω_N,b,m) in part (b) of Lemma 7. The proof of Lemma 6 is greatly simplified by a substitution in line 4 of (34). This substitution involves a parameter α ∈ (0, ∞), which, we emphasize, is arbitrary in this lemma. The substitution in line 4 of (34) allows us to express the asymptotic behavior of both card(Δ_N,b,m;ν) in Lemma 6 and card(Ω_N,b,m) in Lemma 7 directly in terms of the relative entropy R(θ_N,b,ν∣ρ_b,α), where ρ_b,α is the probability measure on having the components defined in part (a) of Theorem 5. One of the major issues in the proof of part (b) of Theorem 5 is to show that the arbitrary parameter α appearing in Lemmas 6 and 7 must take the value α_b(c), which is the unique value of α guaranteeing that [Theorem 5(a)]. We show that α must equal α_b(c) after the statement of Lemma 7.

Lemma 6. Fix a nonnegative integer b and a rational number c ∈ (b, ∞). Let α be any real number in (0, ∞), and let m be the function m(N) appearing in the definition of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. We define

()

For any ν ∈ A_N,b,m, we define

to have the components θ_N,b,ν;j = ν_j/N for

. Then

()

The quantity ζ_N(ν) → 0 uniformly for ν ∈ A_N,b,m as N → ∞.

Proof. The proof is based on a weak form of Stirling’s approximation, which states that, for all satisfying N ≥ 2 and for all satisfying 1 ≤ n ≤ N, 1 ≤ log⁡(n!) − (nlog⁡n − n) ≤ 2log⁡N. We summarize the last formula by writing

()

The term denoted by O(log⁡N) satisfies 1 ≤ O(log⁡N) ≤ 2log⁡N.

To simplify the notation, we rewrite (24) in the form card⁡(Δ_N,b,m;ν) = M₁(N, ν) · M₂(K, ν), where M₁(N, ν) denotes the first multinomial coefficient on the right side of (24), and M₂(K, ν) denotes the second multinomial coefficient on the right side of (24). We have

()

The asymptotic behavior of the first term on the right side of the last display is easily calculated. Since ν ∈ A_N,b,m, there are |ν|₊ ∈ {1,2, …, m} positive components ν_j. Because of this restriction on the number |ν|₊ of positive components of ν, we are able to control the error in line 3 of (29). We define . For each j ∈ Ψ_N(ν), since the components ν_j satisfy 1 ≤ ν_j ≤ N, we have log⁡(ν_j!) = ν_jlog⁡ν_j − ν_j + O(log⁡N) for all N ≥ 2. Using the fact that , we obtain

()

where

as N → ∞ and

. By the inequality noted after (27) and the fact that |ν|₊ ≤ m

()

Since (mlog⁡N)/N → 0 as N → ∞, we conclude that

uniformly for ν ∈ A_N,b,m as N → ∞.

We now study the asymptotic behavior of the second term on the right side of (28). Since K = Nc, we obtain for all K ≥ 2

()

where

as N → ∞. The weak form of Stirling’s formula is used to rewrite the term log⁡(K!) in the last display, but not to rewrite the terms log⁡(j!), which we leave untouched.

Substituting (29) and (31) into (28), we obtain

()

In this formula

. As N → ∞,

()

We conclude that ζ_N(ν) → 0 uniformly for ν ∈ A_N,b,m as N → ∞.

Now comes the key step, the purpose of which is to express the sum in the next-to-last line of (32) as the relative entropy R(θ_N,b,ν;j∣ρ_b,α), where α ∈ (0, ∞) is arbitrary. To express the sum in the next-to-last line of (32) as R(θ_N,b,ν∣ρ_b,α), we rewrite the sum as shown in line 4 of the next display:

()

The facts that

and

are used to derive the next-to-last equality. The proof of Lemma 6 is complete.

The next step in the proof of the local large deviation estimate in part (b) of Theorem 5 is to prove the asymptotic formula for card⁡(Ω_N,b,m) stated in part (b) of the next lemma. The proof of this lemma uses Lemma 6 in a fundamental way. After the statement of this lemma we show how to apply it and Lemma 6 to prove part (b) of Theorem 5.

Lemma 7. Fix a nonnegative integer b and a rational number c ∈ (b, ∞). The following conclusions hold:

(a)
lim_N→∞⁡N⁻¹log⁡card⁡(A_N,b,m) = 0.
(b)
Let α be the positive real number in Lemma 6, and let m be the function m(N) appearing in the definition of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. We define f(α, b, c, K) = log⁡Z_b(α) − clog⁡α + clog⁡K − c. Then R(θ∣ρ_b,α) attains its infimum over , and
()

The quantity η_N → 0 as N → ∞.

Before proving Lemma 7, we derive the local large deviation estimate in part (b) of Theorem 5 by applying Lemmas 6 and 7. An integral part of the proof is to show how the arbitrary value of α ∈ (0, ∞) appearing in these lemmas is replaced by the specific value α_b(c) appearing in Theorem 5. As in the statement of part (b) of Theorem 5, let ν be any vector in A_N,b,m and define

to have the components θ_N,b,ν;j = ν_j/N for

. By (22)

()

Substituting the asymptotic formula for log⁡card(Δ_N,b,m;ν) derived in Lemma 6 and the asymptotic formula for log⁡card(Ω_N,b,m) given in part (b) of Lemma 7 yields

()

The error term ε_N(ν) equals ζ_N(ν) − η_N; ζ_N(ν) is the error term in Lemma 6, and η_N is the error term in Lemma 7. As N → ∞, ζ_N(ν) → 0 uniformly for ν ∈ A_N,b,m, and η_N → 0. It follows that ε_N(ν) → 0 uniformly for ν ∈ A_N,b,m as N → ∞.

We now consider the first two terms on the right side of (37). By part (b) of Theorem A.1 applied to

, for any α ∈ (0, ∞)

()

With this step we have succeeded in replacing the relative entropy R(θ_N,b,ν∣ρ_b,α) with respect to ρ_b,α, which appears in Lemma 6, by the relative entropy

with respect to

, which appears in Theorem 5. Substituting the last equation into (37) gives

()

where ε_N(ν) → 0 uniformly for ν ∈ A_N,b,m as N → ∞. This is the conclusion of part (b) of Theorem 5.

We now complete the proof of part (b) of Theorem 5 by proving Lemma 7.

Proof of Lemma 7. (a) We write . By [8, Cor. 2.5] the number of elements in the set indexed by k equals the binomial coefficient C(N − 1, k − 1). Since by assumption m/N → 0 as N → ∞, for all sufficiently large N, the quantities C(N − 1, k − 1) are increasing and are maximal when k = m. Since C(N − 1, k − 1) ≤ C(N, k), it follows that

()

An application of the weak form of Stirling’s formula yields for all m ≥ 2 and all N ≥ m + 2

()

Since m/N → 0 as N → ∞, we conclude that 0 ≤ N⁻¹log⁡card⁡(A_N,b,m) → 0 as N → ∞. This completes the proof of part (a).

(b) The starting point is (23), which states that . For distinct ν ∈ A_N,b,m the sets Δ_N,b,m;ν are disjoint. Hence

()

where

()

It follows from part (a) that δ_N → 0 as N → ∞.

We continue with the estimation of card⁡(Ω_N,b,m). By Lemma 6

()

As proved in Lemma 6,

as N → ∞. Hence by (42)

()

Under the assumption that R(·∣ρ_b,α) attains its infimum over

, we define

()

In the last two paragraphs of this proof, we show that η_N → 0 as N → ∞. Given this fact, the last equation yields the asymptotic formula (35) in part (b).

We now prove that η_N → 0 as N → ∞. To do this, we use (45) to write

()

Like the second and third terms on the right side, the first term on the right side is nonnegative because A_N,b,m is a subset of

. Since

and δ_N → 0 as N → ∞, it will follow that η_N → 0 if we can show that R(·∣ρ_b,α) attains its infimum over

and that

()

We now prove (48). R(·∣ρ_b,α) is lower semicontinuous on [33, Lem. 1.4.3(b)] and thus on . Since R(·∣ρ_b,α) has compact level sets in [Theorem A.1(a)], it attains its infimum over at some measure θ^∗. We apply Theorem B.1 in [7] to θ = θ^∗, obtaining a sequence θ^(N) with the following properties: (1) for , θ^(N) ∈ B_N,b,m has components for , where ν^(N) is an appropriate sequence in A_N,b,m; (2) θ^(N)⇒θ^∗ as N → ∞; (3) R(θ^(N)∣ρ_b,α) → R(θ^∗∣ρ_b,α) as N → ∞. The limit in (48) follows from the inequalities

()

and the limit

as N → ∞. This completes the proof of Lemma 7 and thus the proof of the local estimate in part (b) of Theorem 5.

In the next section we explain how the local large deviation estimate in part (b) of Theorem 5 yields the LDP in Theorem 1.

4. Proof of Theorem 1 from Part (b) of Theorem 5

In Theorem 1 we state the LDP for the sequence Θ_N,b of number-density measures. This sequence takes values in , which is the set of probability measures on having mean c ∈ (b, ∞). The purpose of the present section is to explain how the local large deviation estimate in part (b) of Theorem 5 yields the LDP for Θ_N,b. All details appear in Section 4 of [7]. The basic idea is first to prove the large deviation limit for Θ_N,b lying in open balls in and in other subsets defined in terms of open balls and then to use this large deviation limit to prove the LDP in Theorem 1.

In Theorem 8 we state the large deviation limit for open balls and other subsets defined in terms of open balls. Two types of open balls are considered. Let θ be a measure in , and take r > 0. Part (a) states the large deviation limit for open balls , where π denotes the Prohorov metric on . This limit is used to prove the large deviation upper bound for compact subsets of in part (b) of Theorem 1 and the large deviation lower bound for open subsets of in part (d) of Theorem 1. Now let θ be a measure in . Part (b) states the large deviation limit for sets of the form , where . This limit is used to prove the large deviation upper bound for closed subsets in part (c) of Theorem 1. If , then , and the conclusions of parts (a) and (b) of the next theorem coincide.

Theorem 8. Fix a nonnegative integer b and a rational number c ∈ (b, ∞). Let m be the function m(N) appearing in the definitions of Ω_N,b,m in (3) and satisfying m(N) → ∞ and m(N) ²/N → 0 as N → ∞. The following conclusions hold:

(a)
Let θ be a measure in and take r > 0. Then for any open ball B_π(θ, r) in , is finite, and one has the large deviation limit
()
(b)
Let θ be a measure in and take r > 0. Then the set is nonempty, is finite, and one has the large deviation limit
()

We prove Theorem 8 by applying the local large deviation estimate in part (b) of Theorem 5. A key step is to approximate probability measures in B_π(θ, ε) and in by appropriate sequences of probability measures in the range of Θ_N,b. This procedure allows one to show in part (a) that the infimum can be approximated by the infimum of over θ lying in the intersection of B_π(θ, ε) and the range of Θ_N,b; a similar statement holds for the infimum in part (b). A set of hypotheses that allow one to carry out this approximation procedure is given in Theorem 4.2 in [7], a general formulation that yields Theorem 8 as a special case.

Theorem 1 states the LDP for the number-density measures Θ_N,b. In order to complete the proof of Theorem 1, we must lift the large deviation limits in Theorem 8 to the large deviation upper bound for compact sets and for closed sets and the large deviation lower bound for open sets. The large deviation lower bound for open sets is immediate from the limit in part (a). To prove the large deviation upper bound for compact sets, we cover the compact set by open balls and use the limit in part (a); the large deviation upper bound for closed sets follows by a similar procedure involving part (b). The details of this procedure are carried out as an application of general formulation in Theorem 4.3 in [7].

In the Appendix we prove two properties of the relative entropy and prove the existence of the quantity α_b(c) appearing in part (a) of Theorem 5.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The research of Shlomo Ta’asan is supported in part by a grant from the National Science Foundation (NSF-DMS-1216433). Richard S. Ellis thanks Jonathan Machta for sharing his insights into statistical mechanics and Michael Sullivan for his generous help with a number of topological issues arising in this paper. Both authors thank the referee for a careful reading of the paper and for suggesting a number of references.

Appendix

Properties of Relative Entropy and Existence of α_b(c)

We fix a nonnegative integer b and a real number c ∈ (b, ∞). Given θ a probability measure on , the mean of θ is denoted by 〈θ〉. In Theorem A.1 we present two properties of the relative entropy R(θ∣ρ_b,α) and for θ in each of the following three spaces, which are introduced in Section 2: , the set of probability measures on ; , the set of satisfying 〈θ〉 = c; and , the set of satisfying 〈θ〉∈[b, c].

We recall that, for α ∈ (0, ∞), ρ_b,α denotes the Poisson distribution on having components ρ_b,α;j = [Z_b(α)] ⁻¹ · α^j/j! for , where Z₀(α) = e^α, and, for , . According to part (a) of Theorem 5 there exists a unique value α = α_b(c) for which ; thus lies in . In Theorem A.2 we prove the existence of α_b(c). In part (a) of the next theorem we show that R(θ∣ρ_b,α) has compact level sets in , , and . After the statement of Lemma 7 we use part (b) of the next theorem to show that the arbitrary parameter α in Lemmas 6 and 7 must have the value α_b(c).

Theorem A.1. Fix a nonnegative integer b and a real number c ∈ (b, ∞). For any α ∈ (0, ∞) the relative entropy has the following properties:

(a)
R(·∣ρ_b,α) has compact level sets in , , and .
(b)
For any , .

Proof. (a) The fact that has compact level sets in is proved in part (c) of Lemma 1.4.3 in [33]. Since is a compact subset of [Theorem 4(d)], R(·∣ρ_b,α) also has compact level sets in . Because is not a closed subset of [Theorem 4(a)], the proof that R(·∣ρ_b,α) has compact level sets in is more subtle. If θ⁽ⁿ⁾ is any sequence in satisfying R(θ⁽ⁿ⁾∣ρ_b,α) ≤ M < ∞, then since and R(·∣ρ_b,α) has compact level sets in , there exist and a subsequence such that and R(θ∣ρ_b,α) ≤ M. To complete the proof that R(·∣ρ_b,α) has compact level sets in , we must show that ; that is, 〈θ〉 = c. By Fatou’s lemma . In addition, for any w ∈ (0, ∞)

()

Lemma 5.1 in [37] shows that the sequence

is uniformly integrable, implying that

[32, Appendix, Prop. 2.3]. This completes the proof that R(·∣ρ_b,α) has compact level sets in

. The proof of part (a) is finished.

(b) We define g(α, b, c) = log⁡Z_b(α) − clog⁡α − (log⁡Z_b(α_b(c)) − clog⁡α_b(c)). Step 1 is to prove that for any

()

For any

we have

and

. Hence

()

Since the last two lines equal

, the proof of (A.2) is complete. Step 2 is to prove that R(θ∣ρ_b,α) attains its infimum over

at the measure

, and

()

Given these two assertions part (b) of the theorem follows by substituting

into (A.2).

We now prove the two assertions in Step 2. R(·∣ρ_b,α) is lower semicontinuous on [33, Lem. 1.4.3(b)] and thus on . Since R(·∣ρ_b,α) has compact level sets in , it attains its infimum over . The relative entropy attains its minimum value of 0 over at the unique measure [33, Lem. 1.4.1]. Hence (A.2) implies that the minimum value of R(·∣ρ_b,α) over equals

()

The last equality follows by applying (A.2) with

. This display shows that R(·∣ρ_b,α) attains its infimum over

and yields (A.4). The proof of part (b) is finished, completing the proof of the theorem.

We now prove that there exists a unique value of α_b(c) for which . The conclusion of the next theorem is part (a) of Theorem C.1 in [7]. In part (b) of that theorem we derive two sets of bounds on α_b(c) and use these bounds to show that α_b(c) is asymptotic to c as c → ∞. In part (d) of Theorem C.1 in [7] we make precise the relationship between and a Poisson random variable having parameter α_b(c).

Theorem A.2. Fix a nonnegative integer b and a real number c ∈ (b, ∞). There exists a unique value α_b(c)∈(0, ∞) such that lies in the set of probability measures on having mean c. If b = 0, then α₀(c) = c. If , then α_b(c) is the unique solution in (0, ∞) of αZ_b−1(α)/Z_b(α) = c.

According to this theorem, for , α_b(c) is the unique solution of αZ_b−1(α)/Z_b(α) = c. The heart of the proof of Theorem A.2, and its most subtle step, is to prove that the function γ_b(α) = αZ_b−1(α)/Z_b(α) satisfies for α ∈ (0, ∞) and thus is monotonically increasing on this interval. This fact is proved in the next lemma.

Lemma A.3. Fix a positive integer b and a real number c ∈ (b, ∞). For α ∈ (0, ∞) the function γ_b(α) = αZ_b−1(α)/Z_b(α) satisfies .

Proof. For and for α ∈ (0, ∞), we have . Thus . The key to proving that is to represent log⁡Z_b(α) in terms of the moment generating function of a probability measure. We do this by first expressing Z_b(α) in terms of the upper incomplete gamma function via the formula . As suggested in [38], we now make the change of variables x = yα, obtaining the representation

()

The function g_b is the moment generating function of the probability measure on

having the density h_b(y) = b(−y)^b−1 on [−1,0]. For α ∈ (0, ∞) let σ_b,α be the probability measure on

having the density e^αyh_b(y)/g_b(α) on [−1,0]. A straightforward calculation shows that

()

It follows that

for all α ∈ (0, ∞).

Using (A.6) and the formulas and , we calculate

()

This completes the proof of the lemma.

We are now ready to prove Theorem A.2.

Proof of Theorem A.2. We first consider b = 0. In this case ρ_0,α is a standard Poisson distribution on having mean α. It follows that α₀(c) = c is the unique value for which has mean c and thus lies in . This completes the proof for b = 0.

We now consider . In this case ρ_b,α is a probability measure on having mean

()

Thus ρ_b,α has mean c if and only if α satisfies γ_b(α) = c, where γ_b(α) = αZ_b−1(α)/Z_b(α). We prove the theorem by showing that γ_b(α) = c has a unique solution α_b(c)∈(0, ∞) for all

and any c > b. This assertion is a consequence of the following three steps:

; (2) lim_α→∞⁡γ_b(α) = ∞; (3) for all α ∈ (0, ∞),

. Steps 1 and 2 follow immediately from the definition of γ_b(α), and Step 3 is proved in Lemma A.3.

We have proved the theorem for all . Since we also validated the conclusion of the theorem for b = 0, the proof for all nonnegative integers b is done.

References

1 Ellis R. S. and Ta′asan S., The Boltzmann-Sanov large deviation principle and applications to statistical mechanics, 2014, 48 pages, http://people.math.umass.edu/~rsellis/pdf-files/boltzmann-sanov-applications.pdf.
Google Scholar
2 Randall D., Combinatorial Methods for Statistical Physics Models, Special Topics Course, 1999, http://people.math.gatech.edu/~randall/topics2.html.
Google Scholar
3 Baxter R. J., Exactly Solved Models in Statistical Mechanics, 1982, Academic Press, New York, NY, USA.
Google Scholar
4 McCoy B. and Wu T. T., The Two-Dimensional Ising Model, 1973, Harvard University Press, Cambridge, Mass, USA.
10.4159/harvard.9780674180758
Google Scholar
5 Dembo A. and Zeitouni O., Large Deviations Techniques and Applications, 1998, 2nd edition, Springer, New York, NY, USA, https://doi.org/10.1007/978-1-4612-5320-4.
10.1007/978-1-4612-5320-4
Google Scholar
6 Ellis R. S., Entropy, Large Deviations, and Statistical Mechanics, 1985, Springer, New York, NY, USA, Classics of Mathematics.
10.1007/978-1-4613-8533-2
Google Scholar
7 Ellis R. S. and Ta′asan S., Detailed large deviation analysis of a droplet model having a Poisson equilibrium distribution, http://arxiv.org/abs/1405.5091v4.
Google Scholar
8 Charalambides C. A., Enumerative Combinatorics, 2002, Chapman & Hall, CRC Press, Boca Raton, Fla, USA.
Google Scholar
9 Comtet L., Advanced Combinatorics: The Art of Finite and Infinite Expansions, 1974, D. Reidel Publishing Company, Dordrecht, The Netherlands, Translated by: J. W. Nienhuys.
10.1007/978-94-010-2196-8
Google Scholar
10 Bender E. A., Central and local limit theorems applied to asymptotic enumeration, Journal of Combinatorial Theory, Series A. (1973) 15, no. 1, 91–111, https://doi.org/10.1016/0097-3165(73)90038-1, 2-s2.0-0015896876.
10.1016/0097-3165(73)90038-1
Google Scholar
11 Boucheron S., Gamboa F., and Léonard C., Bins and balls: large deviations of the empirical occupancy process, Annals of Applied Probability. (2002) 12, no. 2, 607–636, https://doi.org/10.1214/aoap/1026915618, 2-s2.0-0036338068.
10.1214/aoap/1026915618
Google Scholar
12 Dupuis P., Nuzman C., and Whiting P., Large deviation asymptotics for occupancy problems, Annals of Probability. (2004) 32, no. 3, 2765–2818, https://doi.org/10.1214/009117904000000135, 2-s2.0-4544279028.
10.1214/009117904000000135
Google Scholar
13 Duran J., Sands, Powders, and Grains: An Introduction to the Physics of Granular Materials, 2000, Springer, New York, NY, USA, Translated by Axel Reisinger.
10.1007/978-1-4612-0499-2
Google Scholar
14 Mugele R. A. and Evans H. D., Droplet size distribution in sprays, Industrial & Engineering Chemistry. (1951) 43, no. 6, 1317–1324, https://doi.org/10.1021/ie50498a023.
10.1021/ie50498a023
CAS Web of Science® Google Scholar
15 Sellens R. W. and Brzustowski T. A., A prediction of the drop size distribution in a spray from first principles, Atomisation and Spray Technology. (1985) 1, 89–102.
Google Scholar
16 Ladd A. J. C. and Hoover W. G., Energy and entropy of interacting dislocations, Physical Review B. (1982) 26, no. 10, 5469–5479, https://doi.org/10.1103/physrevb.26.5469, 2-s2.0-1542652778.
10.1103/PhysRevB.26.5469
CAS Web of Science® Google Scholar
17 Datta A. and Som S. K., Effects of spray characteristics on combustion performance of a liquid fuel spray in a gas turbine combustor, International Journal of Energy Research. (1999) 23, no. 3, 217–228, 2-s2.0-0033541072.
10.1002/(SICI)1099-114X(19990310)23:3<217::AID-ER473>3.0.CO;2-U
CAS Web of Science® Google Scholar
18 Miller C. O. M., A mathematical model of aerial deposition of pesticides from aircraft, Environmental Science & Technology. (1980) 14, no. 7, 824–831, https://doi.org/10.1021/es60167a016, 2-s2.0-0019044310.
10.1021/es60167a016
Google Scholar
19 Alderliesten M., Mean particle diameters. Part I: evaluation of definition systems, Particle & Particle Systems Characterization. (1990) 7, no. 4, 233–241, https://doi.org/10.1002/ppsc.19900070138, 2-s2.0-0025535349.
10.1002/ppsc.19900070138
CAS Web of Science® Google Scholar
20 Alderliesten M., Mean particle diameters. Part II: standardization of nomenclature, Particle & Particle Systems Characterization. (1991) 8, no. 3, 237–241, https://doi.org/10.1002/ppsc.19910080143, 2-s2.0-0026220008.
10.1002/ppsc.19910080143
CAS Web of Science® Google Scholar
21 Li X., Tankin R. S., and Renksizbulut M., Calculated characteristics of droplet size and velocity distributions in liquid sprays, Particle & Particle Systems Characterization. (1990) 7, no. 2, 54–59, https://doi.org/10.1002/ppsc.19900070111, 2-s2.0-0025067730.
10.1002/ppsc.19900070111
CAS Web of Science® Google Scholar
22 Ma L. and Hanson R. K., Measurement of aerosol size distribution functions by wavelength-multiplexed laser extinction, Applied Physics B: Lasers and Optics. (2005) 81, no. 4, 567–576, https://doi.org/10.1007/s00340-005-1913-3, 2-s2.0-24144457522.
10.1007/s00340-005-1913-3
CAS Google Scholar
23 Megoulas N. C. and Koupparis M. A., Twenty years of evaporative light scattering detection, Critical Reviews in Analytical Chemistry. (2005) 35, no. 4, 301–316, https://doi.org/10.1080/10408340500431306, 2-s2.0-30144444641.
10.1080/10408340500431306
CAS Web of Science® Google Scholar
24 Schegk C.-D., Loeffler F., and Umhauer H., Drop-size distributions produced by flat-spray nozzles, Particle & Particle Systems Characterization. (1986) 3, no. 1, 14–19, https://doi.org/10.1002/ppsc.19860030104, 2-s2.0-0022696220.
10.1002/ppsc.19860030104
CAS Web of Science® Google Scholar
25 Dumouchel C., A new formulation of the maximum entropy formalism to model liquid spray drop-size distribution, Particle & Particle Systems Characterization. (2007) 23, no. 6, 468–479, https://doi.org/10.1002/ppsc.200500989, 2-s2.0-34247481070.
10.1002/ppsc.200500989
Google Scholar
26 Ellis R. S., T. Dauxois, S. Ruffo, and L. F. Cugliandolo, The theory of large deviations and applications to statistical mechanics, Long-Range Interacting Systems: Les Houches 2008 Session XC, 2010, Oxford University Press, New York, NY, USA, 227–277, http://people.math.umass.edu/~rsellis/pdf-files/Les-Houches-paper.pdf.
Google Scholar
27 Ellis R. S. and Wang K., Limit theorems for the empirical vector of the Curie-Weiss-Potts model, Stochastic Processes and Their Applications. (1990) 35, no. 1, 59–79, https://doi.org/10.1016/0304-4149(90)90122-9, 2-s2.0-38249020780.
10.1016/0304-4149(90)90122-9
Web of Science® Google Scholar
28 Costeniuc M., Ellis R. S., and Touchette H., Complete analysis of phase transitions and ensemble equivalence for the Curie-Weiss-Potts model, Journal of Mathematical Physics. (2005) 46, 063301, https://doi.org/10.1063/1.1904507, 2-s2.0-21844464303.
10.1063/1.1904507
Web of Science® Google Scholar
29 Ellis R. S., Otto P. T., and Touchette H., Analysis of phase transitions in the mean-field Blume-Emery-Griffiths model, Annals of Applied Probability. (2005) 15, no. 3, 2203–2254, https://doi.org/10.1214/105051605000000421, 2-s2.0-23744470029.
10.1214/105051605000000421
Web of Science® Google Scholar
30 Ellis R. S., The theory of large deviations: from Boltzmann′s 1877 calculation to equilibrium macrostates in 2D turbulence, Physica D: Nonlinear Phenomena. (1999) 133, no. 1–4, 106–136, https://doi.org/10.1016/s0167-2789(99)00101-3, 2-s2.0-0002719686.
10.1016/S0167-2789(99)00101-3
Web of Science® Google Scholar
31 Boucher C., Ellis R. S., and Turkington B., Derivation of maximum entropy principles in two-dimensional turbulence via large deviations, Journal of Statistical Physics. (2000) 98, no. 5-6, 1235–1278, https://doi.org/10.1023/A:1018671813486, ZBL0966.76039, 2-s2.0-0034339675.
10.1023/A:1018671813486
Web of Science® Google Scholar
32 Ethier S. N. and Kurtz T. G., Markov Processes: Characterization and Convergence, 1986, John Wiley & Sons, New York, NY, USA.
10.1002/9780470316658
Google Scholar
33 Dupuis P. and Ellis R. S., A Weak Convergence Approach to the Theory of Large Deviations, 1997, John Wiley & Sons, New York, NY, USA.
10.1002/9781118165904
Google Scholar
34 Flajolet P. and Sedgewick R., Analytic Combinatorics, 2009, Cambridge University Press, Cambridge, UK, https://doi.org/10.1017/cbo9780511801655.
10.1017/CBO9780511801655
Web of Science® Google Scholar
35 Johnson N. L. and Kotz S., Urn Models and Their Application: An Approach to Modern Discrete Probability Theory, 1977, John Wiley & Sons, New York, NY, USA.
Google Scholar
36 Kolchin V. F., Sevast′yanov B. A., and Chistyakov V. P., Random Allocations, 1978, V. H. Winston & Sons, Washington, DC, USA, edited by: A. V. Balakrishnan.
Google Scholar
37 Donsker M. D. and Varadhan S. R. S., Asymptotic evaluation of certain Markov process expectations for large time—III, Communications on Pure and Applied Mathematics. (1976) 29, no. 4, 389–461, https://doi.org/10.1002/cpa.3160290405.
10.1002/cpa.3160290405
Web of Science® Google Scholar
38 Neuman E., Inequalities and bounds for the incomplete gamma function, Results in Mathematics. (2013) 63, no. 3-4, 1209–1214, https://doi.org/10.1007/s00025-012-0263-9, 2-s2.0-84878382371.
10.1007/s00025-012-0263-9
Google Scholar

All articles

Large Deviation Analysis of a Droplet Model Having a Poisson Equilibrium Distribution

Abstract

1. Introduction

2. Definition of Droplet Model and Main Theorem

3. Local Large Deviation Estimate Yielding Theorem 1

4. Proof of Theorem 1 from Part (b) of Theorem 5

Conflict of Interests

Acknowledgments

Appendix

Properties of Relative Entropy and Existence of α_b(c)

References

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Large Deviation Analysis of a Droplet Model Having a Poisson Equilibrium Distribution

Abstract

1. Introduction

2. Definition of Droplet Model and Main Theorem

3. Local Large Deviation Estimate Yielding Theorem 1

4. Proof of Theorem 1 from Part (b) of Theorem 5

Conflict of Interests

Acknowledgments

Appendix

Properties of Relative Entropy and Existence of αb(c)

References

References

Related

Information

Properties of Relative Entropy and Existence of α_b(c)