International Journal of Stochastic Analysis

Volume 2011, Issue 1 605068

Research Article

Open Access

Nonconservative Diffusions on [0, 1] with Killing and Branching: Applications to Wright-Fisher Models with or without Selection

Corresponding Author

Thierry E. Huillet

[email protected]

Laboratoire de Physique Théorique et Modélisation, CNRS-UMR 8089 et Université de Cergy-Pontoise, 2 Avenue Adolphe Chauvin, 95302 Cergy-Pontoise, France u-cergy.fr

Search for more papers by this author

Thierry E. Huillet,

Corresponding Author

Thierry E. Huillet

[email protected]

Laboratoire de Physique Théorique et Modélisation, CNRS-UMR 8089 et Université de Cergy-Pontoise, 2 Avenue Adolphe Chauvin, 95302 Cergy-Pontoise, France u-cergy.fr

Search for more papers by this author

First published: 18 July 2011

https://doi.org/10.1155/2011/605068

Academic Editor: Manuel O. Cáceres

Share a link

Email
Wechat
Bluesky

Abstract

We consider nonconservative diffusion processes x_t on the unit interval, so with absorbing barriers. Using Doob-transformation techniques involving superharmonic functions, we modify the original process to form a new diffusion process presenting an additional killing rate part d > 0. We limit ourselves to situations for which is itself nonconservative with upper bounded killing rate. For this transformed process, we study various conditionings on events pertaining to both the killing and the absorption times. We introduce the idea of a reciprocal Doob transform: we start from the process , apply the reciprocal Doob transform ending up in a new process which is x_t but now with an additional branching rate b > 0, which is also upper bounded. For this supercritical binary branching diffusion, there is a tradeoff between branching events giving birth to new particles and absorption at the boundaries, killing the particles. Under our assumptions, the branching diffusion process gets eventually globally extinct in finite time. We apply these ideas to diffusion processes arising in population genetics. In this setup, the process x_t is a Wright-Fisher diffusion with selection. Using an exponential Doob transform, we end up with a killed neutral Wright-Fisher diffusion . We give a detailed study of the binary branching diffusion process obtained by using the corresponding reciprocal Doob transform.

1. Introduction

We consider diffusion processes on the unit interval with a series of elementary stochastic models arising chiefly in population dynamics in mind. These connections found their way over the last sixty years, chiefly in mathematical population genetics. In this context, we refer to [1] and to its extensive and nonexhaustive list of references for historical issues in the development of modern mathematical population genetics (after Wright, Fisher, Crow, Kimura, Nagylaki, Maruyama, Ohta, Watterson, Ewens, Kingman, Griffiths, and Tavaré, to cite only a few). See also the general monographs [2–6].

Special emphasis is put on Doob-transformation techniques of the diffusion processes under concern. Most of the paper′s content focuses on the specific Wright-Fisher (WF) diffusion model and some of its variations, describing the evolution of one two-locus colony undergoing random mating, possibly under the additional actions of mutation, selection, and so on. We now describe the content of this work in more detail.

Section 2 is devoted to generalities on one-dimensional diffusions on the unit interval [0, 1]. It is designed to fix the background and notations. Special emphasis is put on the Kolmogorov backward and forward equations, while stressing the crucial role played by the boundaries in such one-dimensional diffusion problems. Some questions such as the meaning of speed and scale functions, existence of an invariant measure, and validity of detailed balance are addressed in the light of the Feller classification of boundaries. When the boundaries are absorbing, the important problem of evaluating additive functionals along sample paths is then briefly discussed, emphasizing the prominent role played by the Green function of the model; several simple illustrative examples are supplied. So far, we have dealt with a given process, say x_t, and recalled the various ingredients for computing the expectations of various quantities of interest, summing up over the history of paths. In this setup, there is no distinction among paths with different destinations, nor did we allow for annihilation or creation of paths inside the domain before the process reached one of the boundaries. The Doob transform of paths allows to do so. We, therefore, describe the transformation of sample paths techniques deriving from superharmonic additive functionals. Some Doob transformations of interest are then investigated, together with the problem of evaluating additive functionals of the transformed diffusion process itself. Roughly speaking, the transformation of paths procedure allows to select sample paths of the original process with, say, a fixed destination and/or, more generally, to kill certain sample paths that do not fit the integral criterion encoded by the additive functional. As a result, this selection of paths procedure leads to a new process described by an appropriate modification of the infinitesimal generator of the original process including a multiplicative killing part rate of the sample paths inside the interval. It turns out, therefore, that the same diffusion methods used in the previous discussions apply to the transformed processes, obtained after a change of measure.

Let us be more specific. In this work, we limit ourselves to nonconservative diffusion processes x_t on the unit interval and so with absorbing barriers. Using Doob-transformation techniques involving superharmonic functions α, we modify the original process to form a new diffusion process presenting an additional killing rate part d > 0. We further limit ourselves to situations for which is itself nonconservative with bounded above killing rate. For a large class of diffusion processes, the exponential function or some linear combinations of exponential functions are admissible superharmonic functions α, leading to the required property on d. The full transformed process has two stopping times: the time to absorption to the boundaries and the killing time inside the domain. We study various conditionings of the transformed process: conditioning on events leading to both random stopping times occurring after the current time or only in the remote future and conditioning on events leading to either killing or absorption time occurring first. We give the relevant quasistationary limit laws, in the spirit of Yaglom [7]. This is made possible thanks to the existence of an harmonic function for the full infinitesimal generator of the transformed process.

We next introduce the idea of a reciprocal Doob transform: we start now from the process , apply the reciprocal Doob transform ending up in a new process which is x_t but now with an additional branching rate b > 0, which is bounded. Under this reciprocal technique, the particles are not killed, rather they are allowed either to survive or split. The transformed process is a binary branching diffusion. For this supercritical binary branching diffusion process, there is a tradeoff between branching events giving birth to new particles and absorption at the boundaries, killing the particles. Under our assumptions, the branching diffusion process gets eventually globally extinct in finite time.

We next apply these general ideas to diffusion processes arising in population genetics.

In Section 3 we start recalling that Wright-Fisher diffusion models with various drifts are continuous space-time models which can be obtained as scaling limits of a biased discrete Galton-Watson model with a conservative number of offspring over the generations. Sections 4 and 5 are devoted to a detailed study of both the neutral WF diffusion process and the WF diffusion with selection, respectively.

In Section 6, we apply the Doob-transformation techniques to these processes: The starting point process x_t is a Wright-Fisher diffusion with selection differential σ > 0. We use the exponential Doob kernel α = e^−σx. The transformed process accounts for a neutral Wright-Fisher evolution for the allele 1 frequency, subject to the additional possibility of the extinction of the population itself due to killing at rate d proportional to its heterozygosity. This model is of importance in population genetics as it first appeared in [8, Page 272] as a scaling limit of a discrete population genetics model of recombination. We particularize the relevant Yaglom limit laws obtained after conditionings on events pertaining to both the killing or the absorption times occurring first. The computations of the quasistationary distributions are explicit here. Our approach relies on the spectral expansion of the transition probability kernels of both x_t and which are known (from the works of Kimura) to involve oblate spheroidal wave functions and Gegenbauer polynomials, respectively.

In Section 7, we follow the general reciprocal path indicated in Section 2 and apply it to the particular models under concern, thereby illustrating and developing the idea of a reciprocal Doob transform. We give a detailed study of the binary branching diffusion process obtained by using the corresponding reciprocal Doob transform e^σx when the starting point process is now a neutral Wright-Fisher diffusion process. We end up in a globally subcritical branching particle system, each diffusing according to the WF model with selection. This problem is amenable to the results obtained in [9, 10].

2. Diffusion Processes on The Unit Interval: A Reminder

We start with generalities on one-dimensional diffusions exemplifying our study to the Wright-Fisher model and its relatives. For more technical details, we refer to [8, 11–13].

2.1. Generalities on One-Dimensional Diffusions on the Interval [0,1]

Let (w_t; t ≥ 0) be a standard one-dimensional Brownian (Wiener) motion. Consider a one-dimensional Itô diffusion driven by (w_t; t ≥ 0) on the interval say [0,1]; see [14]. We will let I = (0,1). Assume that it has locally Lipschitz continuous drift (x) and local standard deviation (volatility) g(x), namely, consider the stochastic differential equation (SDE)

()

The condition on f(x) and g(x) guarantees in particular that there is no point x_* in I for which |f(x)| or |g(x)| would blow up and diverge as |x − x_*| → 0.

The Kolmogorov backward infinitesimal generator of (2.1) is

. As a result, for all suitable ψ in the domain of the operator S_t : = e^tG,

satisfies the Kolmogorov backward equation (KBE)

()

In the definition of the mathematical expectation u, we have t∧τ_x : = inf (t, τ_x), where τ_x indicates a random time at which the process should possibly be stopped (absorbed), given the process was started in x. The description of this (adapted) absorption time is governed by the type of boundaries which {0,1} are to (x_t; t ≥ 0).

2.2. Natural Coordinate, Scale, and Speed Measure

For such Markovian diffusions, it is interesting to consider the G-harmonic coordinate φ ∈ C² belonging to the kernel of G that is, satisfying G(φ) = 0. For φ and its derivative φ^′ : = dφ/dy, with (x₀, y₀) ∈ (0,1), one finds

()

One should choose a version of φ satisfying φ^′(y) > 0, y ∈ I. The function φ kills the drift f of (x_t; t ≥ 0) in the sense that considering the change of variable y_t = φ(x_t),

()

The driftless diffusion (y_t; t ≥ 0) is often termed the diffusion in natural coordinates with state-space [φ(0), φ(1)]. Its volatility is

. The function φ is often called the scale function.

Whenever φ(0) > −∞ and φ(1) < +∞, one can choose the integration constants defining φ(x) so that

()

with φ(0) = 0 and φ(1) = 1. In this case, the state-space of (y_t; t ≥ 0) is again [0,1], the same as for (x_t; t ≥ 0).

Finally, considering the random time change t → θ_t with inverse: θ → t_θ defined by

and

()

the novel diffusion

is easily checked to be identical in law to a standard Brownian motion. Let now δ_y(·) = weak-lim _ε↓0(1/2ε) 1(· ∈ (y − ε, y + ε)) stand for the Dirac delta mass at y. The random time θ_t can be expressed as

()

where m(x) : = 1/(g²φ^′)(x) is the (positive) speed density at x = φ⁻¹(y) and

the local time at y of the Brownian motion before time t. Both the scale function φ and the speed measure dμ = m(x) · dx are, therefore, essential ingredients to reduce the original stochastic process (x_t; t ≥ 0) to the standard Brownian motion (w_t; t ≥ 0). Indeed, it follows from the above arguments that if

, then

is a Brownian motion. The Kolmogorov backward infinitesimal generator G may be written in Feller form

()

Examples 2.2 (from population genetics). (i) Assume that f(x) = 0 and g²(x) = x(1 − x). This is the neutral Wright-Fisher (WF) model discussed at length later. This diffusion is already in natural scale and φ(x) = x, m(x) = [x(1−x)]⁻¹. The speed measure is not integrable.

(ii) With u₁, u₂ > 0, assume f(x) = u₁ − (u₁ + u₂)x and g²(x) = x(1 − x). This is the Wright-Fisher model with mutation. The parameters u₁, u₂ can be interpreted as mutation rates. The drift vanishes when x = u₁/(u₁ + u₂) which is an attracting point for the dynamics. Here,,, with φ(0) = −∞ and φ(1) = +∞ if u₁, u₂ > 1/2. The speed measure density is and so is always integrable.

(iii) With σ ∈ R, assume a model with quadratic logistic drift f(x) = σx(1 − x) and local variance g²(x) = x(1 − x). This is the WF model with selection. For this diffusion (see [15]), φ(x) = ((1 − e^−2σx)/(1 − e^−2σ)) and m(x) ∝ [x(1−x)]⁻¹e^2σx are not integrable. Here, σ is a selection or fitness parameter. We shall return at length to this model and its neutral version later.

2.3. The Transition Probability Density

Assume that f(x) and g(x) are now differentiable in I. Let then p(x; t, y) stand for the transition probability density function of x_t at y given x₀ = x. Then, p : = p(x; t, y) is the smallest solution to the Kolmogorov forward (Fokker-Planck) equation (KFE)

()

where

is the adjoint of G (G^* acts on the terminal value y, whereas G acts on the initial value x). The way one can view this PDE depends on the type of boundaries that {0,1} are.

We will next suppose that the boundaries ∘: = 0 or 1 are both exit (or absorbing) boundaries. From the Feller classification of boundaries, this will be the case if for all y₀ ∈ (0,1)

()

where a function f(y) ∈ L₁(y₀, ∘) if

In this case, a sample path of (x_t; t ≥ 0) can reach ∘ from the inside of I in finite time but cannot reenter. The sample paths are absorbed at ∘. There is an absorption at ∘ at time τ_x,∘ = inf (t > 0 : x_t = ∘|x₀ = x) and P(τ_x,∘ < ∞) = 1. Whenever both boundaries {0,1} are absorbing, the diffusion x_t should be stopped at τ_x : = τ_x,0∧τ_x,1. Would none of the boundaries {0,1} be absorbing, then τ_x = +∞, which we rule out.

Examples of diffusion with exit boundaries are WF model and WF model with selection. In the WF model including mutations, the boundaries are entrance boundaries and so are not absorbing.

When the boundaries are absorbing, p(x; t, y) is a subprobability. Letting , we clearly have ρ_t(x) = P(τ_x > t). Such models are nonconservative.

For one-dimensional diffusions, the transition density p(x; t, y) is reversible with respect to the speed density ([8, Chapter 15, Section 13]) and so detailed balance holds

()

The speed density m(y) satisfies G^*(m) = 0. It may be written as a Gibbs measure with density: m(y) ∝ (1/g²(y))e^−U(y), where the potential function U(y) reads

()

and with the measure dy/g²(y) standing for the reference measure.

Further, if p(s, x; t, y) is the transition probability density from (s, x) to (t, y), s < t, then −∂_sp = G(p), with terminal condition p(t, x; t, y) = δ_y(x) and so p(s, x; t, y) also satisfies the KBE when looking at it backward in time. The Feller evolution semigroup being time homogeneous, one may as well observe that with p : = p(x; t, y), operating the time substitution t − s → t, p itself solves the KBE

()

In particular, integrating over y, ∂_tρ_t(x) = G(ρ_t(x)), with ρ₀(x) = 1(x ∈ (0,1)).

p(x; t, y) being a sub-probability, we may define the normalized conditional probability density q(x; t, y) : = p(x; t, y)/ρ_t(x), now with total mass 1. We get

()

The term b_t(x) : = −∂_tρ_t(x)/ρ_t(x) > 0 is the time-dependent birth rate at which mass should be created to compensate the loss of mass of the original process due to absorption of (x_t; t ≥ 0) at the boundaries. In this creation of mass process, a diffusing particle started in x dies at rate b_t(x) at point (t, y), where it is duplicated in two new independent particles both started at y (resulting in a global birth) evolving in the same diffusive way (consider a diffusion process with forward infinitesimal generator G^* governing the evolution of p(x; t, y). Suppose that a sample path of this process has some probability that it will be killed or create a new copy of itself and that the killing and birth rates d and b depend on the current location y of the path. Then, the process with the birth and death opportunities of a path has the infinitesimal generator λ(y) · +G^*(·), where λ(y) = b(y) − d(y). The rate can also depend on t and x). The birth rate function b_t(x) depends here on x and t, not on y.

When the boundaries of x_t are absorbing, the spectra of both −G and −G^* are discrete (see [8, Page 330]): There exist positive eigenvalues

ordered in ascending sizes and eigenvectors

of both −G^* and −G satisfying −G^*(v_k) = λ_kv_k and −G(y_k) = λ_ku_k such that with

and

, the spectral representation

()

holds.

Let λ₁ > λ₀ = 0 be the smallest nonnull eigenvalue of the infinitesimal generator −G^* (and of −G). Clearly,

and by L′ Hospital rule, therefore,

. Putting ∂_tq = 0 in the latter evolution equation, independently of the initial condition x

()

where v₁ is the eigenvector of −G^* associated to λ₁, satisfying −G^*v₁ = λ₁v₁. The limiting probability v₁/norm (after a proper normalization) is called the Yaglom limit law of (x_t; t ≥ 0) conditioned on being currently alive at all time t (see [7]).

2.4. Additive Functionals Along Sample Paths

Let (x_t; t ≥ 0) be the diffusion model defined by (2.1) on the interval I, where both endpoints are assumed absorbing (exit). This process is, thus, transient and nonconservative. We wish to evaluate the nonnegative additive quantities

()

where the functions c and d are both assumed nonnegative on I and ∂I = {0,1}. The functional α(x) ≥ 0 solves the Dirichlet problem

()

and α is a superharmonic function for G, satisfying −G(α) ≥ 0.

Some Examples. (1) Assume that c = 1 and d = 0: here, α = E(τ_x) is the mean time of absorption (average time spent in (0,1) before absorption), solution to

()

(2) Whenever both {0,1} are exit boundaries, it is of interest to evaluate the probability that x_t first hits [0,1] (say) at 1, given x₀ = x. This can be obtained by choosing c = 0 and d(∘) = 1(∘ = 1).

Let then α = :α₁(x) = P(x_t first hits [0,1] at 1∣x₀ = x). α₁(x) is a G-harmonic function solution to G(α₁) = 0, with boundary conditions α₁(0) = 0 and α₁(1) = 1. Solving this problem, we get

()

On the contrary, choosing α₀(x) to be a G-harmonic function with boundary conditions α₀(0) = 1 and α₀(1) = 0, α₀(x) = P(x_t first hits [0,1] at 0∣x₀ = x) = 1 − α₁(x).

(3) Let y ∈ I and put c = (1/2ε) 1 (x ∈ (y − ε, y + ε)) and d = 0. As ε → 0, c converges weakly to δ_y(x) and, is the Green function, solution to

()

𝔤 is, therefore, the mathematical expectation of the local time at y, starting from x (the sojourn time density at y). The solution is known to be (see [8, page 198] or [5, page 280])

()

The Green function is of particular interest to solve the general problem of evaluating additive functionals α(x). Indeed, as is well known, see [8], for example, the integral operator with respect to the Green kernel inverts the second-order operator −G leading to

()

Under this form, α(x) appears as a potential function and all potential function is superharmonic. Note that for all harmonic function h ≥ 0 satisfying −G(h) = 0,

()

is again superharmonic because −G(α_h) = c ≥ 0.

(4) Also of interest are the additive functionals of the type

()

where the functions c and d are again both assumed to be nonnegative. The functional α_λ(x) ≥ 0 solves the Dynkin problem, [8]

()

involving the action of the resolvent operator (λI−G)⁻¹ on c.

Whenever c(x) = δ_y(x), d = 0, then

()

is the λ-potential function, solution to

()

𝔤_λ is, therefore, the mathematical expectation of the exponentially damped local time at y, starting from x (the temporal Laplace transform of the transition probability density from x to y at t), with g₀ = g. Then, it holds that

()

The λ-potential function is also useful in the computation of the distribution of the first-passage time τ_x,y to y starting from x. From the convolution formula,

()

and taking the Laplace transform of both sides with respect to time, we obtain the Laplace-Stieltjes transform (LST) of the law of τ_x,y as

()

We have P(τ_x,y < ∞) = 𝔤(x, y)/(𝔤(y, y)) ∈ (0,1) as a result of both terms in the ratio being finite and x, y belonging to the same transience class of the process (under our assumptions that the boundaries are absorbing). Note that from the reversibility property

()

2.5. Transformation of Sample Paths (Doob-Transform) and Killing

In the preceding subsections, we have dealt with a given process and recalled the various ingredients for the expectations of various quantities of interest, summing over the history of paths. In this setup, there is no distinction among paths with different destinations nor did we allow for annihilation or creation of paths inside the domain before the process reached one of the boundaries. The Doob transform of paths allows to do so.

Consider a one-dimensional diffusion (x_t; t ≥ 0) as in (2.1) with absorbing barriers. Let p(x; t, y) be its transition probability, and let τ_x be its absorption time at the boundaries.

Let

be a nonnegative additive functional solving

()

Recall the functions c and d are both chosen nonnegative so that so is α.

Define a new transformed stochastic process

by its transition probability

()

In this construction of

through a change of measure, sample paths of (x_t; t ≥ 0) for which α(y) is large are favored. This is a selection of paths procedure due to Doob (see [11]).

Now, the KFE for

clearly is

, with p(x; 0, y) = δ_y(x) and

. The Kolmogorov backward operator of the transformed process is, therefore, by duality

()

Developing, with α^′(x) : = dα(x)/dx and

, we get

()

and the new KB operator can be obtained from the latter by adding a drift term (α^′/α)g²∂_x to the one in G of the original process to form a new process

with the KB operator

and by killing its sample paths at death rate d(x) : = (c/α)(x) (provided c ≠ 0). Note that

()

In others words, with

, the novel time-homogeneous SDE to consider is

()

possibly killed at rate d = c/α as soon as c ≠ 0. Whenever

is killed, it enters conventionally into some coffin state {∂} added to the state-space. Let

be the new absorption time at the boundaries of

started at x (with

would the boundaries be inaccessible to the new process

which we ruled out). Let

be the killing time of

started at x (the hitting time of ∂), with

if c = 0. Then,

is the novel stopping time of

. The SDE for

, together with its global stopping time

characterize the new process

with generator

to consider.

In the sequel, we shall limit ourselves to the cases for which the following additional conditions hold on the transformed process.

(i) Nonconservativeness of x̃t. We will next suppose that the boundaries ∘: = 0 or 1 are both exit (or absorbing) boundaries for the new process in (2.38). From the Feller criterion for exit boundaries, this will be the case if for all y₀ ∈ (0,1)

()

where

is the new speed measure density for

and

its scale function. Recalling

and

, we have

()

So, we assume here that

obeys itself a nonconservative diffusion.

(ii) Boundedness of the Killing Rate d. In some examples, the killing rate d = −G(α)/α is bounded above. For example, suppose that the drift of the diffusion process (x_t; t ≥ 0) is bounded above by f_* = max _x(f(x)) > 0. (If the drift of (x_t; t ≥ 0) is bounded below by f_* < 0, we are led to the same conclusions while considering the process 1 − x_t instead of x_t.) Then, choosing α(x) = e^−ax, a > 0, − G(α) = (af − (a²/2)g²)α < af_*α. Thus, d = −G(α)/α is bounded above by af_*. Because −G(α) = c ≥ 0, all this makes sense if, for all x, af(x) − (a²/2)g²(x) ≥ 0 or −∂U : = 2f/g² ≥ a (the opposite of the gradient of the potential function U in (2.12) is bounded below).

Let (a_k; k ≥ 1) be a nonincreasing sequence of [0,1]-valued real numbers. Let (α_k; k ≥ 1) be a sequence of nonnegative real numbers such that for all x ∈ (0,1)

()

Whenever f is bounded above and, for all x, 2f/g² ≥ a₁, we have

()

Thus, d = −G(α)/α is bounded above by a₁f_*.

Therefore, for a large class of diffusion processes, the exponential function or some linear combinations of exponential functions are superharmonic functions α, leading to a bounded above killing rate d = −G(α)/α.

2.6. Normalizing and Conditioning

Because the transformed process is nonconservative, it is of interest to inspect various conditionings in the sense of Yaglom, [7].

(i) Consider again the process with infinitesimal generator

losing mass due to killing and/or absorption at the boundaries. Integrating over y, with

, we have

()

with

. This gives the tail distribution of the full stopping time

Defining the conditional probability density

, now with total mass 1, with

, we get

()

The term

is the rate at which mass should be created to compensate the loss of mass of the process

due to its possible absorption at the boundaries and/or killing. Again, we have

, where λ₁ is the smallest positive eigenvalue of −G, and therefore, putting

in the latter evolution equation, we get that independently of the initial condition x

()

where

is the solution to

()

With v₁ the eigenvector of −G^* associated to λ₁,

is of the product form

()

where

. This results directly from the fact that

and that v₁ is the stated eigenvector of −G^*. A different way to see this is as follows. We have

()

and the conditional density of

given

is, therefore,

()

The rest follows from observing that, to the leading order in t in (2.15), for large time

()

where u₁ (v₁, resp.) is the eigenvector of −G (−G^*, resp.) associated to λ₁ and

. From this, it is clear that

and

()

The limiting probability

norm can, therefore, be interpreted as the Yaglom limit law of

conditioned on the event

(ii) Under our assumptions, in the transformation of paths process, the transformed process can both be absorbed at the boundaries and be killed. So, both and are finite with positive probability. We wish to understand the processes conditioned on the events or , (see [16]).

The probability mass cumulated at the boundaries {0,1} by time t clearly is [17]

()

As t → ∞, this probability tends to

. Note that

()

Now (assuming x ≠ {0,1}),

()

Thus, β is defined by

()

[or

], with boundary conditions β(0) = β(1) = 1. It serves as a positive harmonic function for

. This is a Sturm-Liouville problem to be solved for each case study.

The density of the process

conditioned on the event

()

The density of the process

conditioned on the event

()

Note that

()

and, with

()

as required, because this is the probability that

is neither in {0,1} nor in state ∂ at time t.

Note also that ] are the transition probability densities of conditioned on the event [of conditioned on the event ]. They are the Yaglom limits of both conditioned processes.

The backward infinitesimal generators of both processes with transition probability densities

and

are, respectively, given by

()

We get, respectively,

()

Thus, in

, there is no multiplicative part (no killing) and a shift in the drift, showing that the associated conditioned process

obeys the SDE

()

with drift

()

This process is ultimately absorbed at {0,1}.

, there is a killing multiplicative part which is enhanced d/(1 − β) > d and a shift in the drift, showing that the associated conditioned process

exhibits a faster killing rate, but the drift shift guarantees that

is not absorbed at the boundaries. We have

()

Additive Functionals of the Transformed Process. for the new process , it is also of interest to evaluate additive functionals along their own sample paths. Let then be such an additive functional where the functions and are themselves both nonnegative. It solves

()

Then, recalling the expression of the Green function 𝔤(x, y) of (x_t; t ≥ 0) in (2.22), we find explicitly

()

Specific transformations of interest. (i) The case c = 0 deserves a special treatment. Indeed, in this case, and so , the absorption time for the process governed by the new SDE (2.38). Here, . Assuming α solves −G(α) = 0 if x ∈ I with boundary conditions α(0) = 0 and α(1) = 1 (α(0) = 1 and α(1) = 0, resp.), the new process is just (x_t; t ≥ 0) conditioned on exiting at x = 1 (at x = 0, resp.). In the first case, the boundary 1 is exit, whereas 0 is entrance; α reads

()

with

()

giving the new drift. In the second case, α(x)=

, and the boundary 0 is exit, whereas 1 is entrance. Thus,

is just the exit time at x = 1 (at x = 0, resp.). Let

. Then,

solves

, whose explicit solution is

()

in terms of 𝔤(x, y), the Green function of (x_t; t ≥ 0).

Example 2.1. Consider the WF model on [0,1] with selection for which, with σ ∈ R, f(x) = σx(1 − x) and g²(x) = x(1 − x). Assume that α solves −G(α) = 0 if x ∈ (0,1) with α(0) = 0 and α(1) = 1; one gets, α(x) = (1 − e^−2σx)/(1 − e^−2σ). The diffusion corresponding to (2.38) has the new drift: , independently of the sign of σ. It models the WF diffusion with selection conditioned on exit at ∘ = 1.

(ii) Assume that α now solves −G(α) = 1 if x ∈ I with boundary conditions α(0) = α(1) = 0. In this case study, one selects sample paths of (x_t; t ≥ 0) with a large mean absorption time α(x) = E(τ_x). Sample paths with large sojourn time in I are favored. We have

()

where 𝔤(x, y) is the Green function (2.22). The boundaries of

are now both entrance boundaries and so

is not absorbed at the boundaries. The stopping time

is just its killing time

. Let

. Then,

solves

, with explicit solution

()

(iii) Assume that α now solves −G(α) = δ_y(x) if x ∈ I with boundary conditions α(0) = α(1) = 0. In this case study, one selects sample paths of (x_t; t ≥ 0) with a large sojourn time density at y recalling . The stopping time of occurs at rate δ_y(x)/𝔤(x, y). It is a killing time when the process is at y for the last time after a geometrically distributed number of passages there with rate 1/𝔤(x, y) (or with success probability 1/(1 + 𝔤(x, y))). Let . Then, solves , with explicit solution

()

when

may be viewed as the age of a mutant currently observed to present frequency y, see [18].

The Green function at y₀ ∈ (0,1) of the transformed process is solution to . It takes the simple form

()

(iv) Let λ₁ be the smallest non-null eigenvalue of the infinitesimal generator G. Let α = u₁ be the corresponding eigenvector, that is, satisfying, −Gu₁ = λ₁u₁ with boundary conditions u₁(0) = u₁(1) = 0. Then, c = λu₁. The new KB operator associated to the transformed process is

()

obtained while killing the sample paths of the process

governed by

at constant death rate d = λ₁. The transition probability of the transformed stochastic process

()

Define

. It is the transition probability of the process

governed by

; it corresponds to the original process (x_t; t ≥ 0) conditioned on never hitting the boundaries {0,1} (the so-called Q-process of (x_t; t ≥ 0), see [19]). It is simply obtained from (x_t; t ≥ 0) by adding the additional drift term

to f, where u₁ is the eigenvector of G associated to its smallest non-null eigenvalue. The determination of α = u₁ is a Sturm-Liouville problem. When t is large, to the dominant order

()

where v₁ is the Yaglom limit law of (x_t; t ≥ 0). Therefore

()

Thus, the limit law of the Q-process

is the normalized Hadamard product of the eigenvectors u₁ and v₁ associated, respectively, to G and G^*. On the other hand, the limit law of

is directly given by

()

where Z is the appropriate normalizing constant. Comparing (2.77) and (2.78)

()

The eigenvector v₁ associated to G^* is, therefore, equal to the eigenvector u₁ associated to G times the speed density of (x_t; t ≥ 0).

When dealing for example with the neutral Wright-Fisher diffusion, it is known that λ₁ = 1 with u₁ = x(1 − x) and v₁ ≡ 1 (see Section 4.3, example (ii)). The Q-process in this case obeys

()

with the stabilizing drift toward 1/2:

.The limit law of the Q-process

in this case is 6y(1 − y). The latter conditioning is more stringent than the Yaglom conditioning and so the limiting law has more mass away from the boundaries (compare with the uniform Yaglom limit). For additional similar examples in the context of WF diffusions and related ones, see [20].

2.7. Branching and the Reciprocal Doob Transform

Clearly, starting from the killed diffusion process with infinitesimal generator

and applying the reciprocal Doob transform defined by

()

leads to

. Indeed,

()

because

()

Note that

This suggests that starting from a diffusion process with infinitesimal generator

(without its killing part) and applying the reciprocal Doob transform

, one ends up with a modified process whose infinitesimal generator is

()

where

and

()

is now a pure birth rate. Note that

is now a subharmonic function for

because

Let

. Because G(α(x)) = −α(x)b(x), we have

()

and so

is harmonic for

Doob-transforming

using

, we get

()

which is the infinitesimal generator of the original diffusion process.

Under our assumptions, both process x_t and

with infinitesimal generators G and

are nonconservative diffusion processes with absorbing barriers. Further, b(x) > 0 is bounded above. Therefore, b(x) may be written as

()

where b_* = max _xb(x) > 0 and μ(x) ∈ [1,2].

The process with infinitesimal generator is now a pure binary branching diffusion process. For this class of models, an initial particle started at x obeys a diffusion process with infinitesimal generator G, absorbed when it hits the boundaries. At some random (mean b_*) exponential time, this particle dies, giving birth in the process to a random number M(x) (either 1 or 2) of daughter particles started where the mother particle died and diffusing independently as their mother did and so forth for the subsequent generation particles. We have EM(x) = μ(x).

The process with infinitesimal generator is, thus, a branching diffusion with supercritical binary splitting mechanism (μ(x) > 1). There is, therefore, a competition between the branching phenomenon that leads to an exponential increase of the number of particles in the system and the absorption at the boundaries of the living particles.

Let N_t(x) be the global number of particles which are alive in the system at each time t, descending from an Eve particle started at x, and let

()

be the global extinction time of the population. Under our assumptions, this branching model fits to the general formalism for branching diffusion developed in ([9, 10]) from which we conclude

()

uniformly in x. This means the global extinction of the particle system under concern: In the tradeoff between branching and absorption at the boundaries, the system gets eventually extinct with probability 1 in finite time. We shall develop a typical example arising in population genetics in the subsequent sections.

3. The Wright-Fisher Example

In this section, we briefly and informally recall that the celebrated WF diffusion process with or without a drift may be viewed as a scaling limit of a simple two alleles discrete space-time branching process preserving the total number N of individuals in the subsequent generations (see [8, 12, 21] for example).

3.1. The Neutral Wright-Fisher Model

Consider a discrete-time Galton Watson branching process preserving the total number of individuals in each generation. We start with N individuals. The initial reproduction law is defined as follows: let

and k_N : = (k₁, …, k_N) be integers. Assume that the first-generation random offspring numbers ν_N : = (ν_N(1), …, ν_N(N)) admit the following joint exchangeable polynomial distribution on the discrete simplex |k_N| = N:

()

This distribution can be obtained by conditioning N independent Poisson distributed random variables on summing to N. Assume subsequent iterations of this reproduction law are independent so that the population is with constant size for all generations.

Let N_r(n) be the offspring number of the n first individuals at the discrete generation r ∈ N₀ corresponding to (say) allele A₁ (the remaining number N − N_r(n) counts the number of alleles A₂ at generation r). This sibship process is a discrete-time Markov chain with binomial transition probability given by

()

Assume next that n = [Nx], where x ∈ (0,1). Then, as well known, the dynamics of the continuous space-time rescaled process x_t : = N_[Nt](n)/N, t ∈ R₊ can be approximated for large N, to the leading term in N⁻¹, by a Wright-Fisher-Itô diffusion on [0,1] (the purely random genetic drift case)

()

Here, (w_t; t ≥ 0) is a standard Wiener process. For this scaling limit process, a unit laps of time t = 1 corresponds to a laps of time N for the original discrete-time process, thus time is measured in units of N. If the initial condition is x = N⁻¹, x_t is the diffusion approximation of the offspring frequency of a singleton at generation [Nt].

Equation (3.3) is a one-dimensional diffusion as in (2.1) on [0,1], with zero drift f(x) = 0 and volatility . This diffusion is already in natural coordinate, and so φ(x) = x. The scale function is x and the speed measure [x(1−x)]⁻¹dx. One can check that both boundaries are exit in this case: the stopping time is τ_x = τ_x,0∧τ_x,1 where τ_x,0 is the extinction time and τ_x,1 the fixation time. The corresponding infinitesimal generators are and .

3.2. Nonneutral Cases

Two alleles Wright-Fisher models (with non-null drifts) can be obtained by considering the binomial transition probabilities bin(N, p_N)

()

where

()

is now some state-dependent probability (which is different from the identity x) reflecting some deterministic evolutionary drift from the allele A₁ to the allele A₂. For each r, we have

()

which is amenable to a diffusion approximation in terms of x_t : = N_[Nt](n)/N, t ∈ R₊ under suitable conditions.

For instance, taking p_N(x) = (1 − π_2,N)x + π_1,N(1 − x), where (π_1,N, π_2,N) are small (N-dependent) mutation probabilities from A₁ to A₂ (A₂ to A₁, resp.). Assuming that , leads after scaling to the drift of WF model with positive mutations rates (u₁, u₂).

Taking

()

where s_i,N > 0 are small N-dependent selection parameter satisfying

, leads, after scaling, to the WF model with selective drift σx(1 − x), where σ : = σ₁ − σ₂. Essentially, the drift f(x) is a large N approximation of the bias: N(p_N(x) − x). The WF diffusion with selection is thus

()

where time is measured in units of N. Letting θ_t = Nt define a new time scale with inverse t_θ = θ/N, the time-changed process y_θ = x_θ/N now obeys the SDE

()

with a small diffusion term. Here, s = s₁ − s₂ and time θ is the usual time clock.

The WF diffusion with selection (3.8) tends to drift to ∘ = 1 (∘ = 0, resp.) if allele A₁ is selectively advantageous over A₂:σ₁ > σ₂ (σ₁ < σ₂, resp.) in the following sense: if σ > 0 (<0, resp.), the fixation probability at ∘ = 1, which is [15]

()

increases (decreases) with σ taking larger (smaller) values. Putting x = 1/N, the fixation probability at 1 of an allele A₁ mutant is of order: 2σ/N; see [15].

4. The Neutral WF Model

In this section, we particularize the general ideas developed in the introductory Section 2 to the neutral WF diffusion (3.3) and draw some straightforward conclusions most of which are known which illustrate the use of Doob transforms.

4.1. Explicit Solutions of the Neutral KBE and KFE

As shown by Kimura in [22], it turns out that both Kolmogorov equations are exactly solvable, in this case, using spectral theory. Indeed, the solutions involve a series expansion in terms of eigenfunctions of the KB and KF infinitesimal generators with discrete eigenvalues spectrum. We now consider the specific neutral WF model.

With z ∈ (−1,1), let (P_k(z); k ≥ 0) be the degree-(k + 1) Gegenbauer polynomials solving with ; we let P₀(z) : = (1 − z)/2. When k ≥ 1, we have P_k(±1) = 0 and so P_k(z) = (1 − z²)Q_k(z), where Q_k(z) is a polynomial with degree k − 1 satisfying Q_k(−1) = (−1)^k−1 and Q_k(1) = 1. With x ∈ (0,1), let (u_k(x); k ≥ 0) be defined by: u_k(x) = P_k(1 − 2x). These polynomials clearly constitute a system of eigenfunctions for the KB operator with eigenvalues λ_k = (k(k + 1))/2, k ≥ 0, thus with −G(u_k(x)) = λ_ku_k(x). In particular, u₀(x) = x, u₁(x) = x − x², u₂(x) = x − 3x² + 2x³, u₃(x) = x − 6x² + 10x³ − 5x⁴, u₄(x) = x − 10x² + 30x³ − 35x⁴ + 14x⁵, …. With k ≥ 1, we have u_k(0) = u_k(1) = 0 and and .

The eigenfunctions of the KF operator are given by v_k(y) = m(y) · u_k(y), k ≥ 0, where the Radon measure of weights m(y)dy is the speed measure: m(y)dy = dy/(y(1 − y)), for the same eigenvalues. For instance, v₀(y) = 1/(1 − y), v₁(y) = 1, v₂(y) = 1 − 2y, v₃(y) = 1 − 5y + 5y², v₄(y) = 1 − 9y + 21y² − 14y³, ….

Although λ₀ = 0 really constitutes an eigenvalue, only v₀(y) is not a polynomial. When k ≥ 1, from their definition, the u_k(x) polynomials satisfy u_k(0) = u_k(1) = 0 in such a way that v_k(y) = m(y) · u_k(y), k ≥ 1 is a polynomial with degree k − 1.

Let

. We note that

if j ≠ k and the system u_k(x); k ≥ 1 is a complete orthogonal set of eigenvectors. Therefore, for any square-integrable function ψ(x) ∈ L₂([0,1], m(y)dy) admitting a decomposition in the basis u_k(x), k ≥ 1.

()

where ψ(x) = ∑_k≥1c_ku_k(x). This series expansion solves the KBE: ∂_tu = G(u); u(x, 0) = ψ(x) where u = u(x, t) : = Eψ(x_t).

Moreover, the transition probability density p(x; t, y) of the neutral WF models admits the spectral expansion

()

Starting from x, the cumulated probability masses by time t at the exit boundaries {0,1} are, respectively, (see [17])

()

which tend as t → ∞ toward the extinction and fixation probabilities, namely, here P(τ_x,0 < ∞) = P(τ_x,0 < τ_x,1) = 1 − x and P(τ_x,1 < ∞) = x. Because v_k(0) = 1 and v_k(1) = (−1)^k−1, we get the identities

()

leading to the relationship ∑_k≥1((−1)^k−1b_k/2λ_k) = 1.

The series expansion for p(x; t, y) solves the KFE of the WF model. The transition density p(x; t, y) is reversible with respect to the speed density since for 0 < x, y < 1

()

The measures v_k(y)dy, k ≥ 1 are not probability measures because the v_k(y) are not necessarily positive over [0,1]. This decomposition is not a mixture. We have

the 2-norm for the weight function m. We notice that

so that c₀ = b₀ = 0 although λ₀ = 0 is indeed an eigenvalue, the above sums should be started at k = 1 (expressing the lack of an invariant measure for the WF model as a result of its absorption at the boundaries).

We have

and so

()

is the exact tail distribution of the absorption time.

Since v₁(y) = 1, to the leading order in t, for large time

()

which is independent of y. Integrating over y, ρ_t(x) : = P(τ_x > t) ~ 6e^−t · x(1 − x) so that the conditional probability

()

is asymptotically uniform in the Yaglom limit. As time passes by, given absorption did not occur in the past,

x_∞ (as t → ∞) which is a uniformly distributed random variable on [0,1].

4.2. Additive Functionals for the Neutral WF

Let (x_t; t ≥ 0) be the WF diffusion model defined by (3.3) on the interval I = [0,1], where both endpoints are absorbing (exit). We wish to evaluate the additive quantities

()

where functions c and d are both nonnegative. With

, α(x) solves

()

Take c = lim _ε↓0(1/2ε)1(x ∈ (y − ε, y + ε)) = :δ_y(x) and d = 0, when y ∈ I: in this case, α : = 𝔤(x, y) is the Green function. The solution takes the simple form

()

The Green function solves the above general problem of evaluating additive functionals α(x)

()

As a Few Examples (1) Let c = 1 and d = 0: here, α(x) = E(τ_x) is the mean time of absorption (average time spent in I before absorption). The solution is (the Crow and Kimura formula, see [2])

()

(2) Let c = 0 and d(∘) = 1(∘ = 1). Let α(x) = P(x_t first hits [0,1] at 1 | x₀ = x). Then, α(x) is a G-harmonic function solution to G(α) = 0, with boundary conditions α(0) = 0 and α(1) = 1. The solution for WF model is: α(x) = x. Stated differently, x = P(τ_x,1 < τ_x,0) is the probability that the exit time at ∘ = 1 is less than the one at ∘ = 0, starting from x.

On the contrary, choosing α(x) to be a G-harmonic function with boundary conditions α(0) = 1 and α(1) = 0, α(x) = P(x_t first hits [0,1] at 0 | x₀ = x) = 1 − x. Thus, 1 − x = P(τ_x,0 < τ_x,1).

(3) Let c(x_s) = 2x_s(1 − x_s) measure the heterozygosity of the WF process at time s and assume d(0) = d(1) = 1. A remarkable thing is that the average heterozygosity over the sample paths is

()

which is the initial heterozygosity of the population.

4.3. Transformation of WF Sample Paths, [3]

With p(x; t, y) the transition probability density of WF model, define a new α-transformed stochastic process

by its transition probability

()

(i) Conditioning WF on exit at some boundary. Assume first α solves −G(α) = 0 with boundary conditions α(0) = 0 and α(1) = 1; hence, α reads α(x) = x. In this case,

(no killing), and so

is the absorption time for a process

governed by a new SDE with a drift term. The new process

is just (x_t; t ≥ 0) conditioned on exiting at ∘ = 1. The boundary 1 is exit whereas 0 is entrance. Thus, the model for

becomes

now with linear drift

and

. Its transition probability is

()

where the subscript 1 indicates that this is the conditional transition probability of sample paths whose exit is necessarily at the boundary 1.

Assuming now α solves −G(α) = 0 if x ∈ I with boundary conditions α(0) = 1 and α(1) = 0, the new process

is just (x_t; t ≥ 0) conditioned on exiting at x = 0. Boundary 0 is exit, whereas 1 is entrance; in this case, α is α(x) = 1 − x. Thus, the model for

becomes

with

and

. Its transition probability is

()

where the subscript 0 indicates that this is the conditional transition probability of WF sample paths whose exit now is at ∘ = 0. Recalling that, starting from x, (x_t; t ≥ 0) gets absorbed at ∘ = 1 (0, resp.) with probability x (1 − x, resp.), we recover that

()

Using the solution to KFE for p, we obtain an expression for both

and

, simply by premultiplying it by the corresponding right factor. Integrating the results over y, we get the conditional tail distributions of the exit times at ∘ = 1 or 0, given the exit is at ∘ = 1 or 0.

Exploiting the large time behavior of p(x; t, y), to the first order in t, we get

()

Integrating over y,

and

are the large time behaviors of the absorption times at 1 and 0, respectively. Using this, we get the large time behaviors of the conditional probabilities

()

where we recognize the densities of specific beta-distributed random variables. Specifically, we conclude that as time passes by, given absorption occurs at ∘ = 1 and given it has not occurred in the past,

beta(2,1) distribution on [0,1]. Similarly, given absorption occurs at ∘ = 0 and given it has not occurred previously,

beta(1,2) distribution on [0,1].

In the previously displayed formula,

is just the exit time at ∘ = 1 (at ∘ = 0, resp.) of the conditional transformed WF diffusions. Let

. Then, with

solves

, whose explicit solution is

()

in terms of 𝔤(x, y), the Green function of (x_t; t ≥ 0). For the WF model conditioned on exit at ∘ = 1 (0, resp.), we find, respectively, the Kimura and Ohta′s formulae in [23]

()

This result could have been guessed by observing that

is the expected absorption time of the original WF model. When x → 0⁺, (resp., x → 1⁻), it takes an average time 2 to reach 1 (0, resp.) for the WF model conditioned on exit at ∘ = 1 (0, resp.).

(ii) Selection of WF sample paths with large heterozygosity. Assume that α now solves −G(α) = 2x(1 − x) if x ∈ I with boundary conditions α(0) = α(1) = 0. Then, α = 2x(1 − x) and this α is the right eigenvector of −G associated to the smallest positive eigenvalue λ₁ = 1 of the neutral WF model. In this case study, one selects sample paths of (x_t; t ≥ 0) with large heterozygosity. The dynamics of

in (2.38) is

()

subject to a constant killing rate 1. The boundaries of

are now both entrance boundaries and so

is not absorbed at the boundaries. The stopping time

is just its killing time

which is mean 1 exponentially distributed, independently of the starting point x. Indeed,

()

recalling x(1 − x)v_k(x) = u_k(x) and observing

if k ≥ 2.

As time passes, killing of

occurs, and given killing will never occur in the future,

a random variable with density 6y(1 − y) on [0,1] which is a beta(2,2) density. In this selection of paths procedure, the conditional density of

given

is indeed

, where

. Therefore,

()

Recalling u₁(x) = x(1 − x), v₁(y) = 1 and b₁ = 6, we get

, regardless of the initial condition x. This is the beta(2,2) limit law of the Q-process of the neutral WF diffusion.

(iii) Selection of WF sample paths with large sojourn time density at y. Assume now that α solves −G(α) = δ_y(x) if x ∈ I and so α(x) = :𝔤(x, y). Using the Green function of the neutral WF model, the transition probability density of

()

Thus, given x < y (x > y),

coincides with (x_t; t ≥ 0) conditioned to exit in 1 (0 , resp.) killed at rate δ_y(x) when it passes through y, necessarily at some time.

The stopping time of is just its killing time when the process is at y for the last time with a geometrically number of passages at y with rate 1 (or success probability 1/2).

5. The WF Model with Selection

Now, we focus on the diffusion process (3.8). Let

be the Gegenbauer eigen-polynomials of the KF operator corresponding to the neutral WF diffusion (3.3) and so with eigenvalues λ_k = k(k + 1)/2, k ≥ 1. Define the oblate spheroidal wave functions on [0,1] as

()

where

obey the three-term recurrence defined in [24]. In the latter equality, the l summation is over odd (even) values if k is even (odd).

Define and where m(x) = e^2σx/(x(1 − x)) is the speed measure density of the WF model with selection (3.8).

The system

constitute a system of eigenfunctions for the WF with selection generators −G and −G^* with eigenvalues

implicitly defined in [24], thus with

and

. The eigenfunction expansion of the transition probability density of the WF model with selection is thus, [25],

()

where

. The WF model with selection can be viewed as a perturbation problem of the neutral WF model (see [3]). There exist perturbation developments of

around λ_k with respect to σ², [25]. They are valid and useful for small σ.

The WF diffusion process x_t with selection (3.8) is nonconservative, with finite hitting time τ_x of one of the boundaries. Following the general arguments developed in Section 2, the Yaglom limit of x_t conditioned on τ_x > t is the normalized version of

()

The limit law of x_t conditioned on never hitting the boundaries in the remote future is the normalized version of

()

Because the latter conditioning is more stringent than the former, the probability mass of (5.4) is more concentrated inside the interval than (5.3).

6. From the WF Model with Selection to the Neutral WF Model: Doob Transform and Killing

We shall consider the following transformation of paths for the WF model with selection. Consider the Wright-Fisher diffusion with selection (3.8): , x₀ = x ∈ (0,1). For this model, and both boundaries are exit.

Assume that σ > 0 so that the drift term is bounded above by f_* = σ/4, together with 2f/g² being bounded below (as a constant function here equal to 2σ). We are then in the general framework of the problems under study in this paper. This suggests that for some admissible choice of a superharmonic exponential function α = e^−ax, the α-Doob transform of x_t could lead to a transformed process with bounded killing rate d = −G(α)/α. We shall choose a = σ for its interesting features.

The transition density p(x; t, y) of x_t admits the representation (5.2) in terms of their oblate spheroidal wave eigenfunctions. Let

()

where

is the skewed sample heterozygosity, damped by the factor

. Then, α solves −G(α) = (1/2)σ²x(1 − x)e^−σx, with solution α(x) = e^−σx. In this case study, one selects sample paths of (x_t; t ≥ 0) with large α(y). The dynamics of

is the drift-less neutral WF dynamics

, subject to quadratic killing at rate d(x) = (1/2)σ²x(1 − x) in I, which is bounded above there. The boundaries of

are still exit and the stopping time

, where

is its absorption time at the boundaries and

its killing time. The density of the transformed process is

. Its series expansion is exactly known using (5.2) for p.

The transformed process accounts for a neutral evolution of the allele A₁ frequency subject to the additional extinction opportunity of the population itself due to killing at rate proportional to its heterozygosity. Leaving aside the fact that it can be obtained after a suitable Doob transformation, this model is of importance in population genetics: it first appeared in ([8, Page 272]) as a scaling limit of a population genetics model of recombination.

From the general study of Section 2, we obtain the following.

(i) Conditioned on

, the transformed process

admits a Yaglom limit

. With

the first oblate spheroidal eigenvector of −G^* associated to the smallest positive eigenvalue

is of the product form

()

This limiting probability

is the Yaglom limit law of

conditioned on the event

that both the absorption and killing times exceed t.

(ii) Let

be the infinitesimal generator of

with two stopping times. Now, there is a tradeoff between which of

and

occurs first. To solve it, we need to compute β defined in (2.55) by

, with boundary conditions β(0) = β(1) = 1. This is a Sturm-Liouville problem whose solution in our case is

()

The function

is minimal when x = 1/2, with value β(1/2) = 1/(cosh (σ/2)). This looks natural because when

, the chance to hit {0,1} before getting killed should be the lowest.

The density of the process

conditioned on the event

()

and so is also explicitly known from the oblate spheroidal wave expansion (5.2) of p(x; t, y). The tail distribution of

given

is obtained by integrating

over y.

Similarly, the density of the process

conditioned on the event

()

The tail distribution of

given

is obtained by integrating

over y.

The associated conditioned on absorption first process

obeys the SDE

()

with drift

()

and local variance unchanged g²(x) = x(1 − x). This process has no killing part and it gets eventually absorbed at {0,1}.

In the generator

of the conditioned on killing first process, there is a killing multiplicative part which is enhanced d/(1 − β) > d and a shift in the drift, showing that the associated conditioned process

exhibits a faster killing rate, but the drift shift guarantees that

is not absorbed at the boundaries. With g²(x) = x(1 − x) and β as in (6.3), the drift takes the peculiar explicit form

()

7. From the Neutral WF Model to the WF Model with Selection: Reciprocal Doob Transform and Branching

We now follow the general path indicated in Section 2.7 and apply it to the particular models under concern. We, therefore, illustrate and develop the idea of a reciprocal Doob transform on the specific example of interest.

The starting point is now the neutral Wright-Fisher diffusion:

, x₀ = x ∈ (0,1). For this model,

and both boundaries are exit. Its transition density p(x; t, y) admits the representation

()

in terms of the Gegenbauer eigenpolynomials (see Section 4.1). We shall consider the following reciprocal transformation of paths for the neutral WF model: let α(x) = e^σx and consider

. We now have G(α) = (1/2)σ²x(1 − x)e^σx and b(x) = G(α)/α > 0.

In this case study, one selects sample paths of (x_t; t ≥ 0) with large α(y). The dynamics of

is easily seen to be the WF with selection dynamics

()

subject to quadratic branching at rate b(x) = (1/2)σ²x(1 − x) inside I. We indeed have

()

where

is the KBE operator of the dynamics

With β(x) = α(x)⁻¹ = e^−σx, we clearly have

()

and β is an harmonic function for

and as a result, Doob-transforming

by β, we get

()

which is the infinitesimal generator of the original neutral WF martingale.

The birth (creating) rate b in

is bounded from above on (0,1). It may be put into the canonical form b(x) = b_*(μ(x) − 1), where b_* = max _x∈[0,1](b(x)) = σ²/8 > 0 and

()

whose range is the interval [1,2] as x ∈ [0,1].

The density of the transformed process is . It is exactly known because p is known from (7.1).

The transformed process (with infinitesimal backward generator ) accounts for a branching diffusion (BD), where a diffusing mother particle (with generator and started at x) lives a random exponential time with constant rate b_*. When the mother particle dies, it gives birth to a spatially dependent random number M(x) of particles (with mean μ(x)). If M(x) ≠ 0, M(x) independent daughter particles are started where their mother particle died; they move along a WF diffusion with selection and reproduce, independently and so on.

Because μ(x) is bounded above by 2 and larger than 1 (indicating a supercritical branching process), we actually get a BD with binary scission whose random offspring number satisfies

()

with p₂(x) ≥ p₁(x) (the event that 2 particles are generated in a splitting event is more probable than a single one).

For such a transformed process, the tradeoff is of a different nature: there is a competition between the boundaries {0,1} which are still absorbing for the system of particles and the number of particles N_t(x) in the system at each time t, which may grow due to branching events. The density

of the transformed process has now the following interpretation:

()

where p⁽ⁿ⁾(x; t, y) is the density at (t, y) of the nth alive particle descending from the ancestral one (Eve), started at x. In the latter formula, the sum vanishes if N_t(x) = 0. A particle is alive at time t if it came to birth before t and has not been yet absorbed by the boundaries.

Let

. Then,

is the expected number of particle alive at time t. We have

()

But then,

obeys the forward PDE

()

as a result of

. We have

()

showing that

is the average presence density at (t, y) of the system of particles all descending from Eve started at x.

Clearly,

(and, therefore, also

, because

()

The expected number of particles in the system decays globally at rate λ₁.

The BD transformed process, therefore, admits an integrable Yaglom limit

, solution to

. With v₁(y) = 1, the first eigenvector of −G^* associated to the smallest positive eigenvalue λ₁ = 1,

is of the product form

()

This limiting probability

is the Yaglom limiting average presence density at (t, y) for the BD system of particles (it is also the ground state for

There is also a natural eigenvector

of the backward operator

, satisfying

(the ground state for

). It is explicitly here that

()

In the terminology of [26], both operators

and its adjoint are critical (

(

) is said to be critical if there exists some function

(

, resp.), strictly positive in (0,1), such that:

(

.) and the operators do not possess a minimal positive Green function.). In this context, the constant λ₁ is called the generalized principal eigenvalue. The eigenfunctions

are their associated ground states.

We note that we have the L¹-product property (see [26, Subsection 4.9]).

()

With p_n(x) = P(M(x) = n), let

()

We have the xlog x condition

()

We conclude (following [9, 10]) that, as a result of the condition (7.17) being trivially satisfied, global extinction holds in the following sense:

(i), uniformly in x,

(ii) there exists a constant γ > 0:, uniformly in x,

(iii) For all bounded measurable function ψ on I,

()

From (i), it is clear that the process gets ultimately extinct with probability 1. In the tradeoff between branching and absorption at the boundaries, all particles get eventually absorbed and the global BD process turns out be subcritical (even though μ(x) = EM(x) > 1 for all x ∈ (0,1)): probability mass escapes out of I although the BD survives with positive probability.

In the statement (ii), the quantity 1 − P(N_t(x) = 0) = P(N_t(x) > 0) is also P(T(x) > t) where T(x) is the global extinction time of the particle system descending from an Eve particle started at x. The number −λ₁ is the usual Malthus decay rate parameter. From has a natural interpretation in terms of the propensity of the particle system to survive to its extinction fate: the so-called reproductive value in demography.

(iii) with ψ = 1 reads giving an interpretation of the constant γ (which may be hard to evaluate in practise).

The ground states of

and its adjoint are, thus,

and explicit here. It is useful to consider the process whose infinitesimal generator is given by the Doob transform

()

because product-criticality is preserved under this transformation. The ground states associated to this new operator and its dual are

. Developing, we obtain a process whose infinitesimal generator is

()

with no multiplicative part. In our case study, we get

adding a stabilizing drift towards 1/2 to the original neutral WF model. The associated diffusion process is positive recurrent and so its invariant measure

is integrable. It is the beta(2,2) limit law of the Q-process (see (2.80) and (4.23)) for the neutral WF diffusion.

Remark 7.1. At time t, let denote the positions of the BD particle system. Let stand for the functional generating function (|z| ≤ 1) of the measure-valued branching particle system. u(x, t; z) obeys the nonlinear (quadratic) Kolmogorov-Petrovsky-Piscounoff PDE, [27]

()

where θ(x, z) = E[z^M(x)] − z = (p₂(x)z² + p₁(x)z) − z or

()

is the shifted probability generating function of the branching law of M(x). Thus, the nonlinear part reads b_*θ(x, u(x, t; z)) = b(x)u(x, t; z)(u(x, t; z) − 1), which is quadratic in u.

In particular, if , u(x, t) obeys the linear backward PDE

()

involving

. The latter evolution equation is the backward version of the forward PDE giving the evolution of

References

1 Nagylaki T., Gustave Malécot and the transition from classical to modern population genetics, Genetics. (1989) 122, no. 2, 253–268.
10.1093/genetics/122.2.253
CAS PubMed Web of Science® Google Scholar
2 Crow J. F. and Kimura M., An Introduction to Population Genetics Theory, 1970, Harper & Row, New York, NY, USA, 0274068.
10.1006/tpbi.1995.1025
Google Scholar
3 Maruyama T., Stochastic Problems in Population Genetics, 1977, 17, Springer, Berlin, Germany, Lecture Notes in Biomathematics, 513425.
10.1007/978-3-642-93065-2
Google Scholar
4 Ewens W. J., Mathematical Population Genetics: I. Theoretical Introduction, 2004, 27, 2nd edition, Springer, New York, NY, USA, Interdisciplinary Applied Mathematics, 2026891.
10.1007/978-0-387-21822-9
Google Scholar
5 Durrett R., Probability Models for DNA Sequence Evolution, 2008, 2nd edition, Springer, New York, NY, USA, Probability and Its Applications, 2439767.
10.1007/978-0-387-78168-6
Google Scholar
6 Gillespie J. H., The Causes of Molecular Evolution, 1991, Oxford University Press, New York, NY, USA.
Google Scholar
7 Yaglom A. M., Certain limit theorems of the theory of branching random processes, Doklady Akademii Nauk SSSR. (1947) 56, 795–798, 0022045.
Google Scholar
8 Karlin S. and Taylor H. M., A Second Course in Stochastic Processes, 1981, Academic Press, New York, NY, USA, 611513.
Web of Science® Google Scholar
9 Asmussen S. and Hering H., Strong limit theorems for general supercritical branching processes with applications to branching diffusions, Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete. (1976) 36, no. 3, 195–212, 0420889, ZBL0325.60081.
10.1007/BF00532545
Google Scholar
10 Asmussen S. and Hering H., Some modified branching diffusion models, Mathematical Biosciences. (1977) 35, no. 3-4, 281–299, 0682242, https://doi.org/10.1016/0025-5564(77)90029-3, ZBL0369.60103.
10.1016/0025-5564(77)90029-3
Web of Science® Google Scholar
11 Dynkin E. B., Markov Processes. Vols. I, II, 1965, 122, Academic Press, New York, NY, USA; Springer, Berlin, Germany, Die Grundlehren der Mathematischen Wissenschaften, Bände 121, translated with the Authorization and Assistance of the Author by J. Fabius, V. Greenberg, A. Maitra, G. Majone.
10.1007/978-3-662-00031-1
Web of Science® Google Scholar
12 Ethier S. N. and Kurtz T. G., Markov Processes: Characterization and Convergence, 1986, John Wiley & Sons, New York, NY, USA, Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics, https://doi.org/10.1002/9780470316658, 838085.
10.1002/9780470316658
Google Scholar
13 Mandl P., Analytical Treatment of One-Dimensional Markov Processes, 1968, 151, Academia Publishing House of the Czechoslovak Academy of Sciences, Prague, Czech Republic; Springer, New York, NY, USA, Die Grundlehren der mathematischen Wissenschaften, 0247667.
Google Scholar
14 Itô K., On stochastic differential equations, Memoirs of the American Mathematical Society. (1951) 1951, no. 4, 0040618, ZBL0054.05803.
Google Scholar
15 Kimura M., On the probability of fixation of mutant genes in a population, Genetics. (1962) 47, 713–729.
10.1093/genetics/47.6.713
CAS PubMed Web of Science® Google Scholar
16 Steinsaltz D. and Evans S. N., Quasistationary distributions for one-dimensional diffusions with killing, Transactions of the American Mathematical Society. (2007) 359, no. 3, 1285–1324, https://doi.org/10.1090/S0002-9947-06-03980-8, 2262851, ZBL1107.60048.
10.1090/S0002-9947-06-03980-8
Web of Science® Google Scholar
17 McKane A. J. and Waxman D., Singular solutions of the diffusion equation of population genetics, Journal of Theoretical Biology. (2007) 247, no. 4, 849–858, 2479629, https://doi.org/10.1016/j.jtbi.2007.04.016.
10.1016/j.jtbi.2007.04.016
CAS PubMed Web of Science® Google Scholar
18 Griffiths R. C., The frequency spectrum of a mutation, and its age, in a general diffusion model, Theoretical Population Biology. (2003) 64, no. 2, 241–251, https://doi.org/10.1016/S0040-5809(03)00075-3, ZBL1104.92045.
10.1016/S0040-5809(03)00075-3
CAS PubMed Web of Science® Google Scholar
19 Lambert A., Population dynamics and random genealogies, Stochastic Models. (2008) 24, no. supplement 1, 45–163, 2466449, https://doi.org/10.1080/15326340802437728.
10.1080/15326340802437728
Web of Science® Google Scholar
20 Huillet T., On Wright-Fisher diffusion and its relatives, Journal of Statistical Mechanics: Theory and Experiment. (2007) 2007, no. 11, P11006, https://doi.org/10.1088/1742-5468/2007/11/P11006.
10.1088/1742-5468/2007/11/P11006
Google Scholar
21 Blythe R. A. and McKane A. J., Stochastic models of evolution in genetics, ecology and linguistics, Journal of Statistical Mechanics. (2007) 7, P07018, https://doi.org/10.1088/1742-5468/2007/07/P07018.
10.1088/1742-5468/2007/07/P07018
Google Scholar
22 Kimura M., Stochastic processes and distribution of gene frequencies under natural selection, Cold Spring Harbor Symposia on Quantitative Biology. (1955) 20, 33–53, https://doi.org/10.1101/SQB.1955.020.01.006.
10.1101/SQB.1955.020.01.006
CAS PubMed Web of Science® Google Scholar
23 Kimura M. and Ohta T., The age of a neutral mutant persisting in a finite population, Genetics. (1973) 75, 199–212.
10.1093/genetics/75.1.199
CAS PubMed Web of Science® Google Scholar
24 Mano S., Duality, ancestral and diffusion processes in models with selection, Theoretical Population Biology. (2009) 75, no. 2-3, 164–175, https://doi.org/10.1016/j.tpb.2009.01.007, ZBL1211.92043.
10.1016/j.tpb.2009.01.007
PubMed Web of Science® Google Scholar
25 Kimura M., Diffusion models in population genetics, Journal of Applied Probability. (1964) 1, 177–232, 0172727, https://doi.org/10.2307/3211856, ZBL0134.38103.
10.2307/3211856
Google Scholar
26 Pinsky R. G., Positive Harmonic Functions and Diffusion, 1995, 45, Cambridge University Press, Cambridge, UK, Cambridge Studies in Advanced Mathematics, https://doi.org/10.1017/CBO9780511526244, 1326606.
10.1017/CBO9780511526244
Google Scholar
27 Kolmogorov A., Petrovsky I., and Piscounov N., Étude de l′équation de la diffusion avec croissance de la quantité de matière et son application à un probléme biologique, Moscow University Mathematics Bulletin. (1937) 1, ZBL0018.32106.
Google Scholar

All articles

Nonconservative Diffusions on [0, 1] with Killing and Branching: Applications to Wright-Fisher Models with or without Selection

Abstract

1. Introduction

2. Diffusion Processes on The Unit Interval: A Reminder

2.1. Generalities on One-Dimensional Diffusions on the Interval [0,1]

2.2. Natural Coordinate, Scale, and Speed Measure

2.3. The Transition Probability Density

2.4. Additive Functionals Along Sample Paths

2.5. Transformation of Sample Paths (Doob-Transform) and Killing

2.6. Normalizing and Conditioning

2.7. Branching and the Reciprocal Doob Transform

3. The Wright-Fisher Example

3.1. The Neutral Wright-Fisher Model

3.2. Nonneutral Cases

4. The Neutral WF Model

4.1. Explicit Solutions of the Neutral KBE and KFE

4.2. Additive Functionals for the Neutral WF

4.3. Transformation of WF Sample Paths, [3]

5. The WF Model with Selection

6. From the WF Model with Selection to the Neutral WF Model: Doob Transform and Killing

7. From the Neutral WF Model to the WF Model with Selection: Reciprocal Doob Transform and Branching

References

References

Information

About Wiley Online Library

Help & Support

Opportunities

Connect with Wiley

Nonconservative Diffusions on [0, 1] with Killing and Branching: Applications to Wright-Fisher Models with or without Selection

Abstract

1. Introduction

2. Diffusion Processes on The Unit Interval: A Reminder

2.1. Generalities on One-Dimensional Diffusions on the Interval [0,1]

2.2. Natural Coordinate, Scale, and Speed Measure

2.3. The Transition Probability Density

2.4. Additive Functionals Along Sample Paths

2.5. Transformation of Sample Paths (Doob-Transform) and Killing

2.6. Normalizing and Conditioning

2.7. Branching and the Reciprocal Doob Transform

3. The Wright-Fisher Example

3.1. The Neutral Wright-Fisher Model

3.2. Nonneutral Cases

4. The Neutral WF Model

4.1. Explicit Solutions of the Neutral KBE and KFE

4.2. Additive Functionals for the Neutral WF

4.3. Transformation of WF Sample Paths, [3]

5. The WF Model with Selection

6. From the WF Model with Selection to the Neutral WF Model: Doob Transform and Killing

7. From the Neutral WF Model to the WF Model with Selection: Reciprocal Doob Transform and Branching

References

References

Related

Information