Volume 2012, Issue 1 760687
Research Article
Open Access

A Family of Lifetime Distributions

Vasileios Pappas

Vasileios Pappas

Department of Mathematics, University of Ioannina, 45110 Ioannina, Greece uoi.gr

Search for more papers by this author
Konstantinos Adamidis

Corresponding Author

Konstantinos Adamidis

Department of Business Administration of Food and Agricultural Enterprises, University of Ioannina, 30100 Agrinio, Greece uoi.gr

Search for more papers by this author
Sotirios Loukas

Sotirios Loukas

Department of Mathematics, University of Ioannina, 45110 Ioannina, Greece uoi.gr

Search for more papers by this author
First published: 2012
Citations: 21
Academic Editor: Suk joo Bae

Abstract

A four-parameter family of Weibull distributions is introduced, as an example of a more general class created along the lines of Marshall and Olkin, 1997. Various properties of the distribution are explored and its usefulness in modelling real data is demonstrated using maximum likelihood estimates.

1. Introduction

Probability distributions are often used in survival analysis for modeling data, because they offer insight into the nature of various parameters and functions, particularly the failure rate (or hazard) function. Throughout the last decades, a considerable amount of research was devoted to the creation of lifetime models with more than the classical increasing and decreasing hazard rates; apparently, the motivation for this trend was to provide with more freedom of choice in the description of complex practical situations (see e.g., [19], and the references therein). In this paper a general class of models is introduced, by adding an extra parameter to a distribution in the sense of Marshall and Olkin [10], and subsequently used in developing a four-parameter modified Weibull extension distribution, with various failure rate curves that compete well with other alternatives in fitting real data. Specifically, Xie et al. [11] generalized the Chen [12] distribution by adding the lacking scale parameter, thus creating a three-parameter Weibull distribution; although the variety of shapes of the reliability curves was not enriched, the resulting model provided better fit to real data. The proposed distribution extends the Xie et al. [11] distribution by adding a shape parameter; it will be seen that compared to the previous and other models, the cost of the addition is balanced by the improvement in fitting real data.

The paper is organized as follows. Section 2 includes the general class of models and some properties. The proposed four-parameter Weibull model is introduced in Section 3 and some properties and reliability aspects are studied. The parameters are estimated by the method of maximum likelihood and the observed information matrix is obtained; the fit of the proposed distribution to two sets of real data is examined against three and two parameter competitors.

2. The Class of Distributions

It is possible to generalize a distribution by adding a shape parameter, in the sense of Marshall and Olkin [10]. Thus, starting with a distribution with survival function s0, the survival function of the proposed family with the additional parameter p is given by
()
and when p → 1, then ss0. The probability density and hazard functions are readily found to be
()
where f0 and h0 are the probability density and hazard functions corresponding to the distribution with survival function s0, and x, p+ − {0}. Since,
()
it follows from (2) that
()
Therefore, h(x)/h0(x) with x is increasing for p ≥ 1 and decreasing for p ∈ (0,1]. When s0(0) = 1, the hazard function at the origin, h(0), behaves quite differently than the corresponding functions for the Weibull and gamma distributions; for both these families, the distribution can be exponential, or h(0) = 0, or h(0) = , so that h(0) is discontinuous in the shape parameter. This is not the case for the hazard functions in (2), and therefore the proposed family may be useful in fine-tuning the distribution with survival function s0.

3. A Weibull Extension Model

The survival function of the modified Weibull extension distribution, introduced by Xie et al. [11] and studied further by Tang et al. [13], is given by
()
for x, α, β, λ+ − {0}; hereinafter we shall be referring to the distribution with survival function given by (5) as the XTG distribution for brevity. By substituting (5) in (1), the survival function of the proposed distribution is obtained in the form
()
where θ = (α, β, λ, p) and x, α, β, λ, p+ − {0}. From (2) or (6), the pdf is readily obtained as
()
where α is ascale parameter and β, λ, p are shape parameters. It can be shown that the pdf is monotone decreasing, unimodal or even roller-coaster type; the different shapes of the pdf are illustrated in Figure 1 for selected values of the parameters.
Details are in the caption following the image
Probability density functions of the EXTG distribution for α = 2,   β = 0.5,   λ = 0.03,   p = 0.2 (solid line), α = 12,   β = 4,   λ = 2,   p = 0.2 (dashed line), and α = 2,   β = 0.7,   λ = 0.3,   p = 0.2 (dotted line).
Clearly, for p → 1 the proposed distribution reduces to the XTG distribution therefore the proposed model can be viewed as an extension of the XTG model (which is asymptotically related to the usual two-parameter Weibull distribution) and if, in addition, α = 1, then (7) defines the Chen [12] distribution; hereinafter we shall be referring to this distribution as the extended XTG distribution (EXTG distribution for brevity). Furthermore, it can be shown that for p ∈ (0,1) (7) is a compound of the logarithmic and the XTG distributions. Indeed, by incorporating the results of Barlow and Proschan [14] and Arnold et al. [15], consider the lifetime X = min (X1, X2, …, XZ) of a “series-system” of Z identical components, where failure occurs if at least one component ceases to function. If the lifetimes of the components are iid random variables with survivals given by (5) and the distribution of their number Z is logarithmic, independently of the X’s, with pmf
()
for z − {0}, p ∈ (0,1), then the distribution of XZ has pdf
()
for x, α, β, λ+ − {0}, and the distribution of X is the EXTG with pdf given by (7).
The calculations of the rth raw moments of the EXTG distribution involve the use of standard numerical integration procedures available in every mathematical package; for p ∈ (0,1) they can be expressed in the form,
()
By straightforward reversal of the cdf, obtained from (6) using that F(x; θ) = 1 − S(x), the quantile function is calculated to be
()
for p ∈ (0,1); hence the median is .

3.1. Failure Rate and Mean Residual Life Functions

From (6) and (7) the failure rate (also known as hazard rate) function of the EXTG distribution is
()
It can be shown that for β ≥ 1, the EXTG is an IFR distribution [16]. However, for β < 1 it can be IFR, DFR, and BTFR distribution, although it is not easy to determine analytically the ranges of the parameter values; the IFR, DFR, and BTFR characteristics are depicted in Figure 2 for selected values of the parameters. Given that there is no failure prior to x0, the residual life is the period from time x0 until the time of failure. The mean residual lifetime, for p ∈ (0,1), is
()
Other reliability aspects of the distribution can be obtained numerically. For example, the renewal function, which is important for maintenance, can be calculated approximately either by the well-known method of considering its limit at infinity and the first and second raw moments given by (10), or by applying the method of the linear combination of the cdf and the hazard function; see Cui and Xie [17], Jiang [18] and the references therein.
Details are in the caption following the image
Hazard rate functions of the EXTG distribution for α = 0.05,   β = 0.45,   λ = 0.002,   p = 20 (solid line), α = 0.2,   β = 0.2,   λ = 3,   p = 2 (dashed line), and α = 10,   β = 0.9,   λ = 0.5,   p = 200 (dotted line).

3.2. Inference

Assuming a random sample of n observations, yobs = (xi;  i = 1, …, n), from (7), the log-likelihood is given by
()
and by differentiating with respect to θ the gradients are
()
The mle of θ, , is obtained by solving simultaneously the four nonlinear normal equations, /α = 0, /β = 0, /λ = 0 and /p = 0 by any iterative numerical method such as the Newton-Raphson, quasi-Newton, or Nelder-Mead procedures.
If the true parameter vector is an interior point of the parameter space, then can be treated as being approximately trivariate normal with mean θ and covariance matrix the inverse of the Fisher’s expected information matrix J(θ) = E(I; θ), where I(θ) is the observed information matrix with elements Iij = −2/θiθj,  i, j = 1, …, 4 and the expectation is to be taken with respect to the distribution of X. By differentiating the normal equations in (15) the elements in the upper triangular part of I(θ) are found to be
()

The latter is a consistent estimator of J(θ) and can be used for constructing asymptotic confidence intervals for the parameters. However, if any of the true parameter values is zero then the asymptotic distribution of the maximum likelihood estimators is a mixture distribution [19]; in this case obtaining the asymptotic confidence intervals becomes quite difficult and shall not be pursued here.

3.3. Examples

In this section two sets of real data are considered in order to test the goodness of fit of the proposed model. The first set of data consists of times to first failure of fifty devices [20]. The second set of data involves forty-four observations obtained from a life test concerning failure times (in hours) of all subsystems of a machine, that is, engine, hydraulic and air-conditioning subsystems, brakes, transmissions, tyres and wheels, body and chassis [21, 22]; in both cases, the data were grouped and the empirical hazard rate was estimated by many methods to indicate a BT shape. In addition to the EXTG, the XTG distribution, the two-parameter Chen [12] distribution, and the three-parameter model introduced by Dimitrakopoulou et al. [3], were fitted to the datasets; for brevity, hereinafter we shall be referring to the latter two models as the Chen and DAL distributions respectively. The fit of each distribution was examined by the Akaike information criterion (AIC) and the Kolmogorov-Smirnov (K-S) goodness-of-fit test using maximum likelihood estimates; the estimates, the maximized log-likelihoods, the values of the AIC, and the values of the K-S statistic with the associated P-values are presented in Table 1. Furthermore, the values of the likelihood ratio test statistic for testing H0 : p = 1, calculated from the first and the second set of data, were 8.7939 (P = 0.003) and 6.8366 (P = 0.0089), respectively; the analogous computations for testing H0 : p = α = 1 were 11.8371 (P = 0.0027) and 6.479 (P = 0.0392). All the results indicate that the EXTG distribution describes these data better than the other models; these findings are also supported by the empirical and fitted survivor functions, plotted in Figure 3.

Table 1. Parameter estimates, values of the log-likelihood (LL) and Akaike information criterion (AIC), and Kolmogorov-Smirnov (K-S) statistic obtained from the fit of each of the four distributions, to the times of first failure of devices (data set 1) and the machine’s subsystems lifetimes (data set 2).
Data set Distribution LL AIC K-S (P value)
1  (n = 50) EXTG (14.06.4838.042,313.57) 227.25 462.50 .1470 (.2300)
XTG (13.747.588.00876, —) 231.65 469.29 .1597 (.1562)
Chen (—.3444.0205, —) 233.17 470.34 .1676 (.1204)
DAL (6.2975.8074.0053, —) 236.04 478.08 .1981 (.03945)
  
2  (n = 44) EXTG (.0831.2007.0211,20.4087) 348.01 704.02 .1397 (.3262)
XTG (.0806.2086.0057, —) 351.43 708.86 .1667  (.1544)
Chen (—.2473.0024, —) 351.25 706.50 .1737 (.1244)
DAL (1.752.7771.0021, —) 359.44 724.89 .2875 (.0001)
Details are in the caption following the image
Reliability curves of the empirical distribution (starred line), the EXTG distribution (solid line), the XTG distribution (dashed line), the Chen distribution (dotted line), and the DAL distribution (dot-dashed line) for the times of first failure of fifty devices (a) and the forty-four failure times of a machine′s subsystem (b).
Details are in the caption following the image
Reliability curves of the empirical distribution (starred line), the EXTG distribution (solid line), the XTG distribution (dashed line), the Chen distribution (dotted line), and the DAL distribution (dot-dashed line) for the times of first failure of fifty devices (a) and the forty-four failure times of a machine′s subsystem (b).

Acknowledgment

The authors would like to thank a referee for useful comments and suggestions.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.