A four-parameter family of Weibull distributions is introduced, as an example of a more general class created along the lines of Marshall and Olkin, 1997. Various properties of the distribution are explored and its usefulness in modelling real data is demonstrated using maximum likelihood estimates.
1. Introduction
Probability distributions are often used in survival analysis for modeling data, because they offer insight into the nature of various parameters and functions, particularly the failure rate (or hazard) function. Throughout the last decades, a considerable amount of research was devoted to the creation of lifetime models with more than the classical increasing and decreasing hazard rates; apparently, the motivation for this trend was to provide with more freedom of choice in the description of complex practical situations (see e.g., [1–9], and the references therein). In this paper a general class of models is introduced, by adding an extra parameter to a distribution in the sense of Marshall and Olkin [10], and subsequently used in developing a four-parameter modified Weibull extension distribution, with various failure rate curves that compete well with other alternatives in fitting real data. Specifically, Xie et al. [11] generalized the Chen [12] distribution by adding the lacking scale parameter, thus creating a three-parameter Weibull distribution; although the variety of shapes of the reliability curves was not enriched, the resulting model provided better fit to real data. The proposed distribution extends the Xie et al. [11] distribution by adding a shape parameter; it will be seen that compared to the previous and other models, the cost of the addition is balanced by the improvement in fitting real data.
The paper is organized as follows. Section 2 includes the general class of models and some properties. The proposed four-parameter Weibull model is introduced in Section 3 and some properties and reliability aspects are studied. The parameters are estimated by the method of maximum likelihood and the observed information matrix is obtained; the fit of the proposed distribution to two sets of real data is examined against three and two parameter competitors.
2. The Class of Distributions
It is possible to generalize a distribution by adding a shape parameter, in the sense of Marshall and Olkin [10]. Thus, starting with a distribution with survival function s0, the survival function of the proposed family with the additional parameter p is given by
()
and when p → 1, then s → s0. The probability density and hazard functions are readily found to be
()
where f0 and h0 are the probability density and hazard functions corresponding to the distribution with survival function s0, and x ∈ ℝ, p ∈ ℝ+ − {0}. Since,
Therefore, h(x)/h0(x) with x ∈ ℝ is increasing for p ≥ 1 and decreasing for p ∈ (0,1]. When s0(0) = 1, the hazard function at the origin, h(0), behaves quite differently than the corresponding functions for the Weibull and gamma distributions; for both these families, the distribution can be exponential, or h(0) = 0, or h(0) = ∞, so that h(0) is discontinuous in the shape parameter. This is not the case for the hazard functions in (2), and therefore the proposed family may be useful in fine-tuning the distribution with survival function s0.
3. A Weibull Extension Model
The survival function of the modified Weibull extension distribution, introduced by Xie et al. [11] and studied further by Tang et al. [13], is given by
()
for x, α, β, λ ∈ ℝ+ − {0}; hereinafter we shall be referring to the distribution with survival function given by (5) as the XTG distribution for brevity. By substituting (5) in (1), the survival function of the proposed distribution is obtained in the form
()
where θ = (α, β, λ, p) and x, α, β, λ, p ∈ ℝ+ − {0}. From (2) or (6), the pdf is readily obtained as
()
where α is ascale parameter and β, λ, p are shape parameters. It can be shown that the pdf is monotone decreasing, unimodal or even roller-coaster type; the different shapes of the pdf are illustrated in Figure 1 for selected values of the parameters.
Probability density functions of the EXTG distribution for α = 2, β = 0.5, λ = 0.03, p = 0.2 (solid line), α = 12, β = 4, λ = 2, p = 0.2 (dashed line), and α = 2, β = 0.7, λ = 0.3, p = 0.2 (dotted line).
Clearly, for p → 1 the proposed distribution reduces to the XTG distribution therefore the proposed model can be viewed as an extension of the XTG model (which is asymptotically related to the usual two-parameter Weibull distribution) and if, in addition, α = 1, then (7) defines the Chen [12] distribution; hereinafter we shall be referring to this distribution as the extended XTG distribution (EXTG distribution for brevity). Furthermore, it can be shown that for p ∈ (0,1) (7) is a compound of the logarithmic and the XTG distributions. Indeed, by incorporating the results of Barlow and Proschan [14] and Arnold et al. [15], consider the lifetime X = min (X1, X2, …, XZ) of a “series-system” of Z identical components, where failure occurs if at least one component ceases to function. If the lifetimes of the components are iid random variables with survivals given by (5) and the distribution of their number Z is logarithmic, independently of the X’s, with pmf
()
for z ∈ ℕ − {0}, p ∈ (0,1), then the distribution of X∣Z has pdf
()
for x, α, β, λ ∈ ℝ+ − {0}, and the distribution of X is the EXTG with pdf given by (7).
The calculations of the rth raw moments of the EXTG distribution involve the use of standard numerical integration procedures available in every mathematical package; for p ∈ (0,1) they can be expressed in the form,
()
By straightforward reversal of the cdf, obtained from (6) using that F(x; θ) = 1 − S(x), the quantile function is calculated to be
()
for p ∈ (0,1); hence the median is .
3.1. Failure Rate and Mean Residual Life Functions
From (6) and (7) the failure rate (also known as hazard rate) function of the EXTG distribution is
()
It can be shown that for β ≥ 1, the EXTG is an IFR distribution [16]. However, for β < 1 it can be IFR, DFR, and BTFR distribution, although it is not easy to determine analytically the ranges of the parameter values; the IFR, DFR, and BTFR characteristics are depicted in Figure 2 for selected values of the parameters. Given that there is no failure prior to x0, the residual life is the period from time x0 until the time of failure. The mean residual lifetime, for p ∈ (0,1), is
()
Other reliability aspects of the distribution can be obtained numerically. For example, the renewal function, which is important for maintenance, can be calculated approximately either by the well-known method of considering its limit at infinity and the first and second raw moments given by (10), or by applying the method of the linear combination of the cdf and the hazard function; see Cui and Xie [17], Jiang [18] and the references therein.
Hazard rate functions of the EXTG distribution for α = 0.05, β = 0.45, λ = 0.002, p = 20 (solid line), α = 0.2, β = 0.2, λ = 3, p = 2 (dashed line), and α = 10, β = 0.9, λ = 0.5, p = 200 (dotted line).
3.2. Inference
Assuming a random sample of n observations, yobs = (xi; i = 1, …, n), from (7), the log-likelihood is given by
()
and by differentiating with respect to θ the gradients are
()
The mle of θ, , is obtained by solving simultaneously the four nonlinear normal equations, ∂ℓ/∂α = 0, ∂ℓ/∂β = 0, ∂ℓ/∂λ = 0 and ∂ℓ/∂p = 0 by any iterative numerical method such as the Newton-Raphson, quasi-Newton, or Nelder-Mead procedures.
If the true parameter vector is an interior point of the parameter space, then can be treated as being approximately trivariate normal with mean θ and covariance matrix the inverse of the Fisher’s expected information matrix J(θ) = E(I; θ), where I(θ) is the observed information matrix with elements Iij = −∂2ℓ/∂θi∂θj, i, j = 1, …, 4 and the expectation is to be taken with respect to the distribution of X. By differentiating the normal equations in (15) the elements in the upper triangular part of I(θ) are found to be
()
The latter is a consistent estimator of J(θ) and can be used for constructing asymptotic confidence intervals for the parameters. However, if any of the true parameter values is zero then the asymptotic distribution of the maximum likelihood estimators is a mixture distribution [19]; in this case obtaining the asymptotic confidence intervals becomes quite difficult and shall not be pursued here.
3.3. Examples
In this section two sets of real data are considered in order to test the goodness of fit of the proposed model. The first set of data consists of times to first failure of fifty devices [20]. The second set of data involves forty-four observations obtained from a life test concerning failure times (in hours) of all subsystems of a machine, that is, engine, hydraulic and air-conditioning subsystems, brakes, transmissions, tyres and wheels, body and chassis [21, 22]; in both cases, the data were grouped and the empirical hazard rate was estimated by many methods to indicate a BT shape. In addition to the EXTG, the XTG distribution, the two-parameter Chen [12] distribution, and the three-parameter model introduced by Dimitrakopoulou et al. [3], were fitted to the datasets; for brevity, hereinafter we shall be referring to the latter two models as the Chen and DAL distributions respectively. The fit of each distribution was examined by the Akaike information criterion (AIC) and the Kolmogorov-Smirnov (K-S) goodness-of-fit test using maximum likelihood estimates; the estimates, the maximized log-likelihoods, the values of the AIC, and the values of the K-S statistic with the associated P-values are presented in Table 1. Furthermore, the values of the likelihood ratio test statistic for testing H0 : p = 1, calculated from the first and the second set of data, were 8.7939 (P = 0.003) and 6.8366 (P = 0.0089), respectively; the analogous computations for testing H0 : p = α = 1 were 11.8371 (P = 0.0027) and 6.479 (P = 0.0392). All the results indicate that the EXTG distribution describes these data better than the other models; these findings are also supported by the empirical and fitted survivor functions, plotted in Figure 3.
Table 1.
Parameter estimates, values of the log-likelihood (LL) and Akaike information criterion (AIC), and Kolmogorov-Smirnov (K-S) statistic obtained from the fit of each of the four distributions, to the times of first failure of devices (data set 1) and the machine’s subsystems lifetimes (data set 2).
Reliability curves of the empirical distribution (starred line), the EXTG distribution (solid line), the XTG distribution (dashed line), the Chen distribution (dotted line), and the DAL distribution (dot-dashed line) for the times of first failure of fifty devices (a) and the forty-four failure times of a machine′s subsystem (b).
Reliability curves of the empirical distribution (starred line), the EXTG distribution (solid line), the XTG distribution (dashed line), the Chen distribution (dotted line), and the DAL distribution (dot-dashed line) for the times of first failure of fifty devices (a) and the forty-four failure times of a machine′s subsystem (b).
Acknowledgment
The authors would like to thank a referee for useful comments and suggestions.
1Adamidis K. and
Loukas S., A lifetime distribution with decreasing failure rate, Statistics & Probability Letters. (1998) 39, no. 1, 35–42, 2-s2.0-0011339598.
2Adamidis K.,
Dimitrakopoulou T., and
Loukas S., On an extension of the exponential-geometric distribution, Statistics & Probability Letters. (2005) 73, no. 3, 259–269, 2-s2.0-20744454768, https://doi.org/10.1016/j.spl.2005.03.013.
3Dimitrakopoulou T.,
Adamidis K., and
Loukas S., A lifetime distribution with an upside-down bathtub-shaped hazard function, IEEE Transactions on Reliability. (2007) 56, no. 2, 308–311, 2-s2.0-34250376002, https://doi.org/10.1109/TR.2007.895304.
4Kus C., A new lifetime distribution, Computational Statistics and Data Analysis. (2007) 51, no. 9, 4497–4509, 2-s2.0-34147131788, https://doi.org/10.1016/j.csda.2006.07.017.
6Barreto-Souza W.,
de Morais A. L., and
Cordeiro G. M., The weibull-geometric distribution, Journal of Statistical Computation and Simulation. (2011) 81, no. 5, 645–657, 2-s2.0-79957536819, https://doi.org/10.1080/00949650903436554.
7Tahmasbi R. and
Rezaei S., A two-parameter lifetime distribution with decreasing failure rate, Computational Statistics and Data Analysis. (2008) 52, no. 8, 3889–3901, 2-s2.0-41249084321, https://doi.org/10.1016/j.csda.2007.12.002.
8Barreto-Souza W. and
Cribari-Neto F., A generalization of the exponential-Poisson distribution, Statistics & Probability Letters. (2009) 79, no. 24, 2493–2500, 2-s2.0-70350408341, https://doi.org/10.1016/j.spl.2009.09.003.
9Chahkandi M. and
Ganjali M., On some lifetime distributions with decreasing failure rate, Computational Statistics and Data Analysis. (2009) 53, no. 12, 4433–4440, 2-s2.0-69449091425, https://doi.org/10.1016/j.csda.2009.06.016.
10Marshall A. W. and
Olkin I., A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families, Biometrika. (1997) 84, no. 3, 641–652, 2-s2.0-0001324423.
11Xie M.,
Tang Y., and
Goh T. N., A modified Weibull extension with bathtub-shaped failure rate function, Reliability Engineering & System Safety. (2002) 76, no. 3, 279–285, 2-s2.0-0036604308, https://doi.org/10.1016/S0951-8320(02)00022-4.
12Chen Z., A new two-parameter lifetime distribution with bathtub shape or increasing failure rate function, Statistics & Probability Letters. (2000) 49, no. 2, 155–161, 2-s2.0-0001137633.
13Tang Y.,
Xie M., and
Goh T. N., Statistical analysis of a Weibull extension model, Communications in Statistics—Theory and Methods. (2003) 32, no. 5, 913–928, 2-s2.0-0038516719, https://doi.org/10.1081/STA-120019952.
17Cui L. and
Xie M., Some normal approximations for renewal function of large Weibull shape parameter, Communications in Statistics B. (2003) 32, no. 1, 1–16, 2-s2.0-0037309794, https://doi.org/10.1081/SAC-120013107.
18Jiang R., A simple approximation for the renewal function with an increasing failure rate, Reliability Engineering & System Safety. (2010) 95, 963–969, 2-s2.0-77951068183, https://doi.org/10.1016/j.ress.2010.04.007.
19Self S. G. and
Liang K.-L., Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions, Journal of the American Statistical Association. (1987) 82, 605–610.
21Kumar U.,
Klefsjö B., and
Granholm S., Reliability investigation for a fleet of load haul dump machines in a Swedish mine, Reliability Engineering & System Safety. (1989) 26, no. 4, 341–361, 2-s2.0-0024879264.
22Pulcini G., Modeling the failure data of a repairable equipment with bathtub type failure intensity, Reliability Engineering & System Safety. (2001) 71, no. 2, 209–218, 2-s2.0-0035254439, https://doi.org/10.1016/S0951-8320(00)00101-0.
Please check your email for instructions on resetting your password.
If you do not receive an email within 10 minutes, your email address may not be registered,
and you may need to create a new Wiley Online Library account.
Request Username
Can't sign in? Forgot your username?
Enter your email address below and we will send you your username
If the address matches an existing account you will receive an email with instructions to retrieve your username
The full text of this article hosted at iucr.org is unavailable due to technical difficulties.