ORIGINAL ARTICLE

Open Access

Prediction of death rates for cardiovascular diseases and cancers

Oleg Gaidai
orcid.org/0000-0002-3196-8562
Shanghai Engineering Research Center of Marine Renewable Energy, College of Engineering Science and Technology, Shanghai Ocean University, Shanghai, China
Contribution: Conceptualization (equal)

Yihan Xing (Corresponding Author)
Department of Mechanical and Structural Engineering and Materials Science, University of Stavanger, Stavanger, Norway
Correspondence: Yihan Xing, Department of Mechanical and Structural Engineering and Materials Science, University of Stavanger, Kjell Arholms gate 41, Stavanger 4021, Norway.
Email: [email protected]
Contribution: Validation (equal)

Rajiv Balakrishna
Department of Mechanical and Structural Engineering and Materials Science, University of Stavanger, Stavanger, Norway
Contribution: Investigation (equal)

Jiayao Sun
School of Naval Architecture & Ocean Engineering, Jiangsu University of Science and Technology, Zhenjiang, China
Contribution: Methodology (equal)

Xiaolong Bai
School of Naval Architecture & Ocean Engineering, Jiangsu University of Science and Technology, Zhenjiang, China
Contribution: Validation (equal)

First published: 09 February 2023

https://doi.org/10.1002/cai2.47



Abstract

Background

To estimate cardiovascular and cancer death rates by regions and time periods.

Design

Novel statistical methods were used to analyze clinical surveillance data.

Methods

A multicenter, population-based medical survey was performed. Annual recorded deaths from cardiovascular diseases were analyzed for all 195 countries of the world. It is challenging to model such data; few mathematical models can be applied because cardiovascular disease and cancer data are generally not normally distributed.

Results

A novel approach to assessing biosystem reliability is introduced and was found to be particularly suitable for analyzing multiregion environmental and healthcare systems. Traditional methods for analyzing temporal observations of multiregion processes do not deal with dimensionality efficiently, whereas our methodology copes with this challenge.

Conclusions

Our novel methodology can be applied to public health and clinical survey data.

Abbreviations

CVD: cardiovascular disease
MDOF: multidegree of freedom

1 BACKGROUND

Cardiovascular disease (CVD) refers to a range of diseases affecting the heart and blood vessels, including hypertension (high blood pressure), coronary heart disease and heart attacks, cerebrovascular diseases (e.g., stroke), heart failure, and various other heart diseases. Cancers are defined by the National Cancer Institute as diseases in which abnormal cells can divide and infiltrate nearby tissues. Cancers can arise in many parts of the body; thus, there is a wide range of cancer types, some of which spread to other parts of the body through the blood and lymph systems. CVD and cancer are the leading causes of death worldwide; therefore, analyzing their bivariate statistics is important. This study is concerned with public health systems rather than health at the level of the individual. The research is not clinical in nature; the goal is to estimate the burden imposed by CVD and cancer on public health systems in different countries at any given time. We analyze mortality data from the literature for both CVDs [1-8] and cancer [9-29].

Assessing the reliability of healthcare systems and estimating excess mortality from CVDs using conventional statistical methods are challenging [30-35]. To achieve the latter goal over large areas, degrees of freedom are typically calculated for random variables governing dynamic biological systems. In principle, the reliability of a complex biological system can be accurately estimated if there are sufficient measurements or by using Monte Carlo simulations. For CVDs and cancers, however, data are scarce before 1990 [30]. Against this background, we introduce a novel method for assessing the reliability of biological and healthcare systems, to aid prediction and management of excess mortality from CVD. This study focused on cross-correlations in CVD and cancer deaths among countries within the same climatic zone. Worldwide health data and related research are readily available online [30].

Lifetime data analysis with the application of extreme value theory is widespread in the fields of medicine and engineering [30]. A recent paper presented the arguments for and against using the upper distribution of life expectancy data [1]. A bivariate lifetime distribution is often assumed when analyzing statistical data [3]. A new approach that uses Clayton, Gumbel, and inverse Gaussian power variance functions, as well as conditional sampling and numerical approximation, was applied for survival analysis [2]. However, few studies have aimed to predict excess CVD and cancer mortality; this paper aims to address this gap.

In this paper, excess mortality from CVD is viewed as an unexpected event that may occur in any country at any time. The nondimensional factor $urn:x-wiley:27709191:media:cai247:cai247-math-0001$ is used to predict CVD risk. Biological systems are influenced by environmental parameters that can be modeled as ergodic processes. The CVD and cancer incidence data for 195 countries during the period 1990–2019 were retrieved [30]. The biological system under consideration herein can be regarded as a multidegree of freedom (MDOF) dynamic system with highly interrelated regional components/dimensions. This study focused on predicting excess mortality rather than symptoms.

2 METHODS

Consider an MDOF biosystem subjected to random ergodic environmental influences. An alternative is to view the process as being dependent on specific environmental parameters whose variation in time may itself be modeled as an ergodic process. The MDOF biomedical response vector process $urn:x-wiley:27709191:media:cai247:cai247-math-0002$ is measured and/or simulated over a sufficiently long time interval $urn:x-wiley:27709191:media:cai247:cai247-math-0003$. Unidimensional global maxima over the entire time span $urn:x-wiley:27709191:media:cai247:cai247-math-0004$ are denoted as $urn:x-wiley:27709191:media:cai247:cai247-math-0005$, $urn:x-wiley:27709191:media:cai247:cai247-math-0006$, $urn:x-wiley:27709191:media:cai247:cai247-math-0007$. By a sufficiently long time $urn:x-wiley:27709191:media:cai247:cai247-math-0008$, one primarily means a large value of $urn:x-wiley:27709191:media:cai247:cai247-math-0009$ with respect to the dynamic system autocorrelation time.

Let $urn:x-wiley:27709191:media:cai247:cai247-math-0010$ be consecutive-in-time local maxima of the bioprocess $urn:x-wiley:27709191:media:cai247:cai247-math-0011$ at monotonically increasing discrete time instants $urn:x-wiley:27709191:media:cai247:cai247-math-0012$ in $urn:x-wiley:27709191:media:cai247:cai247-math-0013$. The analogous definition follows for other MDOF biological system response components $urn:x-wiley:27709191:media:cai247:cai247-math-0014$ with $urn:x-wiley:27709191:media:cai247:cai247-math-0015$ $urn:x-wiley:27709191:media:cai247:cai247-math-0016$, and so on. For simplicity, all $urn:x-wiley:27709191:media:cai247:cai247-math-0017$ components, and therefore their maxima, are assumed to be nonnegative. The aim is to estimate the system failure probability

$urn:x-wiley:27709191:media:cai247:cai247-math-0018$ (1)

with

$urn:x-wiley:27709191:media:cai247:cai247-math-0019$ (2)

being the probability of nonexceedance for the response components' critical values $urn:x-wiley:27709191:media:cai247:cai247-math-0020$, $urn:x-wiley:27709191:media:cai247:cai247-math-0021$, $urn:x-wiley:27709191:media:cai247:cai247-math-0022$, …; $urn:x-wiley:27709191:media:cai247:cai247-math-0023$ denotes the logical union operation «or»; and $urn:x-wiley:27709191:media:cai247:cai247-math-0024$ is the joint probability density of the global maxima over the entire time span $urn:x-wiley:27709191:media:cai247:cai247-math-0025$.

In practice, however, it is not feasible to estimate the latter joint probability distribution $urn:x-wiley:27709191:media:cai247:cai247-math-0026$ directly because of its high dimensionality and the limitations of the available data set. In other words, the system is regarded as having failed immediately at the time instant when any of $urn:x-wiley:27709191:media:cai247:cai247-math-0027$, $urn:x-wiley:27709191:media:cai247:cai247-math-0028$, $urn:x-wiley:27709191:media:cai247:cai247-math-0029$, and so on, exceeds its failure level. Fixed failure levels $urn:x-wiley:27709191:media:cai247:cai247-math-0030$, $urn:x-wiley:27709191:media:cai247:cai247-math-0031$, $urn:x-wiley:27709191:media:cai247:cai247-math-0032$, … are, of course, individual for each unidimensional response component of $urn:x-wiley:27709191:media:cai247:cai247-math-0033$. $urn:x-wiley:27709191:media:cai247:cai247-math-0034$, $urn:x-wiley:27709191:media:cai247:cai247-math-0035$, $urn:x-wiley:27709191:media:cai247:cai247-math-0036$, and so on; see Naess and Gaidai [32] and Naess and Moan [49].

Next, the local maxima time instants $urn:x-wiley:27709191:media:cai247:cai247-math-0037$ are sorted in monotonically nondecreasing order into a single merged synthetic time vector $urn:x-wiley:27709191:media:cai247:cai247-math-0038$. Note that $urn:x-wiley:27709191:media:cai247:cai247-math-0039$, $urn:x-wiley:27709191:media:cai247:cai247-math-0040$. In this case, $urn:x-wiley:27709191:media:cai247:cai247-math-0041$ represents the local maxima of one of the MDOF biosystem response components, either $urn:x-wiley:27709191:media:cai247:cai247-math-0042$, $urn:x-wiley:27709191:media:cai247:cai247-math-0043$, or $urn:x-wiley:27709191:media:cai247:cai247-math-0044$, and so on. That means that, given the $urn:x-wiley:27709191:media:cai247:cai247-math-0045$ time record, one just needs to continuously and simultaneously screen for unidimensional response component local maxima and record each exceedance of the MDOF limit vector $urn:x-wiley:27709191:media:cai247:cai247-math-0046$ in any of its components $urn:x-wiley:27709191:media:cai247:cai247-math-0047$. The local unidimensional response component maxima are merged into one temporally nondecreasing vector $urn:x-wiley:27709191:media:cai247:cai247-math-0048$ in accordance with the merged time vector $urn:x-wiley:27709191:media:cai247:cai247-math-0049$. That is to say, each local maximum $urn:x-wiley:27709191:media:cai247:cai247-math-0050$ is the actual encountered local maximum corresponding to either $urn:x-wiley:27709191:media:cai247:cai247-math-0051$, $urn:x-wiley:27709191:media:cai247:cai247-math-0052$, or $urn:x-wiley:27709191:media:cai247:cai247-math-0053$, and so on.

Finally, the unified limit vector $urn:x-wiley:27709191:media:cai247:cai247-math-0054$ is introduced, with each component $urn:x-wiley:27709191:media:cai247:cai247-math-0055$ being either $urn:x-wiley:27709191:media:cai247:cai247-math-0056$, $urn:x-wiley:27709191:media:cai247:cai247-math-0057$, or $urn:x-wiley:27709191:media:cai247:cai247-math-0058$, and so on, depending on which of $urn:x-wiley:27709191:media:cai247:cai247-math-0059$ or $urn:x-wiley:27709191:media:cai247:cai247-math-0060$ or $urn:x-wiley:27709191:media:cai247:cai247-math-0061$, and so forth, corresponds to the current local maximum with the running index $urn:x-wiley:27709191:media:cai247:cai247-math-0062$.
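
The merging step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `local_maxima` and `merge_scaled_maxima`, the toy series, and the limit values are hypothetical, and dividing each local maximum by the limit of its own component is assumed here as the nondimensionalization.

```python
import numpy as np

def local_maxima(t, x):
    """Return times and values of strict interior local maxima of x sampled at t."""
    t = np.asarray(t, dtype=float)
    x = np.asarray(x, dtype=float)
    # A point is a strict local maximum if it is larger than both neighbors.
    idx = np.where((x[1:-1] > x[:-2]) & (x[1:-1] > x[2:]))[0] + 1
    return t[idx], x[idx]

def merge_scaled_maxima(t, components, limits):
    """Merge local maxima of all components into one time-ordered vector.

    components: dict name -> series (one entry per response component)
    limits:     dict name -> failure limit (hypothetical values, assumption)
    Each maximum is divided by the limit of the component it came from.
    """
    times, values, labels = [], [], []
    for name, series in components.items():
        tm, xm = local_maxima(t, series)
        times.extend(tm)
        values.extend(xm / limits[name])
        labels.extend([name] * len(tm))
    order = np.argsort(times, kind="stable")  # nondecreasing merged time vector
    merged_t = np.asarray(times)[order]
    merged_r = np.asarray(values)[order]
    merged_labels = [labels[i] for i in order]
    return merged_t, merged_r, merged_labels

# Toy two-component example (illustrative numbers only).
t = np.arange(8)
components = {
    "X": [0, 2, 1, 3, 1, 0, 1, 0],
    "Y": [1, 0, 4, 2, 2, 5, 1, 0],
}
limits = {"X": 4.0, "Y": 10.0}
merged_t, merged_r, merged_labels = merge_scaled_maxima(t, components, limits)
```

In this toy case the merged vector alternates between the two components, and a single scalar threshold applied to `merged_r` screens all components at once.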

Next, a scaling parameter $urn:x-wiley:27709191:media:cai247:cai247-math-0063$ is introduced to artificially and simultaneously decrease the limit values for all biosystem response components; namely, the new MDOF limit vector $urn:x-wiley:27709191:media:cai247:cai247-math-0064$ with $urn:x-wiley:27709191:media:cai247:cai247-math-0065$, $urn:x-wiley:27709191:media:cai247:cai247-math-0066$, $urn:x-wiley:27709191:media:cai247:cai247-math-0067$, … is introduced. The unified limit vector $urn:x-wiley:27709191:media:cai247:cai247-math-0068$ is introduced, with each component $urn:x-wiley:27709191:media:cai247:cai247-math-0069$ being either $urn:x-wiley:27709191:media:cai247:cai247-math-0070$, $urn:x-wiley:27709191:media:cai247:cai247-math-0071$, or $urn:x-wiley:27709191:media:cai247:cai247-math-0072$, and so on. The latter automatically defines probability $urn:x-wiley:27709191:media:cai247:cai247-math-0073$ as a function of $urn:x-wiley:27709191:media:cai247:cai247-math-0074$; note that $urn:x-wiley:27709191:media:cai247:cai247-math-0075$ from Equation (1). Nonexceedance probability $urn:x-wiley:27709191:media:cai247:cai247-math-0076$ can now be estimated as follows:

$urn:x-wiley:27709191:media:cai247:cai247-math-0078$ (3)

In practice, the dependency between neighboring $urn:x-wiley:27709191:media:cai247:cai247-math-0079$ values is not always negligible; thus, the following one-step (i.e., “conditioning level”; $urn:x-wiley:27709191:media:cai247:cai247-math-0080$ ) memory approximation is introduced

$urn:x-wiley:27709191:media:cai247:cai247-math-0081$ (4)

for $urn:x-wiley:27709191:media:cai247:cai247-math-0082$ (called here conditioning level $urn:x-wiley:27709191:media:cai247:cai247-math-0083$ ). The approximation introduced by Equation (4) may be further expressed as

$urn:x-wiley:27709191:media:cai247:cai247-math-0084$ (5)

where $urn:x-wiley:27709191:media:cai247:cai247-math-0085$ (called conditioning level $urn:x-wiley:27709191:media:cai247:cai247-math-0086$ ), and so on. The motivation is to monitor each independent failure that occurs locally first in time, thus avoiding cascading, locally intercorrelated exceedances [36-48].
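
The idea of conditioning on preceding non-exceedances can be sketched as a simple counting estimate. The sketch below assumes that the conditional quantity at conditioning level k is the probability that a merged local maximum exceeds the threshold given that the previous k − 1 maxima did not; the function name `conditional_exceedance` and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np

def conditional_exceedance(r, lam, k):
    """Counting estimate of P(R_j > lam | previous k-1 values all <= lam).

    r:   merged, nondimensional local maxima (time ordered)
    lam: threshold level
    k:   conditioning level (k = 1 means no conditioning)
    Assumed form of the conditional probability; a sketch only.
    """
    r = np.asarray(r, dtype=float)
    exceed = r > lam
    hits = 0    # exceedances preceded by k-1 non-exceedances
    trials = 0  # positions preceded by k-1 non-exceedances
    for j in range(k - 1, len(r)):
        if not exceed[j - k + 1:j].any():
            trials += 1
            hits += int(exceed[j])
    return hits / trials if trials else float("nan")

# Toy merged maxima with clustered exceedances of the level 0.75.
r = [0.2, 0.9, 0.95, 0.3, 0.1, 0.8, 0.4, 0.85, 0.9, 0.2]
p1 = conditional_exceedance(r, 0.75, k=1)  # plain exceedance fraction
p2 = conditional_exceedance(r, 0.75, k=2)  # only first exceedances of a cluster count
```

With k = 2, an exceedance that immediately follows another exceedance is no longer counted, which is one way to keep a cascade of correlated exceedances from being treated as several independent failures.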

Equation (5) presents subsequent refinements of the statistical independence assumption. This type of approximation captures the statistical dependence between neighboring maxima with increased accuracy. Since the original MDOF bioprocess $urn:x-wiley:27709191:media:cai247:cai247-math-0087$ was assumed ergodic and therefore stationary, probability $urn:x-wiley:27709191:media:cai247:cai247-math-0088$ for $urn:x-wiley:27709191:media:cai247:cai247-math-0089$ is independent of $urn:x-wiley:27709191:media:cai247:cai247-math-0090$ and depends only on conditioning level $urn:x-wiley:27709191:media:cai247:cai247-math-0091$. Thus, the nonexceedance probability can be approximated as in the Naess–Gaidai method, see [32, 49], where:

$urn:x-wiley:27709191:media:cai247:cai247-math-0092$ (6)

Note that Equation (6) follows from Equation (1) by neglecting $urn:x-wiley:27709191:media:cai247:cai247-math-0093$, as the design failure probability is usually very small. Further, it is assumed that $urn:x-wiley:27709191:media:cai247:cai247-math-0094$. Note that Equation (5) is similar to the well-known mean up-crossing rate equation for the probability of exceedance [32, 49]. Convergence is observed with respect to the conditioning parameter $urn:x-wiley:27709191:media:cai247:cai247-math-0095$:

$urn:x-wiley:27709191:media:cai247:cai247-math-0096$ (7)

Note that Equation (6) for $urn:x-wiley:27709191:media:cai247:cai247-math-0097$ turns into the well-known relationship between the nonexceedance probability and the mean up-crossing rate function

$urn:x-wiley:27709191:media:cai247:cai247-math-0098$ (8)

where $urn:x-wiley:27709191:media:cai247:cai247-math-0099$ is the mean up-crossing rate of the response level $urn:x-wiley:27709191:media:cai247:cai247-math-0100$ for the nondimensional vector $urn:x-wiley:27709191:media:cai247:cai247-math-0101$ assembled above from the scaled MDOF biosystem response $urn:x-wiley:27709191:media:cai247:cai247-math-0102$. The proposed methodology can also treat nonstationary cases, as illustrated in the following. Consider a scatter diagram of $urn:x-wiley:27709191:media:cai247:cai247-math-0103$ bioenvironmental states, with each short-term bioenvironmental state having probability $urn:x-wiley:27709191:media:cai247:cai247-math-0104$ so that $urn:x-wiley:27709191:media:cai247:cai247-math-0105$. The corresponding long-term equation is then

$urn:x-wiley:27709191:media:cai247:cai247-math-0106$ (9)

with $urn:x-wiley:27709191:media:cai247:cai247-math-0107$ being the same function as in Equation (7) but corresponding to a specific short-term environmental state with the number $urn:x-wiley:27709191:media:cai247:cai247-math-0108$ . Note that this statistical model has already been validated [47, 50-52].
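
The long-term combination over short-term states can be sketched numerically. Two assumptions are made here and should be read as such: that the long-term function is the probability-weighted sum of the short-term functions, and that the nonexceedance probability takes the exponential form exp(−N·p), with N the number of local maxima. The function names and all numbers are hypothetical.

```python
import math

def long_term_rate(short_term_rates, state_probs):
    """Probability-weighted sum of short-term conditional exceedance values.

    short_term_rates: value of the short-term function for each state m
    state_probs:      probability q_m of each state (must sum to 1)
    Assumed form of the long-term equation; a sketch only.
    """
    if len(short_term_rates) != len(state_probs):
        raise ValueError("one rate per state is required")
    if abs(sum(state_probs) - 1.0) > 1e-9:
        raise ValueError("state probabilities must sum to 1")
    return sum(p * q for p, q in zip(short_term_rates, state_probs))

def nonexceedance_probability(n_maxima, rate):
    """Assumed exponential approximation P = exp(-N * p)."""
    return math.exp(-n_maxima * rate)

# Two hypothetical short-term states at one threshold level.
rates = [1.0e-3, 4.0e-3]
probs = [0.75, 0.25]
p_long = long_term_rate(rates, probs)
P = nonexceedance_probability(1000, p_long)
failure_probability = 1.0 - P
```

Under these assumptions, rare but severe states can dominate the long-term value even when their probability is small, because each state contributes in proportion to both its probability and its short-term exceedance level.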

3 RESULTS

Prediction of CVD and cancer has long been a target in the fields of epidemiology and mathematical biology. Public health systems are dynamic, highly nonlinear, multidimensional, and spatially diverse systems that are challenging to analyze. Previous studies have used a variety of approaches to predict CVD and cancer cases. In this section, the above-described methodology is applied to real-world CVD data sets for all countries of the world.

The statistical data in the present section are from the “Our World in Data” website [30], which provides annual CVD death rates for all countries for the period 1990–2019. The death rates for the 195 countries (components $urn:x-wiley:27709191:media:cai247:cai247-math-0109$ ) constitute 195-dimensional (195D) data for a dynamic biological system.

General failure limits ( $urn:x-wiley:27709191:media:cai247:cai247-math-0110$ ), that is, CVD thresholds, are less intuitive than failure limits set individually for each country according to its population, such that $urn:x-wiley:27709191:media:cai247:cai247-math-0111$ are equal to the annual death rate of a given country. The death rate for cancer is lower than that for CVD, but it is typically more painful to die from cancer. In this paper, the “failure limit” for cancer is lowered fourfold to match that for CVD.

Next, the local maxima from all nondimensionalized time series data are merged into a single time series using Equation (5):

$urn:x-wiley:27709191:media:cai247:cai247-math-0112$ (10)

Each maximum, such as $urn:x-wiley:27709191:media:cai247:cai247-math-0113$, is inserted into the single time series according to its temporal occurrence (denoted by subscript $urn:x-wiley:27709191:media:cai247:cai247-math-0114$ ).

Figure 1 presents the annual deaths from CVD and cancer by country and year. Figure 2 presents the number of new deaths as a 195D vector $urn:x-wiley:27709191:media:cai247:cai247-math-0115$ . Data for Uzbekistan were excluded from the analysis because they were regarded as outliers. $urn:x-wiley:27709191:media:cai247:cai247-math-0116$ was assembled from different regional components, that is, CVD data sets. Index $urn:x-wiley:27709191:media:cai247:cai247-math-0117$ is a running index of local maxima encountered in the “non-decreasing” time series.

**Figure 1**

Annual deaths from cardiovascular disease and cancer as a percentage of the population for 195 countries.

**Figure 2**

Left: Cross-correlations between cardiovascular disease (CVD) and cancer cases as a percentage of the population. Right: Annual death rates as a 195-dimensional vector $urn:x-wiley:27709191:media:cai247:cai247-math-0118$ , as a percentage of the population of the corresponding country. The cancer rate was increased fourfold to match that of CVD.

Overall, there is a clear East–West divide in the CVD death rates. Rates across North America and Western/Northern Europe tended to be lower than those across Eastern Europe, Asia, and Africa. For most of Latin America, the rates were moderate. As an example, in France, the age-standardized CVD death rate was around 86 per 100,000 in 2017, while across Eastern Europe, it was around five times higher (400–500 per 100,000). Uzbekistan had the highest rate of 724 per 100,000.

Figure 3 presents the predicted annual CVD death rates (percentage relative to the entire population of a given country) over 100 years, extrapolated from Equation (10). $urn:x-wiley:27709191:media:cai247:cai247-math-0119$ was used as a cut-off value. The 95% confidence intervals (CIs) were calculated. According to Equation (5), $urn:x-wiley:27709191:media:cai247:cai247-math-0120$ is directly related to the target failure probability ( $urn:x-wiley:27709191:media:cai247:cai247-math-0121$ ) derived from Equation (1). Therefore, system failure probability can be estimated as $urn:x-wiley:27709191:media:cai247:cai247-math-0122$ . Note that, in Equation (6), $urn:x-wiley:27709191:media:cai247:cai247-math-0123$ corresponds to the total number of local maxima in response vector $urn:x-wiley:27709191:media:cai247:cai247-math-0124$ . Conditioning parameter $urn:x-wiley:27709191:media:cai247:cai247-math-0125$ was found to be sufficient because of the convergence of $urn:x-wiley:27709191:media:cai247:cai247-math-0126$ (see Equation 6). In Figure 3, the 95% CIs are relatively narrow, which represents an advantage of the proposed method. Table 1 compares 100-year predictions based on data for 15- and 30-year periods. The 15-year data set was derived from the full 30-year data set by omitting odd years. The 95% CIs were wider for the truncated data set, as expected.

**Figure 3**

Death rate predictions over 100 years extrapolated from $urn:x-wiley:27709191:media:cai247:cai247-math-0127$ . The critical level is indicated by a star. The 95% confidence intervals are indicated by dotted lines. The percentage of the population is represented by the horizontal axis. Left: Predictions based on 30 years of data; Right: predictions based on 15 years of data.

Table 1. Predicted cardiovascular disease death rates over 100 years based on 30- and 15-year data sets.

| Data set | Predicted death rate (%) | 95% CI, lower bound | 95% CI, upper bound |
| --- | --- | --- | --- |
| 30-year data set | 0.942 | 0.909 | 0.966 |
| 15-year data set | 0.914 | 0.879 | 0.949 |

Abbreviation: CI, confidence interval.
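
The comparison in Table 1 can be checked with a few lines of Python. The sketch uses only the values reported in the table; the construction of the 15-year subset by keeping even years of 1990–2019 follows the description in the text, and the variable names are illustrative.

```python
# Values from Table 1: (predicted rate, CI lower bound, CI upper bound), in %.
table = {
    "30-year": (0.942, 0.909, 0.966),
    "15-year": (0.914, 0.879, 0.949),
}

# Width of each 95% confidence interval, rounded to the table's precision.
widths = {
    name: round(upper - lower, 3)
    for name, (_, lower, upper) in table.items()
}

# Relative difference between the two point predictions, in percent.
rel_diff_percent = round(
    (table["30-year"][0] - table["15-year"][0]) / table["30-year"][0] * 100, 1
)

# The truncated data set: omit odd years from the 1990-2019 record.
years = list(range(1990, 2020))
even_years = [y for y in years if y % 2 == 0]
```

The interval widths are 0.057 for the 30-year data set and 0.070 for the 15-year data set, and the two point predictions differ by about 3%, which matches the statement that halving the record widens the confidence interval without shifting the prediction much.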

The predicted average annual CVD death rate over the next 100 years, across all years and countries, was found to be below 1%. Our methodology uses available data efficiently by assuming that healthcare system data sets are multidimensional and extrapolates death rates even when the data set is relatively limited. The predicted nondimensional factor $urn:x-wiley:27709191:media:cai247:cai247-math-0128$, indicated by the star in Figure 3, represents the probability of excess CVD mortality for any given country. Our method could be applied to predict cancer clusters, rather than merely death rates over time, which would be of high practical importance.

4 CONCLUSIONS

Traditional methods for assessing the reliability of healthcare systems on the basis of time series data do not efficiently deal with systems characterized by high dimensionality and cross-correlations. The main advantage of our methodology is its ability to assess the reliability of high-dimensional nonlinear dynamic systems. Despite its simplicity, the novel multidimensional modeling strategy introduced herein can be used for accurate forecasting of CVD death rates in individual countries.

We analyzed 195D data, that is, CVD and cancer death rates for 195 countries worldwide, for the period 1990–2019. A novel method for analyzing the reliability of a multidimensional biosystem was applied and the mechanisms of the proposed method were described in detail. Direct measurements and Monte Carlo simulations are both suitable for assessing the reliability of dynamic biological systems; however, the complexity and high dimensionality of such systems necessitate the further development of robust and accurate techniques that can use limited data sets in an efficient manner.

This study predicted an average annual death rate for CVD over a 100-year period of about 1% across countries and years. Under current national health management approaches, CVDs will continue to represent a threat to the health of the world population.

This study introduced a general-purpose, robust, and easy-to-apply method for analyzing the reliability of multidimensional systems. The method has previously been validated by application to a wide range of simulation models but only in the context of one-dimensional systems; in general, highly accurate predictions were obtained. Both measurement and numerically simulated time series data can be analyzed. Applying the method to the data set used in this study yielded reasonable confidence intervals, indicating that it could serve as a useful tool for reliability studies of various nonlinear dynamic biological systems. Finally, the suggested methodology has many potential public health applications beyond the prediction of CVD death rates.

AUTHOR CONTRIBUTIONS

Oleg Gaidai: Conceptualization (equal). Yihan Xing: Validation (equal). Rajiv Balakrishna: Investigation (equal). Jiayao Sun: Methodology (equal). Xiaolong Bai: Validation (equal).

ACKNOWLEDGMENTS

None.

CONFLICT OF INTEREST STATEMENT

The authors declare no conflict of interest.

ETHICS STATEMENT

Not applicable.

INFORMED CONSENT

Not applicable.

Open Research

DATA AVAILABILITY STATEMENT

Data sets analyzed during the current study are available online at https://ourworldindata.org/causes-of-death (“Our World in Data” [30]).

REFERENCES

1 Cox B, Vangronsveld J, Nawrot TS. Impact of stepwise introduction of smokefree legislation on population rates of acute myocardial infarction deaths in Flanders, Belgium. Heart. 2014; 100(18): 1430–5. https://doi.org/10.1136/heartjnl-2014-305613
2 Tsao CW, Aday AW, Almarzooq ZI, Alonso A, Beaton AZ, Bittencourt MS, et al. Heart disease and stroke statistics—2022 update: a report from the American Heart Association. Circulation. 2022; 145(8): e153–639. https://doi.org/10.1161/CIR.0000000000001052
3 Smolina K, Wright FL, Rayner M, Goldacre MJ. Determinants of the decline in mortality from acute myocardial infarction in England between 2002 and 2010: linked national database study. BMJ. 2012; 344: d8059. https://doi.org/10.1136/bmj.d8059
4 Balakrishna R, Bjørnerud T, Bemanian M, Aune D, Fadnes LT. Consumption of nuts and seeds and health outcomes including cardiovascular disease, diabetes and metabolic disease, cancer, and mortality: an umbrella review. Adv Nutr. 2022; 13(6): 2136–48. https://doi.org/10.1093/advances/nmac077
5 Mackay DF, Irfan MO, Haw S, Pell JP. Meta-analysis of the effect of comprehensive smoke-free legislation on acute coronary events. Heart. 2010; 96(19): 1525–30. https://doi.org/10.1136/hrt.2010.199026
6 Alzuhairi KS, Søgaard P, Ravkilde J, Gislason G, Køber L, Torp-Pedersen C. Incidence and outcome of first myocardial infarction according to gender and age in Denmark over a 35-year period (1978–2012). Eur Heart J Qual Care Clin Outcomes. 2015; 1(2): 72–8. https://doi.org/10.1093/ehjqcco/qcv016
7 Mirzaei M, Truswell AS, Taylor R, Leeder SR. Coronary heart disease epidemics: not all the same. Heart. 2009; 95: 740–6. https://doi.org/10.1136/hrt.2008.154856
8 NCDRF Collaboration. Trends in adult body-mass index in 200 countries from 1975 to 2014: a pooled analysis of 1698 population-based measurement studies with 19.2 million participants. Lancet. 2016; 387(10026): 1377–96. https://doi.org/10.1016/S0140-6736(16)30054-X
9 Siegel R, Miller K, Fuchs H, Jemal A. Cancer statistics. CA Cancer J Clin. 2022; 72(1): 7–33. https://doi.org/10.3322/caac.21708
10 Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: North American Association of Central Cancer Registries (NAACCR) Incidence Data—Cancer in North America Analytic File, 1995–2018, With Race/Ethnicity, Custom File With County, American Cancer Society Facts and Figures Projection Project (which includes data from the Center for Disease Control and Prevention's National Program of Cancer Registries, the Canadian Council of Cancer Registries' Provincial and Territorial Registries, and the National Cancer Institute's SEER Registries, certified by the NAACCR as meeting high-quality incidence data standards for the specified time periods). National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program; Atlanta, Georgia, 2021.
11 R Sherman, R Firth, M Charlton, P De, D Prithwish, D Green, et al., editors. Cancer in North America: 2014–2018. Volume two: registry-specific cancer incidence in the United States and Canada. North American Association of Central Cancer Registries Inc.; Springfield, Illinois, 2021.
12 Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: mortality—all causes of death, total U.S. (1969–2019)—Katrina/Rita population adjustment—linked to county attributes—total U.S., 1969–2019 Counties (underlying mortality data provided by the National Center for Health Statistics). Springfield, Illinois: National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program; 2021.
13 Wingo PA, Cardinez CJ, Landis SH, Greenlee RT, Ries LAG, Anderson RN, et al. Long-term trends in cancer mortality in the United States, 1930–1998. Cancer. 2003; 97(12 suppl): 3133–275. https://doi.org/10.1002/cncr.11380
14 A Fritz, C Percy, A Jack, K Shanmugaratnam, L Sobin, DM Parkin, et al., editors. International classification of diseases for oncology. 3rd ed. Geneva, Switzerland: World Health Organization; 2000.
15 World Health Organization (WHO). International statistical classification of diseases and related health problems, 10th revision. Vol. I-III. Geneva, Switzerland: WHO; 2011.
16 Surveillance Research Program. SEER*Stat software, version 8.3.8. Bethesda, Maryland: National Cancer Institute; 2020.
17 Surveillance Research Program. Joinpoint Regression Program version 4.9.0.1. Bethesda, Maryland: National Cancer Institute, Statistical Research and Applications Branch; 2021.
18 Mariotto AB, Zou Z, Johnson CJ, Scoppa S, Weir HK, Huang B. Geographical, racial and socio-economic variation in life expectancy in the US and their impact on cancer relative survival. PLoS One. 2018; 13(7): e0201034. https://doi.org/10.1371/journal.pone.0201034
19 Clegg LX, Feuer EJ, Midthune DN, Fay MP, Hankey BF. Impact of reporting delay and reporting error on cancer incidence rates and trends. J Natl Cancer Inst. 2002; 94(20): 1537–45. https://doi.org/10.1093/jnci/94.20.1537
20 Yabroff KR, Wu XC, Negoita S, Stevens J, Coyle L, Zhao J, et al. Association of the COVID-19 pandemic with patterns of statewide cancer services. J Natl Cancer Inst. 2021; 114(6): 907–9.
21 Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: incidence—SEER 9 registries research data with delay—adjustment, malignant only, November 2020 submission (1975–2018)—Katrina/Rita population adjustment—linked to county attributes—total U.S., 1969–2018 counties. Bethesda, Maryland: National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program, Surveillance Systems Branch; 2021.
Google Scholar
22 Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: incidence—SEER 18 registries research data + Hurricane Katrina impacted Louisiana cases, November 2020 submission (2000–2018)—Katrina/Rita population adjustment—linked to county attributes—total U.S., 1969–2018 counties. Bethesda, Maryland: National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program, Surveillance Systems Branch; 2021.
Google Scholar
23 Surveillance Research Program. SEER*Explorer: an interactive website for SEER cancer statistics. Bethesda, Maryland: National Cancer Institute; 2021. https://seer.cancer.gov/explorer/. Accessed 15 April 2021.
Google Scholar
24 Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: incidence—SEER research limited—field data with delay—adjustment, 21 registries, malignant only, November 2020 submission (2000–2018)—linked to county attributes—time dependent (1990–2018) income/rurality, 1969–2019 counties. Bethesda, Maryland: National Cancer Institute, Division of Cancer Control and Population Sciences, Surveillance Research Program; 2021.
Google Scholar
25 Surveillance Research Program, Statistic Methodology and Applications. DevCan: probability of developing or dying of cancer software. Version 6.7.9. Bethesda, Maryland: National Cancer Institute; 2021.
Google Scholar
26Murphy SL, Kochanek KD, Xu J, Heron M. Deaths: final data for 2012. National Vital Statistics Reports. Vol. 63, No. 9. Hyattsville, Maryland: National Center for Health Statistics; 2015.
Google Scholar
27Steliarova-Foucher E, Stiller C, Lacour B, Kaatsch P. International classification of childhood cancer, third edition. Cancer. 2005; 103(7): 1457–67. https://doi.org/10.1002/cncr.20910
Google Scholar
28Gumbel E. Statistics of extremes. New York: Columbia University Press; 1958.
Google Scholar
29 R Sherman, R Firth, M Charlton, P De, D Green, B Hofer, et al., editors. Cancer in North America: 2014–2018. Volume one: combined cancer incidence for the United States, Canada and North America. North American Association of Central Cancer Registries Inc.; 2021.
Google Scholar
30Ritchie H, Spooner F, Roser M. Causes of death. https://ourworldindata.org/causes-of-death
Google Scholar
31Choi S-K, Grandhi RV, Canfield RA. Reliability-based structural design. London: Springer-Verlag; 2007.
Google Scholar
32Naess A, Gaidai O. Estimation of extreme values from sampled time series. Struct Saf. 2009; 31(4): 325–34. https://doi.org/10.1016/j.strusafe.2008.06.021
Google Scholar
33Madsen HO, Krenk S, Lind NC. Methods of structural safety. Englewood Cliffs: Prentice-Hall Inc.; 1986.
Google Scholar
34Ditlevsen O, Madsen HO. Structural reliability methods. Chichester (World): John Wiley & Sons Inc.; 1996.
Google Scholar
35Melchers RE. Structural reliability analysis and prediction. New York: John Wiley & Sons Inc.; 1999.
Google Scholar
36Xing Y, Gaidai O, Ma Y, Naess A, Wang F. A novel design approach for estimation of extreme responses of a subsea shuttle tanker hovering in ocean current considering aft thruster failure. Appl Ocean Res. 2022; 123:103179. https://doi.org/10.1016/j.apor.2022.103179
Google Scholar
37Gaidai O, Wang F, Wu Y, Xing Y, Medina AR, Wang J. Offshore renewable energy site correlated wind-wave statistics. Probabilistic Eng Mech. 2022; 68:103207. https://doi.org/10.1016/j.probengmech.2022.103207
Google Scholar
38Xu X, Xing Y, Gaidai O, Wang K, Sandipkumar Patel K, Dou P, et al. A novel multi-dimensional reliability approach for floating wind turbines under power production conditions. Front Marine Sci. 2022; 9:970081. https://doi.org/10.3389/fmars.2022.970081
Google Scholar
39Sun J, Gaidai O, Wang F, Naess A, Wu Y, Xing Y, et al. Extreme riser experimental loads caused by sea currents in the Gulf of Eilat. Probabilistic Eng Mech. 2022; 68:103243. https://doi.org/10.1016/j.probengmech.2022.103243
Google Scholar
40Xu X, Wang F, Gaidai O, Naess A, Xing Y, Wang J. Bivariate statistics of floating offshore wind turbine dynamic response under operational conditions. Ocean Eng. 2022; 257:111657. https://doi.org/10.1016/j.oceaneng.2022.111657
Google Scholar
41Gaidai O, Xing Y, Wang F, Wang S, Yan P, Naess A. Improving extreme anchor tension prediction of a 10-MW floating semi-submersible type wind turbine, using highly correlated surge motion record. Front Mech Eng. 2022; 8:888497. https://doi.org/10.3389/fmech.2022.888497
Google Scholar
42Gaidai O, Xing Y, Xu X. COVID-19 epidemic forecast in USA East coast by novel reliability approach. Res Sq. 2022. https://doi.org/10.21203/rs.3.rs-1573862/v1
Google Scholar
43Gaidai O, Xing Y, Balakrishna R. Improving extreme response prediction of a subsea shuttle tanker hovering in ocean current using an alternative highly correlated response signal. Results Eng. 2022; 15:100593. https://doi.org/10.1016/j.rineng.2022.100593
Google Scholar
44Cheng Y, Gaidai O, Yurchenko D, Xu X, Gao S. The 32nd International Ocean and Polar Engineering Conference, paper number: ISOPE-I-22-342, Shanghai, China. 2022.
Google Scholar
45Gaidai O, Storhaug G, Wang F, Yan P, Naess A, Wu Y, et al. On-Board Trend Analysis for Cargo Vessel Hull Monitoring Systems. The 32nd International Ocean and Polar Engineering Conference, paper number: ISOPE-I-22-541, Shanghai, China. 2022.
Google Scholar
46Gaidai O, Xu X, Naess A, Cheng Y, Ye R, Wang J. Bivariate statistics of wind farm support vessel motions while docking. Sh Offshore Struct. 2020; 16(2): 135–43. https://doi.org/10.1080/17445302.2019.1710936
Google Scholar
47Gaidai O, Fu S, Xing Y. Novel reliability method for multidimensional nonlinear dynamic systems. Mar Struct. 2022; 86:103278. https://doi.org/10.1016/j.marstruc.2022.103278
Google Scholar
48Gaidai O, Xu J, Yan P, Xing Y, Wu Y, Zhang F. Novel methods for wind speeds prediction across multiple locations. Sci Rep. 2022; 12:19614. https://doi.org/10.1038/s41598-022-24061-4
Google Scholar
49Naess A, Moan T. Stochastic dynamics of marine structures. New York: Cambridge University Press; 2013.
Google Scholar
50Gaidai O, Xing Y. A novel multi regional reliability method for COVID-19 death forecast. Eng Sci. 2022. https://doi.org/10.30919/es8d799
Google Scholar
51Gaidai O, Yihan Y. A novel bio-system reliability approach for multi-state COVID-19 epidemic forecast. Eng Sci. 2022. https://doi.org/10.30919/es8d797
Google Scholar
52Gaidai O, Yan P, Xing Y, Xu J, Wu Y. A novel statistical method for long-term coronavirus modelling. F1000 Res. 2022; 11:1282.
Google Scholar

Volume 2, Issue 2

April 2023

Pages 140-147
