Volume 2025, Issue 1 8980195

Research Article

Open Access

Exploring the Causality of Accident Severity on Mountainous Freeways With a Two-Stage Approach

Lingzhi Kong

National Engineering Research Center of Geological Disaster Prevention in Land Transportation , Broadvision Engineering Consultants Co., Ltd. , Kunming , China

Search for more papers by this author

Changan Xiong,

Changan Xiong

National Engineering Research Center of Geological Disaster Prevention in Land Transportation , Broadvision Engineering Consultants Co., Ltd. , Kunming , China

Yunnan Key Laboratory of Digital Communications , Kunming , China

Search for more papers by this author

Wenchen Yang,

Wenchen Yang

orcid.org/0000-0002-7855-9336

National Engineering Research Center of Geological Disaster Prevention in Land Transportation , Broadvision Engineering Consultants Co., Ltd. , Kunming , China

Yunnan Key Laboratory of Digital Communications , Kunming , China

Search for more papers by this author

Weiliang Zeng,

Corresponding Author

Weiliang Zeng

[email protected]

orcid.org/0000-0002-0046-8485

School of Automation , Guangdong University of Technology , Guangzhou , China , gdut.edu.cn

Joint International Research Laboratory of Intelligent Information Processing and System Integration of IoT , Ministry of Education , Guangzhou , China , meb.gov.tr

Guangdong Provincial Key Laboratory of Intelligent Transportation System , Guangzhou , China

Guangdong Provincial Key Laboratory of Intelligent Systems and Optimization Integration , Guangzhou , China

Search for more papers by this author

Lingzhi Kong,

Lingzhi Kong

National Engineering Research Center of Geological Disaster Prevention in Land Transportation , Broadvision Engineering Consultants Co., Ltd. , Kunming , China

Search for more papers by this author

Changan Xiong,

Changan Xiong

National Engineering Research Center of Geological Disaster Prevention in Land Transportation , Broadvision Engineering Consultants Co., Ltd. , Kunming , China

Yunnan Key Laboratory of Digital Communications , Kunming , China

Search for more papers by this author

Wenchen Yang,

Wenchen Yang

orcid.org/0000-0002-7855-9336

National Engineering Research Center of Geological Disaster Prevention in Land Transportation , Broadvision Engineering Consultants Co., Ltd. , Kunming , China

Yunnan Key Laboratory of Digital Communications , Kunming , China

Search for more papers by this author

Weiliang Zeng,

Corresponding Author

Weiliang Zeng

[email protected]

orcid.org/0000-0002-0046-8485

School of Automation , Guangdong University of Technology , Guangzhou , China , gdut.edu.cn

Joint International Research Laboratory of Intelligent Information Processing and System Integration of IoT , Ministry of Education , Guangzhou , China , meb.gov.tr

Guangdong Provincial Key Laboratory of Intelligent Transportation System , Guangzhou , China

Guangdong Provincial Key Laboratory of Intelligent Systems and Optimization Integration , Guangzhou , China

Search for more papers by this author

First published: 04 June 2025

https://doi.org/10.1155/atr/8980195

Academic Editor: Yajie Zou

Share a link

Email
Wechat
Bluesky

Abstract

Studies on accident severity on mountainous freeways have predominantly centered on the personal injury level, rather than the aggregation level. However, for quantifying the accident causality, clustering the accident severity from multidimensional perspectives based on data-driven approach is seldom investigated in existing studies. To address this research gap, we propose a two-stage methodology that integrates accident clustering with Bayesian inference. Initially, a Gaussian mixture clustering algorithm is developed to categorize accident severity. Subsequently, a Bayesian network is constructed to explore the risk factors associated with accident severity. The proposed model is calibrated and validated using accident data collected from mountainous freeways in Yunnan Province, China, spanning the period from 2016 to 2021. The findings suggest that our proposed accident clustering method exhibits superior robustness compared to alternative clustering techniques. Bayesian inference analysis further elucidates that accident severity is significantly influenced by factors such as driving behavior, weather conditions, and road surface conditions.

1. Introduction

Mountainous freeways globally have been plagued by fatal traffic accidents. Notably, between 2012 and 2021, more than 50% of severe traffic accidents in China occurred on mountainous freeways [1]. As assisted driving and semiautonomous driving technologies, such as adaptive cruise control, automatic lane changing, and emergency braking systems, continue to advance, the safety of mountainous expressways is poised to garner significant attention [2, 3]. Consequently, investigating the causal relationship between the severity of accidents on mountainous expressways and their contributing factors holds substantial significance.

In contrast to urban highways, mountainous freeways are characterized by challenging terrain, featuring numerous sharp curves and steep downgrade segments [4]. The intricate topography of mountainous freeways necessitates drivers to continuously adjust their speed and steering, thereby elevating the risk of driving errors. For instance, on a hairpin bend of a mountain highway, a driver’s miscalculation of the turning radius can readily precipitate a rollover accident. Research has demonstrated that the complex roadway conditions can divert drivers’ attention from normal driving behaviors [5]. Furthermore, unlike the meteorological conditions prevalent in urban road networks, mountainous regions are more susceptible to diverse and rapidly changing weather phenomena, including fog, rain, snow, and strong winds. These adverse weather conditions can substantially impair visibility and render road surfaces slippery, thereby increasing braking distances and compromising vehicle stability. In urban settings, the impact of weather on driving is comparatively mitigated due to the relatively flat terrain and the moderating effects of urban heat islands. Consequently, it is necessary to investigate the causality of risk factors resulting in mountainous freeway traffic accidents in order to take appropriate and effective countermeasures to prevent them.

The accident severity was usually defined at the individual level in terms of the injuries sustained by the driver, the injuries sustained by traffic occupants in the accident or the highest injury to individual occupants [6]. However, these individual-level measures may be not a good indicator to reveal the true nature of the accidents. Since an accident may include casualties, property damage, and traffic jam, it is not reasonable to classify the accident severity level merely based on the individual injury. Some studies identified the accident severity level based on predefined rules that integrate individual level to crash level [7, 8]. However, such subjective integration methods are not easy to generalize without expert knowledge.

Given the discrete nature of accident data, discrete outcome models, such as logit and probit models, are usually employed to analyze the relationship between potential risk factors and accident severity [9–11]. Because traditional logit and probit models assumed that all parameters are fixed [12], they are unable to capture unobserved heterogeneity. Furthermore, the explanatory variables such as road surface condition (wet or not wet) and weather in the crash databases usually have intuitive association, which may not fulfill the noncollinearity requirement for independent variables in parametric modeling.

In this study, we conduct an in-depth analysis on the causality of accident severity of mountainous freeways in Yunnan Province of China to find out what and how driving factors act on the severity level, and finally make an accident prevention policy. The contribution of this study is as follows:

1.
A data-driven clustering approach is proposed to classify the traffic accident severity from multidimensional perspectives instead of the most serious personal injury, since an accident may include casualties, injuries, property damage, and traffic jam. Specifically, a Gaussian mixture clustering method is proposed to classify accident severity levels, which will provide an objective tool for specifying the severity level for arbitrary dataset without using predefined rules.
2.
A Bayesian network (BN) model is proposed to quantify the uncertain traffic safety factors and capture complex dependencies between accident severity and multiple risk factors. Specifically, the BN model provides probabilistic reasoning, which can quantify the likelihood of different accident severities under various conditions such as fatigue driving, weather, alignment, and surface conditions. Furthermore, one- and two-dimensional inference analyses are proposed to examine the magnitude of the effects of various risk factors contributing to the severity of collision injuries.

The rest of this paper is structured as follows. Section 2, following the introduction, offers a brief literature review related to the definition of accident severity, risk factors, and methodological approaches for accident analysis. Section 3 describes the data used for this study, followed by Section 4 on the analytical approach. Section 5 presents the results and the insight of accident severity analysis. Finally, research conclusions and future work are outlined in Section 6.

2. Literature Review

2.1. Definition of Accident Severity in Accidents

The accident severity is usually recorded at the individual level in terms of driver injury, the highest injury to passengers or vehicle occupants involved in accidents. For instance, Newnam et al. [13] classified injury severity into fatal and nonfatal based on the highest injury to the vehicle occupants involved in the crash. Rahimi et al. [8] defined the injury severity of a single-vehicle accident based on the driver’s injury.

The Federal Highway Administrations [14] defines accident severity on the KABCO scale that measures crash severity based on fatality (K), incapacitating injury (A), evident injury (B), minor injury (C), and property damage only (O). A fatal injury is defined as any injury that results in death within 30 days after the crash occurring. Incapacitating injury is defined as any injury other than a fatal injury which results in severe laceration, broken or distorted extremity, crush injuries, suspected skull, significant burns, unconsciousness when taken from the crash scene, or paralysis. Evident injury is defined as any injury that is evident at the scene of the crash other than fatal or serious injuries. Examples include lump on the head, abrasions, bruises, and minor lacerations. Minor injury is any injury reported or claimed which is not fatal or suspected serious, which includes momentary loss of consciousness, claim of injuries not evident, limping, or complaint of pain. Property damage only injury is defined as no personal injury includes harm to wild animals, or birds, which have monetary value and others. This scale is usually used by law enforcement for classifying crash injuries. Due to the low proportion of some injury categories using the KABCO scale in a specific scenario, Naik et al. [15] combined the injury into KA, B, C, and O categories. Similarly, Rezapour et al. [7] combined the adjacent injuries into one category, that is, KA, BC, and O categories. Ahmed et al. [16] further merged B, C, and O injury into one category, which classified injury levels into severe and nonsevere ones. However, most of the previous literature defined accident severity using the KABCO scale at the individual level, which may fail to represent the very nature of the accidents globally.

2.2. Risk Factors in Mountainous Freeways

Scholars have explored risk factors influencing accident severity in freeways [17–19]. Identified risk factors include the characteristics of the driver, vehicle, road, weather, light, and the specifics of the crash [4, 7, 20].

Driver-specific characteristics, including gender, age, alcohol consumption, driving fatigue, driving distractions, illicit drug use, and failure to use seatbelts, significantly influence the severity of traffic accidents [13, 21–23]. Regarding gender, numerous studies have demonstrated its heterogeneous impact on accident severity [24]. Specifically, some research suggests that female drivers are more susceptible to severe injuries, as they may exhibit less proficiency in handling emergencies compared to their male counterparts [25]. Chen and Chen [26] observed that the likelihood of severe injury escalates when drivers are distracted or drowsy.

Vehicle characteristics such as vehicle type, vehicle age, overloading, lane changing, and carrying hazardous materials were found to be associated with accidents [27, 28]. Rezapour et al. [7] found that collisions involving trucks in mountainous terrain have a heightened likelihood of resulting in severe and even fatal injuries, primarily attributed to brake failures or the loss of control while navigating downhill sections.

Road-specific characteristics such as curved alignment, downward slope, roadway surface conditions, speed limit, average daily traffic volume, and roadside barriers were found to significantly affect the severity of accidents in mountainous freeways [25, 29]. Wen et al. [30] found that a roadway curve featuring a moderate radius and slope exhibits a notably heightened probability of resulting in medium severity incidents, as opposed to a curve designed with a larger radius and a flatter slope. This finding confirms the critical role that the geometry of curves plays in influencing the severity of traffic accidents.

Environment characteristics such as weather conditions, lighting, and time of day also significantly influence the severity of accidents [17, 31]. Adverse weather conditions, encompassing phenomena such as rain, snow, fog, and strong winds, present a heightened threat to road safety, significantly increasing the likelihood of severe or even fatal injuries in comparison with favorable, clear weather conditions [26, 27] Wen and Xue, 2020).

2.3. Statistical Approaches for Accident Severity Analysis

Statistical approaches such as logit and probit models have been employed to estimate the effects of risk factors on accident severity [4, 30, 33]. However, statistical models assumed that the estimated parameters are fixed for all observations, which may lead to biased parameter estimates and erroneous inferences (Mannering et al., 2016). To address this issue, a number of studies leveraged a random parameters model with heterogeneity in means and variances, enabling the capture of multiple unobserved heterogeneity [18]. For instance, Pervez et al. [34] proposed a random parameter model accounting for heterogeneity in both means and variances to explore the multifaceted impacts of various factors encompassing the environment, driver behavior, crash dynamics, vehicle attributes, and tunnel-specific conditions. Wen et al. [30] introduced a correlated random parameter logit model and investigated the effects of the combination of curve and slope factor on the injury severity of truck crashes.

While logit or probit methodologies provide valuable insights into the correlation between risk factors and accident severity, they are inherently constrained by certain presuppositions or predefined functional forms, thus failing to fully capture the intricate interrelationships among variables. In contrast, BNs represent a probabilistic modeling paradigm that excels in depicting the dependencies and causal linkages among a diverse array of variables [35]. BNs have garnered significant interest in accident analysis and decision-making within the transportation domain. For instance, Li et al. [36] proposed a robust Bayesian robit model, incorporating the Student’s t distribution as the link function to address anomalous data points within traffic accident datasets. Liu et al. [37] introduced a Bayesian deep learning model for detecting freeway incidents with uncertainty quantification. Wu et al. [6] developed a BN to analyze crash injury severity, revealing significant interactions among risk factors. Their findings indicated that vehicle weight and crash mode notably influenced airbag deployment, which in turn had a substantial impact on crash severity.

In summary, previous studies usually focused on the personal injury level rather than the accident level, few of them further quantifying the accident causality by clustering the accident severity from multidimensional perspectives. In this study, we endeavor to mitigate the aforementioned shortcomings by using a two-stage data-driven approach. First, Gaussian mixture clustering is proposed to cluster the accident severity at the aggregation level. Then, a BN is constructed to uncover the interplay among risk factors of accident severity and is further used for one- and two-dimensional inference analysis to maximize the potential of BN to influence policy. Finally, the accident data on mountainous freeways in Yunnan Province of China between 2016 and 2021 are used to calibrate and validate the proposed model.

3. Data

The mountainous freeway segment examined in this study constitutes a 105 km stretch of the Mazhao freeway located in Yunnan Province, China. This segment is an integral part of the G85 Yinchuan–Kunming national highway network, characterized by a design speed of 100 km/h, a roadbed width of 33.5 m, and a bridge-tunnel ratio of 50.79%. The analysis utilized a crash database provided by the Yunnan Provincial Department of Traffic Police. A comprehensive dataset comprising 796 accidents that occurred on this freeway between 2016 and 2021 was compiled for this study.

Based on the available data and the objectives of this study, the variables were categorized into four main groups: driver-related factors (e.g., fatigue, speed limit violations, lane-change violations, improper braking), environment-related factors (e.g., weather conditions, season, lighting, and day of the week), road-related characteristics (e.g., alignment, slope, road type, surface conditions, roadside barriers, and speed limits), and vehicle-related characteristics (e.g., traffic volume levels, vehicle types). The descriptive statistics of the key variables are presented in Table 1.

Table 1. Descriptive statistics of accident variables.

Category	Variables	Value	Frequency	Percentage (%)
Driver-related	Fatigue	Yes	73	9.22
	Fatigue	No	719	90.78
	Speed limit violation	Yes	35	4.42
	Speed limit violation	No	757	95.58
	Lane-change violation	Yes	9	1.14
	Lane-change violation	No	720	90.91
	Misoperation	Yes	451	56.94
	Misoperation	No	278	35.10
	Without maintaining a safe distance	Yes	95	11.99
	Without maintaining a safe distance	No	634	80.05

Environment-related	Weather	Clear	288	36.36
		Foggy	4	0.51
		Cloudy	460	58.08
		Rain	36	4.55
		Snow	4	0.51
	Season	Spring	161	20.33
		Summer	196	24.75
		Autumn	178	22.47
		Winter	257	32.45
	Light	Daylight	589	74.37
	Light	Dark, dawn, or dusk	203	25.63
	Day of week	Weekday	570	71.97
	Day of week	Weekend	222	28.03

Road-related	Alignment	Straight	503	63.51
		Curve	130	16.41
		Flat, or upgrade	41	5.18
		Downgrade	118	14.90
	Road type	Ordinary road section	348	43.94
		Bridge	149	18.81
		Tunnel	245	30.93
		Ramp	50	6.31
	Surface condition	Wet	107	13.51
		Dry	667	84.22
		Snow cover	13	1.64
	Speed limit	80 km/h	685	86.49
	Speed limit	100 km/h	107	13.51
	Roadside barrier	Protective guard	485	61.24
		Anticollision wall	224	28.28
		No defence	10	1.26

Vehicle-related	Traffic volume level	High (traffic flow ≥ 1000 veh/h)	302	38.13
		Medium (500 ≤ traffic flow < 1000 veh/h)	230	29.04
		Low (traffic flow < 500 veh/h)	260	32.83
	Type of vehicle	Car	528	66.67
		Coach	7	0.88
		Truck	251	31.69

Severity-related	Death	0 person	768	96.97
		1 person	20	2.53
		> 1 person	4	0.51
	Disability injury	0 person	776	97.98
		1 person	13	1.64
		> 1 person	3	0.38
	Nondisability injury	0 person	695	87.75
		1 person	62	7.83
		> 1 person	35	4.42
	Property damage	Yes	719	90.78
	Property damage	No	73	9.22
	Traffic disruption	Yes	76	9.60
	Traffic disruption	No	716	90.40

4. Methodology

This study endeavors to quantify the causal relationship of accident severity, thereby facilitating the prevention of traffic collisions on mountainous freeways. To thoroughly explore the factors contributing to accident severity, two crucial questions require examination. The first question is how to develop an objective and adaptive scenario classification framework for accident severity. In the majority of prior research, the accident severity level has been categorized using a predefined rule, which is typically based on the most severe injury sustained by an individual involved in the crash. Nevertheless, such an individual-level predefined approach may not effectively reveal the inherent nature of accidents and often struggles to adapt to diverse freeway scenarios. The second question is how to characterize dependency relationships among contributing factors and quantify their respective influences on the accident severity level. In previous studies, the relationship between contributing factors and accident severity was estimated under predefined functions. Unfortunately, these methods were inadequate in describing the dependencies or causal relationships among a set of variables.

To address these questions, a two-stage data-driven approach used in this study is briefly discussed in this section. First, the Gaussian mixture model (GMM) clustering-based method is developed to classify accident severity into different levels. Then, a BN model is constructed to examine the relationship between the risk factors and accident severity levels.

4.1. GMM-Based Severity-Level Clustering

In previous studies, accident severity was usually defined by using the KABCO scale at the individual level [7, 33, 38]. However, accident severity based on the worst injury experience by passengers or drivers involved in the accident may not fully demonstrate the nature of the accident. In this study, accident severity is determined from multidimensional perspectives including the injury level of all people involved in the accident. To be specific, accident severity is related not only to the degree of worst injury, but also to the number of people injured, property damage, and traffic disruption. Since the accident severity is multidimensional and there is no specific probability function available to represent the multimodal nature of the accident covering all types of situations, we need to use a mixture model to represent the accident severity level.

The significant advantage of GMM clustering method is to give the probability distribution of each cluster, which is useful for dealing with inaccurate or overlapping data points, such as some accidents may be between two serious categories. Different from GMM, K-means employs a hard assignment approach requiring explicit cluster allocation, rendering it suboptimal for datasets with overlapping clusters. This technique demonstrates notable limitations in its sensitivity to the initialization of cluster centers, often leading to convergence at local optima rather than global solutions. The GMM clustering addresses these constraints through a probabilistic framework utilizing the EM algorithm for iterative parameter optimization, exhibiting superior capability in modeling complex data distributions through its soft assignment mechanism. Hierarchical clustering, while effective for dendrogram-based analysis, suffers from inherent computational complexity that grows quadratically with dataset size, imposing significant scalability limitations for large-scale applications. Density-based approaches such as DBSCAN, though proficient in identifying arbitrary-shaped clusters, rely on predetermined density thresholds that may exhibit suboptimal performance when handling clusters with heterogeneous densities or non–density-based structures. Therefore, to classify the accident severity with a multimodal distribution, a GMM clustering-based method is proposed to learn the optimal classification.

A GMM for the probability density of a specific accident can be formed as follows, as a weighted sum of k-component Gaussian distributions [39]:

()

where p(x_j) represents the probability density of a specific accident x_j; x_j is a vector composed of severity-related variables such as the number of death, the number of disability injury, the number of nondisability injury, property damage, and traffic disruption; a_i, i = 1, 2, …, k, are the mixture coefficients that indicate the percentage of the accident x_j belonging to each severity category i and satisfy the constraint

; and p(x_j|μ_i, Σ_i) with mean μ_i and variance matrix Σ_i are the component Gaussian density functions. Following previous studies, the accident severity is assumed to be divided into three categories, that is, low severity, medium severity, and high severity [6, 34]. Each category density function is a multivariate Gaussian function of the following form:

()

Assuming that the accident sample generation process is given by Gaussian mixture distribution, that is, equation (1), the Gaussian mixture component is selected according to the prior distribution of a₁, a₂, …, a_k first, and then, the accident sample can be generated by sampling operation with the probability density function of the selected mixture component. Let random variable z_j ∈ {1, 2, …, k} representing the Gaussian mixture component of the accident sample x_j, the posterior probability of z_j can be derived by Bayes theorem as follows:

()

where p(z_j = i) is the prior probability of category i, which is consistent with a_i and p(x_j|z_j = i) represents the probability of accident x_j in cluster z_j = i.

Let λ_j representing the cluster label (severity level) of accident x_j and γ_ji representing the posterior probability p(z_j = i|x_j), the optimal cluster λ_j for each accident x_j can be determined by the maximum posterior probability as follows:

()

The model parameters, that is, θ = (a_i, μ_i, Σ_i), can be estimated by maximum likelihood estimation (MLE). Given the training dataset D = {x₁, x₂, …, x_m}, the maximum log likelihood of equation (1) can be formulated as follows:

()

Unfortunately, this formulation is a nonlinear function with latent variable a_i and direct maximization is not possible. The EM algorithm can be applied to solve this problem [40]. In each iteration, the posterior probability γ_ji for each sample x_j belonging to each cluster is calculated first (E step), and then, the model parameters (a_i, μ_i, Σ_i) are updated according to equations (9), (10), and (12) (M step). If parameters θ = (a_i, μ_i, Σ_i) enable to maximize equation (5), then the partial derivation of (a_i, μ_i, Σ_i) with respect to

equals to zero:

()

Taking the derivative of a series of matrices, μ_i and Σ_i can be obtained as follows:

()

Considering that a_i should satisfy a_i ≥ 0 and

, we reformulate equation (6) by Lagrangian function as follows:

()

where λ is the Lagrangian multiplier, λ ≥ 0.

Taking the partial derivation of a_i with respect to equation (10), we have

()

The model parameters are updated iteratively according to equations (8)–(11). If the stop condition is satisfied, for example, reaching the maximum number of iteration rounds or the likelihood value (LL(D)) stopping growing, the cluster classification can be determined.

4.2. BN

A BN model is subsequently developed to investigate the relationship between risk factors and the level of accident severity. BN models have been employed in prior studies for risk analysis [41, 42], owing to their ability to accurately estimate the occurrence probability of an event through a graphical approach. A BN model, also referred to as a Belief network model, constitutes a directed acyclic graph (DAG), where nodes represent system variables and arcs denote dependencies or causal relationships among these variables. The interdependent relationships among nodes are defined by a joint probability distribution, and each node is associated with a conditional probability table (CPT). The CPT outlines the probability of node states given the states of their predecessor nodes [35]. Consequently, the joint probability of a BN comprising d variables can be expressed as follows:

()

where X₁, X₂, …, X_d are the nodes (variables in Table 1) in a given BN model, π_i are the parent nodes of node X_i, and P(X_i|π_i) is the conditional probability of X_i given π_i.

Based on Bayesian theory, the severity level of an accident can be estimated with the posterior probability examined by the BN as follows:

()

where X_s denotes the severity level of an accident and E = {X₁, X₂, …, X_d}\{X_s} denotes the new observations named evidence.

The construction of a BN model comprises two primary steps: (1) determining the network structure (structure learning) and (2) estimating the parameters that best fit the network structure (parameter learning). The network structure can be derived through data-driven learning or by leveraging domain knowledge. Given that data-driven learning may introduce unreasonable links between variables, the BN structure is determined by integrating both data-driven and domain knowledge approaches. An initial framework for the BN structure, based on domain knowledge, is depicted in Figure 1. Prior to data-driven learning, constraints are specified as prior knowledge to preclude unreasonable relationships. We posit that driver-related and road-related factors may be influenced by environment-related factors. Additionally, driver-related factors are also affected by road-related factors. For instance, adverse weather conditions, such as heavy rain and fog, may impair drivers’ visibility and subsequently lead to erroneous operations. The road surface condition is also intimately linked to weather conditions. Subsequently, information gain (IG) is utilized to select the variables that most significantly influence the severity level, thereby streamlining the network structure. In information theory, IG represents the reduction in information entropy, quantifying the extent to which knowledge about the target variable can be gained when an explanatory variable is provided. A higher IG value indicates a stronger correlation between the explanatory variable and the target variable (severity level). IG is calculated as shown in equation (14):

()

Details are in the caption following the image — **Figure 1**
Open in figure viewer PowerPoint

Initial framework for the BN structure.

Then, the K2 algorithm [43] is employed to learn the optimal BN structure. With the given BN structure, parameter learning aims to search for the optimal parameters of the conditional probability distribution for each node. Considering that the EM algorithm [40] is capable of estimating parameters from datasets that contain missing values, the EM algorithm is employed in this study for BN parameter estimation.

5. Results

5.1. Clustering Validity Analysis

To evaluate the efficacy of the accident severity clustering in fitting the data, the prevailing approach involves utilizing internal cluster validity indices. Consequently, three representative indices—the Calinski–Harabasz (CH) index [44], the Silhouette coefficient (SC) [45], and the Davies–Bouldin (DB) index [46]—are employed to assess the performance of clustering methods. The CH index is computed based on the ratio of intracluster variance to intercluster variance, with a higher CH index value indicating superior clustering performance. The SC index, which spans from −1 to 1, encapsulates both the compactness of samples within clusters and the distinctness between clusters. A value closer to 1 signifies a more effective clustering outcome. The DB index is derived from the maximum mean value of the ratio of intracluster compactness to intercluster separation, with a smaller DB index value suggesting enhanced clustering efficacy. In summary, a greater degree of similarity among objects within clusters and a lesser degree of similarity between clusters denote improved clustering performance.

Table 2 presents the performance metrics of four clustering methods [47–49]. The GMM-based clustering approach exhibits the highest values for both the CH and SC indices, along with the lowest value for the DB index. These results indicate that the GMM-based method outperforms the other approaches across all evaluated indices. Consequently, the GMM-based clustering method is chosen for classifying the accident severity levels.

Table 2. Comparison of four clustering methods.

Method	Index
Method	CH	SC	DB
k-means	729	0.56	0.96
Density-based spatial clustering	803	0.68	1.05
Hierarchical clustering	751	0.45	0.98
GMM-based clustering	920	0.80	0.92

Figure 2 illustrates the centroids of the three accident severity levels classified by the GMM-based clustering approach. These levels account for 60.4%, 33.7%, and 5.8% of the cases, respectively. Level 1 is distinguished by accident cases involving no fatalities or disability injuries, representing the lowest severity level. Level 2 encompasses a relatively small number of fatal and disability injuries but the highest proportion of nondisability injuries, thus being categorized as the medium severity level. Level 3 is characterized by the highest incidence of fatalities, disability injuries, and property damage cases, indicating the most severe accidents. By contrasting the classification results with the KABCO scale at the individual level, we observed that the clustering outcome exhibits greater rationality at the aggregate level, and the cases within each level are more amenable to interpretation for modeling purposes. Consequently, BN inference is conducted using the clustering results derived from the GMM approach.

5.2. BN Structure Analysis

As illustrated in Figure 3, 12 discrete risk factors, meticulously selected from the domains of driver characteristics, environmental conditions, road attributes, and vehicle-related parameters, demonstrate significant associations with the probability of accident severity. The BN structure underwent rigorous validation via the Monte Carlo permutation test, confirming that all established links within the network exhibit statistical significance at a 95% confidence level.

Specifically, Figure 3 highlights that driving fatigue, misoperation, light, traffic volume level, surface condition, alignment, road type, speed limit, roadside barrier, and vehicle type are directly linked to accident severity. Notably, while the BN structure does not depict a direct causal relationship between weather conditions and accident severity, this absence does not imply the lack of an underlying association. Instead, an indirect dependency emerges, mediated by three intermediate risk factors: misoperation, surface condition, and traffic volume. Similarly, although no direct connection between seasonality and accident severity is observed, an indirect relationship is evident through the sequential pathways linking season to weather conditions and season to road surface conditions.

Furthermore, the network reveals a direct association between fatigue driving and misoperation, suggesting that driver fatigue can precipitate abnormal driving behaviors. This finding represents a novel contribution to the literature, as previous studies relying on regression-based methodologies were inherently limited in their ability to uncover complex interdependencies among dependent variables, thereby overlooking this critical association.

To identify the risk factors that mostly influence the accident severity level, IG is employed to measure the feature importance. The relative importance for 12 risk factors in the BN model is shown in Figure 4. It demonstrates that weather condition plays the most important role in accident severity, while the season has slight impact on the severity. Driving fatigue and misoperation occupy more than 38% relative importance to the BN model. It indicates that driver-related factors should be taken seriously for accident prevention.

5.3. BN Model Performance

The BN model must be verified before implementation. The accuracy is usually selected as the evaluation metric to assess the performance of the model. However, it is problematic to assess the accuracy of the cases with unbalanced positive and negative samples. Alternatively, it is reasonable to assess the predictive performance by the area under the curve (AUC) [50]. The AUC, which represents the area under the receiver operating characteristic (ROC) curve, serves as a reliable metric for evaluating the classifier performance. The ROC curve is constructed by plotting the true positive rate (TPR) against the false positive rate (FPR) for all relevant categories.

To rigorously validate the reliability and predictive accuracy of our proposed BN model, we conducted comparative experiments against several state-of-the-art machine learning models, including random forest (RF), convolutional neural network (CNN), and support vector machine (SVM). As illustrated in Figure 5, ROC curves were plotted for each accident severity level. The experimental results demonstrate that the BN model outperforms its counterparts, achieving AUC values of 0.804, 0.843, and 0.822 for severity levels 1, 2, and 3, respectively. These AUC scores surpass those reported in prior research [6], thereby underscoring the superior performance of our BN model. Consequently, the BN model exhibits robust reliability in accurately predicting accident severity levels.

5.4. Inference Analysis for BN Model

An inference analysis is conducted to investigate the extent of the impact of each risk factor on accident severity level. The default value for each factor is set to median. The tested evidence is manually set to a specific state with 100% probability. Then, the posterior probability of accident severity level on this evidence is obtained. In the following sections, the unidimensional and two-dimensional inference analysis is reported for the top five most influential risk factors.

5.4.1. Unidimensional Inference Analysis

Figure 6 presents the results of a univariate inference analysis. With regard to the weather variable, adverse weather conditions, including fog and snow, exhibit a higher proportion of accidents classified as severity levels 2 and 3 compared to clear, cloudy, and rainy conditions. This finding suggests that accidents occurring under adverse weather conditions are associated with increased severity. However, our analysis reveals that the rain condition variable does not attain statistical significance. A plausible explanation for this observation is that drivers exhibit heightened caution on mountainous freeways during rainfall, potentially mitigating the severity of accidents.

Regarding driver-related factors, our analysis indicates that the likelihood of more severe accidents (severity levels 2 and 3) is significantly elevated under conditions of fatigue driving. Empirical evidence pertaining to the variable “misoperation” reveals that if a driver engages in misoperations such as illegal lane changes, unsafe overtaking, or improper throttle control, the resulting accident is more prone to exhibit higher severity. Consequently, a comprehensive understanding of the impacts of fatigue driving and driver misoperations is pivotal for the development and design of effective countermeasures aimed at mitigating the risks associated with driver-related crashes.

Regarding road-related factors, the inference of road alignment suggests that the probability of more severe accidents is higher if the crash occurs on a curve and downgrade. For road surface conditions, the results suggest that snow cover and frozen conditions may cause more severe accidents. A possible reason is that the braking performance may degrade severely when the vehicle runs on the snow cover and frozen surface. However, the wet and dry surface conditions have a comparable impact on the severity level.

5.4.2. Two-Dimensional Inference Analysis

The advantages of using a two-dimensional Bayesian inferential analysis stem from its ability to model the joint distribution of two parameters, capturing their interdependencies and providing a comprehensive framework for uncertainty quantification. When analyzing two variables, Bayesian inferential analysis can easily handle multivariate relationships and interaction effects. It can estimate not only the main effects of each variable but also the interaction between them. This is important in accident-related risk factor analysis where the relationship between risk factors is often complex and nonadditive. Considering that the weather is changeable and the road conditions are complex in the studied freeway section, a two-dimensional inference analysis is conducted to reveal the association between the weather and other risk factors. Weather, as the risk factor with the most relative importance, is taken in combination with each of the remaining four risk factors with higher relative importance.

Figure 7(a) shows the accident severity distribution under the joint inference of weather and fatigue driving. The results suggest that fatigue driving increases the probability of Level 2 and Level 3, regardless of the weather conditions in the accident. The probability of Level 2 and Level 3 raises by up to more than 50% when the driver is in the fatigue driving state in the foggy weather condition. This demonstrates that policymakers should consider installing driver fatigue prevention devices (e.g., deceleration strips or warning signs) on mountainous freeways since those devices help reducing driver fatigue as well as injury severity [51].

Figure 7(b) illustrates the simultaneous inference of weather and misoperation. The results suggest that when the driver makes a misoperation during a crash, regardless of the weather condition, the probability of higher severity (Level 2 and Level 3) is more than when a crash involves no misoperation. This demonstrates the necessity of the vehicle safety assistant driving devices including advanced cruise control system to ensure that a control system can take over the vehicle when the driver makes an operation error.

Figure 7(c) presents the impact of weather and alignment on the accident severity level. In foggy weather condition, the probability of higher severity (Level 2 and Level 3) is more than when an accident occurs in other weather conditions. By analyzing the inference results of the two risk factors on injury severity, it can be seen that severe injury are most likely to occur in foggy conditions of downgrade road sections.

The inference of weather and road surface condition on the severity of the accident is shown in Figure 7(d). The results suggest that when the road surface is snow covered and the weather is foggy, the probability of severe accident is the largest due to the poor visibility and braking performance. Therefore, it is necessary to close the snow-covered mountainous freeway sections in the foggy weather conditions.

6. Conclusions

This study examines the impact of driver-related, environment-related, road-related, and vehicle-related factors on the severity of traffic accidents on mountainous freeways. By employing Gaussian mixture clustering, the severity levels are classified based on a comprehensive assessment of injuries, property damage, and traffic disruption. Furthermore, a BN model is developed to analyze and predict the severity of traffic accidents, taking into account the interactions among these risk factors. The main conclusions are as follows:

1.
The GMM clustering method classifies accident severity into three distinct levels. Level 1 comprises accident cases with no fatalities or disabling injuries. Level 2 includes a small number of fatalities and disabling injuries, along with the highest proportion of nondisabling injuries. Level 3 is characterized by the highest number of fatalities, disabling injuries, and property damage cases. Compared to the KABCO scale classification at the individual level, our aggregation-level classification method demonstrates greater rationality and practical relevance.
2.
In this study, IG values were incorporated into the BN structure learning process to determine the relative importance of each variable on the decision variables. The results reveal that weather conditions exert the most significant influence on accident severity, whereas the season has a relatively minor impact. Other critical factors include driving fatigue, misoperation, road alignment, and surface conditions, which rank among the top five most influential variables.
3.
Unidimensional and two-dimensional Bayesian inferential analyses were conducted to explore the causal relationships between the top five risk factors and accident severity. The findings indicate that adverse weather conditions, such as fog and snow, are prevalent in most combined scenarios leading to severe accidents. Foggy conditions, in particular, are most strongly associated with fatal crashes. Driver-related factors, such as fatigue and misoperation, significantly increase the likelihood of severe outcomes. Additionally, road-related factors, including alignment and surface conditions, play a substantial role in determining severity levels. The analysis suggests that the synergistic interaction of foggy weather, fatigue driving, misoperation, downhill gradients, and snow-covered surfaces significantly elevates the risk of severe accidents.

This study has several limitations that warrant discussion. First, the effective performance of BN relies heavily on the availability of large datasets. However, since traffic accidents are inherently low-probability events, the limited amount of data poses a significant challenge to the accuracy and reliability of this research. Second, the influence of weather on accidents is primarily determined by the overall weather conditions on the day of the incident rather than the immediate weather conditions preceding the accident. This factor may obscure the true impact of weather on accident occurrence. Third, in the construction of the BN, the same network structure was applied across all time slices, which may fail to capture the potential variations in variable relationships under different temporal conditions.

For future research directions, the traffic conflict index could serve as a valuable risk assessment metric to proactively identify potential traffic safety hazards. Additionally, further investigation into the effects of weather on traffic conflicts should consider the persistence and intensity of adverse weather conditions, as these factors are likely to play a critical role in influencing traffic dynamics.

Disclosure

Broadvision Engineering Consultants Co., Ltd., had no role in manuscript preparation or decision to publish.

Conflicts of Interest

The authors Lingzhi Kong, Changan Xiong, and Wenchen Yang are affiliated with Broadvision Engineering Consultants Co., Ltd., which provided partial funding for this study through a general research grant. However, the grant was awarded through an open competition with no stipulations on research outcomes. The remaining author declares no conflicts of interest.

Author Contributions

Lingzhi Kong: conceptualization, methodology, and writing the original draft; Changan Xiong: data curation and analysis; Wenchen Yang: supervision and editing; Weiliang Zeng: conceptualization, methodology, writing the original draft, and reviewing. All data were analyzed independently by the authors.

Funding

This work was supported by the Science and Technology Program of the Department of Transportation, Yunnan Province (No. 2022-107, 2019303), the Science and Technology Research Project of YCIC, China (No. YCIC-YF-2022-06), the National Natural Science Foundation of China (No. 62273102), the Guangdong Basic and Applied Basic Research Foundation (No. 2024A1515010629), and the Open Funding of Guangdong Provincial Key Laboratory of Intelligent Transportation System (No. 202005003).

Open Research

Data Availability Statement

The data that support the findings of this study are available from the corresponding author, Weiliang Zeng, upon reasonable request. The data are not publicly available due to privacy.

References

1 Li L., Li G. Z., Zhang D., Fang R., and Yang W. C., Effects of Emergency Escape Ramps on Crash Injury Severity Reduction on Mountain Freeways: A Case Study in China, Journal of Transportation Safety & Security. (2022) 14, no. 8, 1269–1288, https://doi.org/10.1080/19439962.2021.1909682.
10.1080/19439962.2021.1909682
Web of Science® Google Scholar
2 Peng Y., Liu D., Wu S., Yang X., Wang Y., and Zou Y., Enhancing Mixed Traffic Flow With Platoon Control and Lane Management for Connected and Autonomous Vehicles, Sensors. (2025) 25, no. 3, https://doi.org/10.3390/s25030644.
10.3390/s25030644
Web of Science® Google Scholar
3 Kuang J., Tan G., Guo X., Pei X., and Peng D., A Two-Stage Cooperative Adaptive Cruise Control for Connected Automated Vehicles in Multislope Roads Considering Communication Delay and Actuator Delay, Journal of Advanced Transportation. (2024) 2024, no. 1, https://doi.org/10.1155/2024/8848475.
10.1155/2024/8848475
Web of Science® Google Scholar
4 Huang H., Peng Y., Wang J., Luo Q., and Li X., Interactive Risk Analysis on Crash Injury Severity at a Mountainous Freeway With Tunnel Groups in China, Accident Analysis & Prevention. (2018) 111, 56–62, https://doi.org/10.1016/j.aap.2017.11.024, 2-s2.0-85034632972.
10.1016/j.aap.2017.11.024
PubMed Web of Science® Google Scholar
5 Domenichini L., La Torre F., Vangi D., Virga A., and Branzi V., Influence of the Lighting System on the Driver’s Behavior in Road Tunnels: A Driving Simulator Study, Journal of Transportation Safety & Security. (2017) 9, no. 2, 216–238, https://doi.org/10.1080/19439962.2016.1173155, 2-s2.0-84980378388.
10.1080/19439962.2016.1173155
Google Scholar
6 Wu J., Rasouli S., Zhao J., Qian Y., and Cheng L., Large Truck Fatal Crash Severity Segmentation and Analysis Incorporating All Parties Involved: A Bayesian Network Approach, Travel behaviour and society. (2023) 30, 135–147, https://doi.org/10.1016/j.tbs.2022.09.003.
10.1016/j.tbs.2022.09.003
Web of Science® Google Scholar
7 Rezapour M., Wulff S. S., and Ksaibati K., Examination of the Severity of Two-Lane Highway Traffic Barrier Crashes Using the Mixed Logit Model, Journal of Safety Research. (2019) 70, 223–232, https://doi.org/10.1016/j.jsr.2019.07.010, 2-s2.0-85073648803.
10.1016/j.jsr.2019.07.010
PubMed Web of Science® Google Scholar
8 Rahimi E., Shamshiripour A., Samimi A., and Mohammadian A. K., Investigating the Injury Severity of Single-Vehicle Truck Crashes in a Developing Country, Accident Analysis & Prevention. (2020) 137, https://doi.org/10.1016/j.aap.2020.105444.
10.1016/j.aap.2020.105444
PubMed Web of Science® Google Scholar
9 Zeng Q., Gu W., Zhang X., Wen H., Lee J., and Hao W., Analyzing Freeway Crash Severity Using a Bayesian Spatial Generalized Ordered Logit Model With Conditional Autoregressive Priors, Accident Analysis & Prevention. (2019) 127, 87–95, https://doi.org/10.1016/j.aap.2019.02.029, 2-s2.0-85062407330.
10.1016/j.aap.2019.02.029
PubMed Web of Science® Google Scholar
10 Alogaili A. and Mannering F., Unobserved Heterogeneity and the Effects of Driver Nationality on Crash Injury Severities in Saudi Arabia, Accident Analysis & Prevention. (2020) 144, https://doi.org/10.1016/j.aap.2020.105618.
10.1016/j.aap.2020.105618
PubMed Web of Science® Google Scholar
11 Alrejjal A., Farid A., and Ksaibati K., A Correlated Random Parameters Approach to Investigate Large Truck Rollover Crashes on Mountainous Interstates, Accident Analysis & Prevention. (2021) 159, https://doi.org/10.1016/j.aap.2021.106233.
10.1016/j.aap.2021.106233
PubMed Web of Science® Google Scholar
12 Mannering F., Temporal Instability and the Analysis of Highway Accident Data, Analytic Methods in Accident Research. (2018) 17, 1–13, https://doi.org/10.1016/j.amar.2017.10.002, 2-s2.0-85032566625.
10.1016/j.amar.2017.10.002
Web of Science® Google Scholar
13 Newnam S., Blower D., Molnar L., Eby D., and Koppel S., Exploring Crash Characteristics and Injury Outcomes Among Older Truck Drivers: An Analysis of Truck-Involved Crash Data in the United States, Safety Science. (2018) 106, 140–145, https://doi.org/10.1016/j.ssci.2018.03.012, 2-s2.0-85043588728.
10.1016/j.ssci.2018.03.012
Web of Science® Google Scholar
14 Federal Highway Administrations, KABCO Injury Classification Scale and Definitions, 2019, https://safety.fhwa.dot.gov/hsip/spm/conversion_tbl/pdfs/kabco_ctable_by_state.pdf.
Google Scholar
15 Naik B., Tung L. W., Zhao S., and Khattak A. J., Weather Impacts on Single-Vehicle Truck Crash Injury Severity, Journal of Safety Research. (2016) 58, 57–65, https://doi.org/10.1016/j.jsr.2016.06.005, 2-s2.0-84978880094.
10.1016/j.jsr.2016.06.005
PubMed Web of Science® Google Scholar
16 Ahmed M. M., Franke R., Ksaibati K., and Shinstine D. S., Effects of Truck Traffic on Crash Injury Severity on Rural Highways in Wyoming Using Bayesian Binary Logit Models, Accident Analysis & Prevention. (2018) 117, 106–113, https://doi.org/10.1016/j.aap.2018.04.011, 2-s2.0-85045768463.
10.1016/j.aap.2018.04.011
PubMed Web of Science® Google Scholar
17 Yu R., Xiong Y., and Abdel-Aty M., A Correlated Random Parameter Approach to Investigate the Effects of Weather Conditions on Crash Risk for a Mountainous Freeway, Transportation Research Part C: Emerging Technologies. (2015) 50, 68–77, https://doi.org/10.1016/j.trc.2014.09.016, 2-s2.0-84919782030.
10.1016/j.trc.2014.09.016
Web of Science® Google Scholar
18 Song D., Yang X., Ch Anastasopoulos P., Zu X., Yue X., and Yang Y., Temporal Stability of the Impact of Factors Determining Drivers’ Injury Severities Across Traffic Barrier Crashes in Mountainous Regions, Analytic Methods in Accident Research. (2023) 39, https://doi.org/10.1016/j.amar.2023.100282.
10.1016/j.amar.2023.100282
Web of Science® Google Scholar
19 Su X., Zhi D., Song D., Tian L., and Yang Y., Exploring Weather-Related Factors Affecting the Delay Caused by Traffic Incidents: Mitigating the Negative Effect of Traffic Incidents, Science of the Total Environment. (2023) 877, https://doi.org/10.1016/j.scitotenv.2023.162938.
10.1016/j.scitotenv.2023.162938
Web of Science® Google Scholar
20 Yu R. and Abdel-Aty M., Analyzing Crash Injury Severity for a Mountainous Freeway Incorporating Real-Time Traffic and Weather Data, Safety Science. (2014) 63, 50–56, https://doi.org/10.1016/j.ssci.2013.10.012, 2-s2.0-84888249196.
10.1016/j.ssci.2013.10.012
Web of Science® Google Scholar
21 Zhu X. and Srinivasan S., A Comprehensive Analysis of Factors Influencing the Injury Severity of Large-Truck Crashes, Accident Analysis & Prevention. (2011) 43, no. 1, 49–57, https://doi.org/10.1016/j.aap.2010.07.007, 2-s2.0-78649943153.
10.1016/j.aap.2010.07.007
PubMed Web of Science® Google Scholar
22 Chang L. Y. and Chien J. T., Analysis of Driver Injury Severity in Truck-Involved Accidents Using a Non-Parametric Classification Tree Model, Safety Science. (2013) 51, no. 1, 17–22, https://doi.org/10.1016/j.ssci.2012.06.017, 2-s2.0-84865437277.
10.1016/j.ssci.2012.06.017
Web of Science® Google Scholar
23 Yu M., Zheng C., Ma C., and Shen J., The Temporal Stability of Factors Affecting Driver Injury Severity in Run-Off-Road Crashes: A Random Parameters Ordered Probit Model With Heterogeneity in the Means Approach, Accident Analysis & Prevention. (2020) 144, https://doi.org/10.1016/j.aap.2020.105677.
10.1016/j.aap.2020.105677
Web of Science® Google Scholar
24 Behnood A. and Mannering F. L., The Temporal Stability of Factors Affecting Driver-Injury Severities in Single-Vehicle Crashes: Some Empirical Evidence, Analytic Methods in Accident Research. (2015) 8, 7–32, https://doi.org/10.1016/j.amar.2015.08.001, 2-s2.0-84942411319.
10.1016/j.amar.2015.08.001
Google Scholar
25 Molan A. M., Moomen M., and Ksaibati K., Estimating the Effect of Geometric Features of Side Traffic Barriers on Crash Severity of Interstate Roads in Wyoming, Accident Analysis & Prevention. (2020) 144, https://doi.org/10.1016/j.aap.2020.105639.
10.1016/j.aap.2020.105639
PubMed Web of Science® Google Scholar
26 Chen F. and Chen S., Injury Severities of Truck Drivers in Single-And Multi-Vehicle Accidents on Rural Highways, Accident Analysis & Prevention. (2011) 43, no. 5, 1677–1688, https://doi.org/10.1016/j.aap.2011.03.026, 2-s2.0-79958207517.
10.1016/j.aap.2011.03.026
PubMed Web of Science® Google Scholar
27 Wang Y. and Prato C. G., Determinants of Injury Severity for Truck Crashes on Mountain Expressways in China: A Case-Study With a Partial Proportional Odds Model, Safety Science. (2019) 117, 100–107, https://doi.org/10.1016/j.ssci.2019.04.011, 2-s2.0-85064249477.
10.1016/j.ssci.2019.04.011
Web of Science® Google Scholar
28 Chen M., Chen P., Gao X., and Yang C., Examining Injury Severity in Truck-Involved Collisions Using a Cumulative Link Mixed Model, Journal of Transport & Health. (2020) 19, https://doi.org/10.1016/j.jth.2020.100942.
10.1016/j.jth.2020.100942
PubMed Web of Science® Google Scholar
29 Azimi G., Rahimi A., Asgari H., and Jin X., Severity Analysis for Large Truck Rollover Crashes Using a Random Parameter Ordered Logit Model, Accident Analysis & Prevention. (2020) 135, https://doi.org/10.1016/j.aap.2019.105355.
10.1016/j.aap.2019.105355
PubMed Web of Science® Google Scholar
30 Wen H., Ma Z., Chen Z., and Luo C., Analyzing the Impact of Curve and Slope on Multi-Vehicle Truck Crash Severity on Mountainous Freeways, Accident Analysis & Prevention. (2023) 181, https://doi.org/10.1016/j.aap.2022.106951.
10.1016/j.aap.2022.106951
PubMed Web of Science® Google Scholar
31 Uddin M. and Huynh N., Truck-Involved Crashes Injury Severity Analysis for Different Lighting Conditions on Rural and Urban Roadways, Accident Analysis & Prevention. (2017) 108, 44–55, https://doi.org/10.1016/j.aap.2017.08.009, 2-s2.0-85027974739.
10.1016/j.aap.2017.08.009
PubMed Web of Science® Google Scholar
32 Mannering F. L., Shankar V., and Bhat C. R., Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data, Analytic Methods in Accident Research. (2016) 11, 1–16.
10.1016/j.amar.2016.04.001
Web of Science® Google Scholar
33 Yuan Y., Yang M., Guo Y., Rasouli S., Gan Z., and Ren Y., Risk Factors Associated With Truck-Involved Fatal Crash Severity: Analyzing Their Impact for Different Groups of Truck Drivers, Journal of Safety Research. (2021) 76, 154–165, https://doi.org/10.1016/j.jsr.2020.12.012.
10.1016/j.jsr.2020.12.012
PubMed Web of Science® Google Scholar
34 Pervez A., Lee J., and Huang H., Exploring Factors Affecting the Injury Severity of Freeway Tunnel Crashes: A Random Parameters Approach With Heterogeneity in Means and Variances, Accident Analysis & Prevention. (2022) 178, https://doi.org/10.1016/j.aap.2022.106835.
10.1016/j.aap.2022.106835
PubMed Web of Science® Google Scholar
35 Kjaerulff U. B. and Madsen A. L., Bayesian Networks and Influence Diagrams: A Guide to Construction and Analysis, 2008, Springer Science+Business Media.
10.1007/978-0-387-74101-7
Google Scholar
36 Li Z., Liao H., Tang R., Li G., Li Y., and Xu C., Mitigating the Impact of Outliers in Traffic Crash Analysis: A Robust Bayesian Regression Approach With Application to Tunnel Crash Data, Accident Analysis & Prevention. (2023) 185, https://doi.org/10.1016/j.aap.2023.107019.
10.1016/j.aap.2023.107019
PubMed Web of Science® Google Scholar
37 Liu G., Jin H., Li J., Hu X., and Li J., A Bayesian Deep Learning Method for Freeway Incident Detection With Uncertainty Quantification, Accident Analysis & Prevention. (2022) 176, https://doi.org/10.1016/j.aap.2022.106796.
10.1016/j.aap.2022.106796
PubMed Web of Science® Google Scholar
38 Zheng Z., Lu P., and Lantz B., Commercial Truck Crash Injury Severity Analysis Using Gradient Boosting Data Mining Model, Journal of Safety Research. (2018) 65, 115–124, https://doi.org/10.1016/j.jsr.2018.03.002, 2-s2.0-85045536929.
10.1016/j.jsr.2018.03.002
PubMed Web of Science® Google Scholar
39 Titterington D. M., Smith A. F., and Makov U. E., Statistical Analysis of Finite Mixture Distributions, 1985, John Wiley & Sons.
Google Scholar
40 Dempster A. P., Laird N. M., and Rubin D. B., Maximum Likelihood from Incomplete Data Via the EM Algorithm, Journal of the Royal Statistical Society—Series B: Statistical Methodology. (1977) 39, no. 1, 1–22, https://doi.org/10.1111/j.2517-6161.1977.tb01600.x.
10.1111/j.2517-6161.1977.tb01600.x
Web of Science® Google Scholar
41 Weber P., Medina-Oliva G., Simon C., and Iung B., Overview on Bayesian Networks Applications for Dependability, Risk Analysis and Maintenance Areas, Engineering Applications of Artificial Intelligence. (2012) 25, no. 4, 671–682, https://doi.org/10.1016/j.engappai.2010.06.002, 2-s2.0-84859434921.
10.1016/j.engappai.2010.06.002
Web of Science® Google Scholar
42 Tang J., Heinimann H., Han K., Luo H., and Zhong B., Evaluating Resilience in Urban Transportation Systems for Sustainability: A Systems-Based Bayesian Network Model, Transportation Research Part C: Emerging Technologies. (2020) 121, https://doi.org/10.1016/j.trc.2020.102840.
10.1016/j.trc.2020.102840
Web of Science® Google Scholar
43 Cooper G. F. and Herskovits E., A Bayesian Method for the Induction of Probabilistic Networks from Data, Machine Learning. (1992) 9, no. 4, 309–347, https://doi.org/10.1007/bf00994110.
10.1007/BF00994110
Web of Science® Google Scholar
44 Calinski T. and Harabasz J., A Dendrite Method for Cluster Analysis, Communications in Statistics—Simulation and Computation. (1974) 3, no. 1, 1–27, https://doi.org/10.1080/03610917408548446.
10.1080/03610917408548446
Google Scholar
45 Rousseeuw P. J., Silhouettes: a Graphical Aid to the Interpretation and Validation of Cluster Analysis, Journal of Computational and Applied Mathematics. (1987) 20, 53–65, https://doi.org/10.1016/0377-0427(87)90125-7, 2-s2.0-0023453329.
10.1016/0377-0427(87)90125-7
Web of Science® Google Scholar
46 Davies D. L. and Bouldin D. W., A Cluster Separation Measure, IEEE Transactions on Pattern Analysis and Machine Intelligence. (1979) 2, 224–227, https://doi.org/10.1109/tpami.1979.4766909, 2-s2.0-0017953820.
10.1109/TPAMI.1979.4766909
Web of Science® Google Scholar
47 Ikotun A. M., Ezugwu A. E., Abualigah L., Abuhaija B., and Heming J., K-Means Clustering Algorithms: A Comprehensive Review, Variants Analysis, and Advances in the Era of Big Data, Information Sciences. (2023) 622, 178–210, https://doi.org/10.1016/j.ins.2022.11.139.
10.1016/j.ins.2022.11.139
Web of Science® Google Scholar
48 Ester M., Kriegel H. P., Sander J., and Xu X., A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases With Noise, Kdd. (1996) 96, no. 34, 226–231.
Google Scholar
49 Murtagh F. and Contreras P., Algorithms for Hierarchical Clustering: an Overview, WIREs Data Mining and Knowledge Discovery. (2012) 2, no. 1, 86–97, https://doi.org/10.1002/widm.53, 2-s2.0-84864660152.
10.1002/widm.53
Web of Science® Google Scholar
50 Huang J. and Ling C. X., Using AUC and Accuracy in Evaluating Learning Algorithms, IEEE Transactions on Knowledge and Data Engineering. (2005) 17, no. 3, 299–310, https://doi.org/10.1109/tkde.2005.50, 2-s2.0-14644390912.
10.1109/TKDE.2005.50
CAS Web of Science® Google Scholar
51 Li Y., Yamamoto T., and Zhang G., The Effect of Fatigue Driving on Injury Severity Considering the Endogeneity, Journal of Safety Research. (2018) 64, 11–19, https://doi.org/10.1016/j.jsr.2017.12.007, 2-s2.0-85041423588.
10.1016/j.jsr.2017.12.007
PubMed Web of Science® Google Scholar

All articles

Exploring the Causality of Accident Severity on Mountainous Freeways With a Two-Stage Approach

Abstract

1. Introduction

2. Literature Review

2.1. Definition of Accident Severity in Accidents

2.2. Risk Factors in Mountainous Freeways

2.3. Statistical Approaches for Accident Severity Analysis

3. Data