ORIGINAL ARTICLE

Open Access

FNIRS-Based Energy Landscape Analysis to Signify Brain Activity Dynamics of Individuals With Depression

Yushan Wu

orcid.org/0009-0005-3872-9600

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Shi Qiao,

Shi Qiao

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Jitao Zhong,

Jitao Zhong

orcid.org/0000-0002-2187-2604

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Lu Zhang,

Lu Zhang

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Juan Wang,

Juan Wang

Department of Psychological Medicine, Seventh Medical Center of PLA General Hospital, Beijing, China

Search for more papers by this author

Bin Hu,

Bin Hu

orcid.org/0000-0003-3514-5413

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Hong Peng,

Corresponding Author

Hong Peng

[email protected]

orcid.org/0000-0003-1558-1269

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Key Laboratory of Special Functional Materials and Structural Design, Ministry of Education, Lanzhou University, Lanzhou, China

Correspondence:

Hong Peng ([email protected])

Search for more papers by this author

Yushan Wu,

Yushan Wu

orcid.org/0009-0005-3872-9600

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Shi Qiao,

Shi Qiao

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Jitao Zhong,

Jitao Zhong

orcid.org/0000-0002-2187-2604

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Lu Zhang,

Lu Zhang

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Juan Wang,

Juan Wang

Department of Psychological Medicine, Seventh Medical Center of PLA General Hospital, Beijing, China

Search for more papers by this author

Bin Hu,

Bin Hu

orcid.org/0000-0003-3514-5413

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Search for more papers by this author

Hong Peng,

Corresponding Author

Hong Peng

[email protected]

orcid.org/0000-0003-1558-1269

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, China

Key Laboratory of Special Functional Materials and Structural Design, Ministry of Education, Lanzhou University, Lanzhou, China

Correspondence:

Hong Peng ([email protected])

Search for more papers by this author

First published: 01 December 2024

https://doi.org/10.1111/cns.70139

Citations: 1

Funding: This work was supported in part by STI2030-Major Projects (no. 2021ZD0200600), in part by the Fundamental Research Funds for the Central Universities, China (grant no. lzujbky-2023-it30), in part by Gansu Provincial Science and Technology Fundamental Research Funds for Excellent Doctoral Student, China (grant no. 24JRRA492), in part by the National Key Research and Development Program of China (grant no. 2019YFA0706200), and in part by the National Natural Science Foundation of China (grant no. 62227807).

Share a link

Email
Wechat
Bluesky

ABSTRACT

Background

Major depressive disorder (MDD) is one of the most common mental disorders, and the number of individuals with MDD (MDDs) continues to increase. Therefore, there is an urgent need for an objective characterization and real-time detection method for depression. Functional near-infrared spectroscopy (fNIRS) is a non-invasive tool, which is widely used in depression research. However, the process of how the brain activity of MDDs changes in response to external stimuli based on fNIRS signals is not yet clear.

Method

Energy landscape (EL) can describe the brain dynamics under task conditions by assigning energy values to each state. The higher the energy value, the lower the probability of the state occurring. This study compares the EL features of 60 MDDs with 60 healthy controls (HCs).

Results

Compared to HCs, MDDs have more local minima, smaller energy differences, smaller variations in basin sizes, and longer duration in the basin of global minimum (GM). The classification results indicate that using the four features above for depression detection yields an accuracy of 86.53%. Simultaneously, there are significant differences between the two groups in the duration of the major states.

Conclusion

The dynamic brain networks of MDDs exhibit more constraints and lower degrees of freedom, which might be associated with depressive symptoms such as negative emotional bias and rumination. In addition, we also demonstrate the strong depression detection capability of EL features, providing a possibility for their application in clinical diagnosis.

1 Introduction

Depression has become a significant public health concern, ranking as the second most prevalent human ailment, only behind heart disease [1]. According to statistics, there are approximately 350 million people globally suffering from depression [2]. However, the clinical detection rate of depression is less than one-third [3]. The current diagnosis of major depressive disorder (MDD) is predominantly based on self-assessment scales and clinical interviews, heavily reliant on the individual's subjective perception and the clinician's professional expertise [4]. Therefore, there is an urgent need to design more objective and effective assessment tools for MDD, thereby achieving auxiliary early screening for MDD.

Methods for disease screening based on physiological and behavioral signals have been extensively researched [5-9]. Physiological signals are difficult to disguise, making it easier to establish a mapping relationship between MDD's symptoms and signal indicators [10]. Functional near-infrared spectroscopy (fNIRS) is an emerging brain imaging technique that possesses strong portability, real-time monitoring, device portability, and cost-effectiveness as advantages [11-13]. FNIRS can non-invasively measure cortical hemodynamic changes, but it lacks specific biomarkers that accurately describe brain network activity [14].

Previous studies have demonstrated a link between the increased risk of MDD and difficulties in brain resource allocation when individuals face internal or external stimuli [15]. Specifically, this is characterized by decreased connectivity within the frontotemporal control region [16]. Additionally, impaired cooperation and synchronization between this control system and networks involved in internal and external attention are observed [15]. These alterations in network functions tend to sustain and exacerbate depressive symptoms, such as rumination, negative biases, and cognitive abnormalities [17]. Therefore, MDD can be regarded as a dysfunction in brain network functionality.

Currently, most brain network analyses based on fNIRS data utilize the functional connectivity (FC) analysis method. Chao et al. [18] found that compared to healthy controls (HCs), individuals with MDD (MDDs) exhibited abnormal FC of the bilateral ventrolateral prefrontal cortex (VLPFC) and bilateral dorsolateral prefrontal cortex (DLPFC). Dong et al. [19] assessed FC deviations in MDDs during cognitive tasks, observing reduced activation in the prefrontal cortex (PFC) of MDDs. FC analysis can reveal pairwise correlations between channels, providing valuable insights into the dynamics of brain networks and neural circuitry [19]. However, FC assumes that pairwise interactions between channels are independent of each other, potentially overlooking information related to higher-order interactions [20]. For instance, the observed correlation between channel A and channel B in FC might be a combination of the correlations between channel A and channel C as well as between channel B and channel C [21]. For this issue, energy landscape (EL) analysis can serve as an alternative approach to FC analysis. The EL analysis embeds the pairwise maximum entropy model (pMEM), allowing estimation of large-scale patterns of brain activity. In comparison with FC, pMEM has been shown to uncover more physiological information as it infers global activity patterns (i.e., activities across all channels) rather than independent pairwise interactions between channels [22].

In recent years, EL is predominantly applied for analyzing functional magnetic resonance imaging (fMRI) data, focusing on the spatial exploration of brain functional networks within specific regions of interest (ROI) [22-24]. ROIs are manually selected in these studies, potentially introducing human bias that could affect experimental consistency. In addition, EL is consistently utilized to quantify the intricate neural activity of the brain during resting state [25-27]. However, rest is an unconstrained state [28], which may make it difficult to capture the full extent of differences [29, 30]. Many studies involve emotional tasks to facilitate the activation of brain activity and induce emotional responses in participants [31-34]. Greene et al. [35] pointed out that task states have greater research potential for brain networks compared to resting states and could be used to identify depression. Therefore, the application of traditional EL in understanding brain dynamics during task states remains relatively limited. Meanwhile, prior research has primarily focused on investigating brain functional impairments and cognitive disorders such as epilepsy [36] and Alzheimer's disease [37], lacking sufficient exploration into the brain's EL among individuals with mental disorders such as MDD.

To address the aforementioned issues, we propose a channel-level data-driven EL for the analysis of brain neural activity during task states in MDDs. Technically, a channel selection algorithm based on CANDECOMP/PARAFAC Decomposition (CPD) [38, 39] is utilized to choose seven channels. Subsequently, each channel signal is binarized using the mean signal value as a threshold. pMEM is then fitted to match the empirical data distribution of the binarized network states, ultimately constructing an EL. This approach establishes the most informative channel set instead of manually selecting ROIs, mitigating biases, and reducing redundant data. At the same time, our study provides a new perspective on task-related EL analysis in MDDs.

Based on the proposed data-driven approach, this study construct ELs using fNIRS data collected from 120 participants (60 MDDs and 60 HCs) under audio stimulation. The experimental results indicate that pMEM achieves excellent fitting of brain networks in both MDD and HC groups. The two groups share a pair of major states (0000111 and 1111000), but differences are observed in the duration within the basins of these major states, which are related to negative emotional bias and rumination. MDDs exhibit an increase in the number of LMs and the duration of the GM, along with a decrease in the standard deviation of basin sizes and the energy difference. We further demonstrate that EL features can be used for depression detection. These findings suggest the presence of network abnormalities in MDD, typically characterized by more constraints and lower degrees of freedom.

2 Paradigm and Data

2.1 Participants

By means of rigorous screening, meticulous matching, and efficient data organization, 120 participants took part in this research, comprising 60 MDDs and 60 HCs. All participants recruited for this study were screened by psychiatrists from the Department of Psychiatry at the Third Hospital of Tianshui, Gansu, China, using the Mini International Neuropsychiatric Interview (M.I.N.I) [40] and the Mini-Mental State Examination (MMSE) [41]. The 9-item Patient Health Questionnaire (PHQ-9) [42] and 17-item Hamilton Depression Scale (HAM-D-17) [43] scale were utilized to measure the severity of depression in the MDDs. This experiment received approval from the local research ethics committee and obtained written informed consent from all participants after explaining the experimental paradigm.

All participants are between the ages of 18 and 60, have not taken any psychotropic drug or prescribed controlled substances in the past 2 weeks, have no brain damage, epilepsy, or severe physical illnesses, and are not pregnant or lactating. MDDs are diagnosed using the M.I.N.I and MMSE by professional doctors, with a PHQ-9 score ≥ 10 and HAM-D-17 score ≥ 17. Only MDDs meeting one of the following two conditions are selected: (1) first episode of depression, with no history of self-medication prior to this; (2) recurrent episodes, but the previous episode has concluded treatment, and no medication has been taken for at least 6 months prior to this consultation. MDDs have no comorbidities of schizophrenia, anxiety, or other mental disorders, and they do not exhibit high-risk suicidal tendencies. HCs have no history of mental health issues in themselves or within the family, with a PHQ-9 score ≤ 4 and HAM-D-17 score ≤ 7.

The independent-sample t-test is used to assess the differences in age, PHQ-9, and HAM-D-17 between groups, while the chi-square test is used to evaluate gender differences between groups. Table 1 shows the statistical results for clinical features, indicating no statistical differences between the two groups in terms of gender and age. The MDD group has higher PHQ-9 and HAM-D-17 scores compared to the HC group. Hence, we disregard the potential effects of age and gender in the subsequent experiments.

TABLE 1. Clinical characteristics of MDDs and HCs.

Characteristics	MDDs	HCs	p
Age (years)	37.63 ± 13.99	36.95 ± 13.18	0.81
PHQ-9	15.48 ± 4.94	1.43 ± 1.61	0.00
HAM-D-17	27.25 ± 5.66	4.27 ± 2.50	0.00
Gender (male/female)	30/30	30 / 30	1.00

Note: The independent-sample t-test and the chi-square test are used to compare the clinical characteristics between the MDDs and HCs.
Abbreviations: HCs, healthy controls; MDDs, individuals with MDD.

2.2 Paradigm

The use of audio stimuli as a simple and effective method for inducing emotions has found wide application in the fields of fNIRS and affective computing [44, 45]. We design a paradigm based on audio stimuli, consisting of four blocks, each comprised of four types of trials: happy, calm, fear, and white noise. The 16 trials are arranged using a Latin square design to reduce the impact of sequence order on the experiment. Each trial lasts for 18 s, with a 20-s rest period between two trials to facilitate the restoration of the hemodynamic response to its baseline level. The entire experiment takes approximately 15 min. The experiment takes place in a quiet room, and participants are instructed to keep their eyes closed and maintain bodily stillness throughout the entire experimental procedure. Figure S1 in the Supporting Information depicts the overall flow of the paradigm.

2.3 Data Acquisition and Preprocessing

In accordance with the 10–20 international system, the fNIRS system is configured with 22 channels in the prefrontal cortex region for this study. Figure S2 in the Supporting Information illustrates the electrode arrangement of the fNIRS system. Channels 2, 7, 9, 14, 16, 21, and 22 are located in the superior frontal gyrus (SFG) region, while the remaining channels are situated in the middle frontal gyrus (MFG) region [46]. The experiment gathers all fNIRS signals utilizing a multi-channel continuous-wave fNIRS system (NIRx Medical Technologies LLC) at a sampling rate of 7.81 Hz. The NIRStar data acquisition software (version 15.1) is used to document the configuration of the optode placement. The raw signal from fNIRS consists of the time series of optical intensity values at two wavelengths, 760 and 850 nm, for each channel. The modified Beer–Lambert law (MBLL) is utilized to compute the concentrations of oxygenated hemoglobin (HbO), deoxygenated hemoglobin (HbR), and total hemoglobin (HbT) corresponding to the optical intensity data [47].

Then, we categorize the fNIRS signals into positive (happy), neutral (calm), negative (fear), and all stimuli based on the types of stimulus tasks. This further segmentation aids in a more detailed exploration of how different emotional stimuli impact MDDs. Therefore, the fNIRS signals are divided into four sequences.

Hemodynamic signals unavoidably contain various sources of interference, such as experimental noise, physiological noise, and instrument noise [48, 49]. The Homer2 toolbox [50] in MATLAB 2019a can be used for preprocessing. The hmrMotionArtifactByChannel function recognizes motion artifacts in the time periods, with specific parameters set as follows: AMPthresh = 5.0, tMotion = 0.5, STDEVthresh = 20.0, and tMask = 3.0 [18]. The hmrMotionCorrectSpline function with pSpline = 0.99 performs correction. Additionally, the hmrBandpassFilt function with a cutoff frequency of 0.01 to 0.2 Hz is applied to remove heartbeats (> 1 Hz), respiration (0.2~0.5 Hz), and high-frequency noise [51].

3 Method

We analyze the EL of task-state fNIRS signals to explore the physiological differences in response to external emotional stimuli between MDDs and HCs. The flowchart of the EL analysis in this study is illustrated in Figure 1. The framework mainly comprises four steps: channel selection, signal binarization, fitting the pMEM model, and constructing the EL. Firstly, the preprocessed data are structured into a three-dimensional tensor, with dimensions representing channels, participants, and signal frequency, respectively. The channel factor matrix is extracted by CPD and utilized for channel selection. To better capture brain activity patterns [21], we binarize the continuous fNIRS signals. Following this, the relative frequency of occurrence for each state is computed. Subsequently, the pseudo-likelihood maximization method is employed to estimate the pMEM model, with accuracy index r serving as the model evaluation metric. Finally, the EL is plotted using the energy of each state, and multiple EL features are extracted to compare the differences and similarities between MDDs and HCs.

Details are in the caption following the image — **FIGURE 1**
Open in figure viewer PowerPoint

The flow chart of EL analysis. (A) fNIRS data is organized as a three-dimensional tensor, with the three dimensions representing channel, frequency, and subject. Among these, the channel dimension requires further selection. (B) Seven channels are selected through the CPD algorithm and Pearson correlation analysis. (C) Signal binarization. Data points above the mean are marked as 1, while those below are marked as 0. (D) Each column represents the 7-digit code corresponding to a state, and the empirical appearance probability is calculated. (E) A pairwise maximum entropy model is simulated to obtain energy corresponding to each state. (F) Energy landscape model schematic. The higher the energy of a state, the lower its frequency of appearance.

The EL conceptualizes brain signals as a network composed of various states, each characterized by an energy value inversely related to its probability of occurrence [37]. Consequently, states with lower energy levels are more likely to occur. Due to varying energy levels among different states, there emerge unstable energy peaks (with low probabilities of occurrence) and stable energy basins (with high probabilities of occurrence), creating distinct energy constraints [52]. In the EL, the brain's neural activity is mapped as the movement of a ball constrained by the aforementioned energy constraints. Consequently, the trajectory of this “ball” within the EL tends to roll from peaks toward basins, lingering within the basins repeatedly. Due to random fluctuations, the ball occasionally moves uphill and may transition to another basin [26].

3.1 Channel Selection

In EL analysis, the brain network has 2^C possible states at each moment, where C represents the number of channels. Because calculating the empirical distribution for each state in pMEM incurs exponential growth in fitting costs as the number of channels increases, it becomes essential to reduce the number of channels. The traditional channel selection methods can be classified into filter methods and wrapper methods. However, all these methods rely on matrix analysis, which involves flattening the original three-dimensional fNIRS signal. They only utilize spatial or temporal information, overlooking the interaction between these two types of information. As a result, they cannot accurately characterize multidimensional fNIRS signals. Therefore, we employ tensor decomposition methods to capture the latent spatio-temporal correlations between channels and unearth the hidden structure within the signals.

Tensor decomposition is a method used to break down a tensor into factor matrices and factor vectors, which helps avoid the loss of information caused by data flattening [53]. CPD is a classical method for dimensionality reduction of tensor data, which expresses an N-dimensional tensor as a sum of rank-one tensors [54]. Figure 1A illustrates the CPD.

Suppose

\chi \in {\mathrm{\mathbb{R}}}^{\mathrm{F}\times \mathrm{C}\times \mathrm{S}}

represents the tensor of fNIRS signals for all participants, where C, S, and F, respectively, denote the number of channels, participants, and signal frequencies. The tensor can be written as

\chi \approx {\sum}_{t=1}^T\left({a}_t\circ {b}_t\circ {d}_t\right)

(1)

where T is a positive integer denoting the rank of the tensor.

{}^{{}^{\circ}}

represents the inner product of vectors, and

{\boldsymbol{\upalpha}}^{\mathbf{t}}\in {\mathbb{R}}^{\mathrm{F}}

{\mathbf{b}}^{\mathbf{t}}\in {\mathbb{R}}^{\mathrm{C}}

{\mathbf{d}}^{\mathbf{t}}\in {\mathbb{R}}^{\mathrm{S}}

for t = 1, …, T. Equation (1) can be converted into an elemental form as follows

{x}_{fcs}\approx {\sum}_{t=1}^T\left({a}_{ft}\circ {b}_{ct}\circ {d}_{st}\right)

(2)

\mathrm{for}\ \mathrm{f}=1,\cdots, \mathrm{F},\kern0.5em \mathrm{c}=1,\cdots, \kern0.5em \mathrm{C},\mathrm{s}=1,\cdots, \mathrm{S}

We can integrate multiple rank-one tensors into factor matrices, that is, $A=\left[{\alpha}_1,\dots, {\alpha}_t\right]$ , $B=\left[{b}_1,\dots, {b}_t\right]$ , and $D=\left[{d}_1,\dots, {d}_t\right]$ . Meanwhile, the factor matrices are normalized to unit length, and a weight vector $\lambda \in {\mathbb{R}}^T$ is introduced. Hence, the matrix form of Equation (2) is as follows.

\chi \approx \left[\lambda; A,B,D\right]\equiv \sum \limits_{t=1}^T{\lambda}_t\ {a}_t\circ {b}_t\circ {d}_t

(3)

where

A\in {\mathbb{R}}^{F\times T}

B\in {\mathbb{R}}^{C\times T}

, and

D\in {\mathbb{R}}^{S\times T}

are three factor matrices representing information from frequencies, channels, and participants, respectively.

Through the aforementioned CPD steps, we obtain the channel factor matrix B. This matrix integrates complex information regarding the interactions between the temporal, frequency, and spatial domains. Compared to traditional matrix analysis, channel selection based on CPD yields better results. We take the transpose of matrix B and conduct Pearson correlation analysis on it to obtain the correlation matrix $P\in {\mathbb{R}}^{C\times C}$ . P represents the correlation between different channels. Then, P is averaged along its rows and arranged in descending order according to the mean values, resulting in the importance sequence of C channels. The CPD-based method is applied to the four types of stimuli, and the top 7 ranked channels are extracted as the final channel sets for each stimulus.

3.2 Pairwise Maximum Entropy Model

[21] has demonstrated that the pMEM model based on binarized physiological signals can better simulate the dynamic transitions of brain states. Therefore, continuous fNIRS signals are binarized, which is shown in Figure 1C. For each stimulus, participant, and channel, the mean value of the fNIRS signal is used as the threshold [25]. Signals above the threshold are represented as 1; otherwise, they are represented as 0. Afterward, the activity patterns of each channel can be represented by $\left\{{\sigma}_m(1),\dots, {\sigma}_m(F)\right\}$ , where F is the number of sampling points. ${\sigma}_m(f)=1$ means that the brain region corresponding to the m-th channel is active at time f. For each participant, the brain states of the entire system are given by $s(f)=\left\{\sigma (1),\dots, \sigma (7)\right\}\in {\left\{-1,1\right\}}^7$ . There are 2⁷ = 128 possible states ${s}_i$ , i = 1, …, 128.

Firstly, pMEM calculate the empirical probability

{P}_e\left({s}_i\right)

of each state

{s}_i

{P}_e\left({s}_i\right)=\frac{n_{s_i}}{F}

(4)

where

{n}_{si}

is the number of occurrences of state

{s}_i

across the entire time series.

{\left\langle {\upsigma}_{\mathrm{m}}\right\rangle}_{\mathrm{e}}

denotes the empirical activation rate of m-th channel, and

{\left\langle {\upsigma}_{\mathrm{m}}{\upsigma}_{\mathrm{n}}\right\rangle}_{\mathrm{e}}

denotes the pairwise co-occurrence of m-th and n-th channels.

{\left\langle {\sigma}_m\right\rangle}_e=\frac{1}{F}\sum \limits_{f=1}^F{\sigma}_m(f)

(5)

{\left\langle {\sigma}_m{\sigma}_n\right\rangle}_e=\frac{1}{F}\sum \limits_{f=1}^F{\sigma}_m(f){\sigma}_n(f)

(6)

Next, ${P}_e\left({s}_i\right)$ is fitted to the Boltzmann distribution as follows.

P\left({s}_i|h,J\right)=\frac{\exp \left[-E\left({s}_i|h,J\right)\right]}{\sum_{i^{\prime }=1}^{128}\exp \left[-E\left({s}_{i^{\prime }}|h,J\right)\right]}

(7)

E\left({s}_i\right)=-\sum \limits_{m=1}^7{h}_m{\sigma}_m\left({s}_i\right)-\frac{1}{2}\sum \limits_{m=1}^7\sum \limits_{\begin{array}{c}n=1\\ {}n\ne m\end{array}}^7{J}_{mn}{\sigma}_m\left({s}_i\right){\sigma}_n\left({s}_i\right)

(8)

where

E\left({s}_i\right)

is the energy of

{s}_i

, and

{\sigma}_m\left({s}_i\right)

is the m-th element of

{s}_i

h=\left\{{h}_m\right\}

and

J=\left\{{J}_{mn}\right\}

are the parameters of the pMEM model, which are shown in Figure 1D. h estimates the baseline activity of all channels, while J estimates the interactions between pairs of channels. According to the maximum entropy theory, we obtain two equations:

{\left\langle {\sigma}_m\right\rangle}_e={\left\langle {\sigma}_m\right\rangle}_{\mathrm{mod}}

and

{\left\langle {\sigma}_m{\sigma}_n\right\rangle}_e={\left\langle {\sigma}_m{\sigma}_n\right\rangle}_{\mathrm{mod}}

, and choose h and J that satisfy these conditions.

{\left\langle {\sigma}_m\right\rangle}_{\mathrm{mod}}

and

{\left\langle {\sigma}_m{\sigma}_n\right\rangle}_{\mathrm{mod}}

represent the activation rate and pairwise co-occurrence predicted by pMEM, respectively.

{\left\langle {\sigma}_m\right\rangle}_{\mathrm{mod}}=\sum \limits_{i=1}^{2^7}{\sigma}_m\left({s}_i\right)P\left({s}_i|h,J\right)

(9)

{\left\langle {\sigma}_m{\sigma}_n\right\rangle}_{\mathrm{mod}}=\sum \limits_{i=1}^{2^7}{\sigma}_m\left({s}_i\right){\sigma}_n\left({s}_i\right)P\left({s}_i|h,J\right)

(10)

The pseudo-likelihood maximization method [26] is used to estimate h and J, with a termination condition of 5 × 10⁻⁶ and a learning rate of 0.1.

3.3 Accuracy Index

The accuracy index r can measure the disparity between the results estimated by the pMEM model and the empirical data [26, 37], given by

r=\frac{K_1-{K}_2}{K_1}

(11)

where K₁ is the Kullback–Leibler divergence between empirical data and the probability distribution fitted by the independent maximum entropy model (MEM). The MEM model disregards pairwise interactions, meaning J = 0. K₂ is the Kullback–Leibler divergence between pMEM and empirical data, which is as follows

{K}_2=\sum \limits_{i=1}^{2^7}{P}_e\left({s}_i\right){\log}_2\left(\frac{P_e\left({s}_i\right)}{P\left({s}_i\right)}\right)

(12)

Note that, $r\in \left[0,1\right]$ quantifies the contribution of interactions between pairs of channels. We separately employ pMEM and MEM to predict the empirical distribution and compare the differences between each of them and the empirical distribution. If the distribution fitted by pMEM matches the empirical distribution perfectly, then r = 1. If the disparities between the distributions generated by pMEM and MEM compared to the empirical distribution are close, then r = 0. In other words, pairwise interactions do not contribute to predicting the empirical distribution when r = 0.

3.4 Energy Landscape

The first step in constructing an EL is defining the adjacency relationships between states. When the Hamming distance ${D}_{mn}$ between state ${s}_m$ and state ${s}_n$ is 1, there exists an edge ${E}_{mn}$ , which represents the transition between state m and state n. Therefore, the EL is constructed as an undirected graph composed of 2⁷ states and their corresponding energies.

We traverse through all states and their adjacent states' energy values, searching for multiple local minima (LMs) in EL [52]. The local minimum (LM) is defined as a state whose energy value is lower than the energy values of all its adjacent states. Additionally, the global minimum (GM) is also identified. For each participant, the number of LM and the energy difference between the LM and the GM are utilized as features for further analysis.

To evaluate the influence of each LM, we calculate the basin size of each LM [52]. More specifically, we randomly select an initial state ${s}_m$ . If the energy value of its adjacent state ${s}_n$ is lower than that of ${s}_m$ , we move to state ${s}_n$ using the gradient descent method. Otherwise, we stay in place, indicating that state ${s}_m$ is an LM. The process is repeated until reaching an LM ${s}_g$ , making state ${s}_m$ belong to ${s}_g$ . By traversing the 2⁷ states and executing the aforementioned procedure, each state is allocated to its corresponding LM. The basin size of an LM is defined as the number of states contained within that LM. We extract the standard deviation of the basin sizes of all LMs for each participant as the third analytical feature.

Finally, we develop a random walking model based on the EL to simulate brain neural activity. Markov chain Monte Carlo sampling is employed to simulate state transitions. We randomly select an initial state ${s}_m$ , assuming that state ${s}_m$ has N neighboring states, and each of these neighboring states is chosen with equal probability. We choose a particular neighboring state ${s}_n$ with a probability of $\frac{1}{N}$ . If $E\left({s}_m\right)>E\left({s}_n\right)$ , we move to state ${s}_n$ ; otherwise, the probability of transition is $\exp \left[E\left({s}_m\right)-E\left({s}_n\right)\right]$ . For each participant's EL, we conduct 20,000 iterations and calculate the duration that the system stays within the basin of the GM. This is used as the fourth EL feature in this study.

3.5 Method Summary

As described in section 2, fNIRS data for each participant can be categorized into four types of stimuli: calm, happy, fear, and full stimuli. For all combinations of participants and stimuli, we estimate pMEM and compare their goodness of fit to exclude the impact of model fitting differences on subsequent EL analysis. To comprehensively characterize abnormal neural activity in MDDs, we devise two approaches for EL analysis, focusing on both state-level and individual-level analyses. Before the statistical analysis, we perform the Shapiro–Wilk test [55] on all data. The results show that, except for the number of LM, the remaining data conform to a normal distribution. Therefore, we choose the Kruskal–Wallis nonparametric test [56] to analyze the number of LM, while the remaining data are analyzed using analysis of variance (ANOVA) and Tukey's post hoc tests.

EL analysis based on major states: although the LM sets differ for each combination, the 120 participants all choose the same two LMs (0000111 and 1111000) across four stimulus categories, which we refer to as major states. The durations within the basins of major states are exacted for ANOVA and Tukey's post hoc tests between groups and stimuli.
EL analysis based on participants: four energy features are used for statistical analysis (Section 3.4). We use the Kruskal–Wallis test for the number of LM and ANOVA with Tukey's post hoc tests for energy differences, basin sizes, and the duration of GM.

Finally, we train two classification models using the connectivity features and energy features separately. For each participant, the connectivity feature vector is composed of the J values from the pMEM, while the energy feature is a concatenated vector of the energy of 2⁷ states, the number of LM, energy differences, basin sizes, and the duration of GM. The MinMaxScaler method is used for data standardization. A support vector machine (SVM) is employed for classification, with the following parameters: kernel = “rbf,” tol = 0.001, class weight = “balanced,” gamma = 10⁻⁸~10⁸, C = 10⁻⁸~10⁸. We use grid search to select the optimal gamma and C, and leave-one-out cross-validation to evaluate the predictive abilities of the models.

4 Experiment and Results

4.1 Channel Selection

Figure 2 displays the channels selected under four types of stimuli, which exhibit a high degree of similarity. The selected channels are predominantly concentrated in the medial frontal gyrus (MFG) region, including channels 2, 9, 7, 14, 16, 21, and 22. This finding is consistent with previous research, indicating that, irrespective of the emotional stimuli, the activation levels in the MFG are consistently higher in MDDs compared to HCs [57, 58]. We believe that channels located within the MFG might possess more discriminative features and spatio-temporal correlations, hence being repeatedly selected.

4.2 Accuracy Index of pMEM

We construct a pMEM model for each participant and evaluate the model's goodness of fit using the coefficient r. The fitting results reveal a high accuracy in estimating the empirical distribution by pMEM, with a mean of 0.851 and a standard deviation of 0.083 for the coefficient r. ANOVA is used to assess the differences in r among pMEMs. There are no significant differences between groups (F(1, 472) = 2.849, p = 0.092), but significant main effects exist among stimuli (F(3, 472) = 160.547, p < 0.001). This indicates that the changes in neural activity across different types of stimuli influence the goodness of fit of pMEMs in participants. The results of Tukey's post hoc test show a significant difference between full and calm (p < 0.001), happy and calm (p = 0.048), full and fear (p < 0.001), and full and happy (p < 0.001).

4.3 Energy Landscape Analysis Based on Major States

We categorize all EL models into four stimuli, with each stimulus comprising 120 participants' ELs. Each EL contains several LMs. In general, the sets of LMs for each EL are distinct. However, some states contain rich information, leading them to be selected as LM by multiple ELs. We compile the six most frequently occurring LMs for each stimulus, assigning them numbers 1~18, as shown in Figure 3. Filled and blank cells, respectively, represent whether the corresponding channel is activated (denoted as 1) or not activated (denoted as 0) in that LM. The filled color indicates the frequency of the state being chosen as an LM. Taking major state 1 under full stimuli as an example, it is denoted as 1111000, indicating that ch22, ch16, ch21, and ch7 are activated, while ch11, ch10, and ch3 are abandoned. The dark color indicates that it is chosen as an LM by all 120 ELs.

The experimental results reveal that even under different stimuli, States 1 and 2 (1111000 and 0000111) are present in the LM sets of all participants, and we refer to them as major states. We also observe that although the selected channels differ under each stimulus, channels activated by major state 1 under all four stimuli belong to the MFG region, while channels activated by major state 2 belong to the SFG region. We extract the average duration of the major states to explore differences between groups and stimuli.

We first present the distribution of the average duration using a dot plot (Figure S3), followed by a bar chart (Figure 4) to display the mean and standard deviation of the data. In Figure 4, it is noted that the duration of the two major states in MDDs under full stimuli is greater than that in HCs. When facing negative stimuli, MDDs show longer duration in the SFG region than HC and shorter duration in the MFG region. Under positive stimuli, MDDs exhibit shorter duration than HCs in the SFG region, while in the MFG region, it is the opposite. The inter-group changes induced by neutral stimuli are similar to those induced by full stimuli. We believe these differences are related to abnormal neural activity in the brains of MDDs when facing different stimuli. The specific details will be explained in section 5.1.

We conduct separate ANOVA tests for the duration of the two major states. There are significant differences in the duration of major state 1 between groups (F(1, 472) = 3.975, p = 0.047) and group: stimuli (F(3, 472) = 4.914, p = 0.002). The difference between the stimuli is not statistically significant (F(3, 472) = 2.032, p = 0.109). Similarly, the duration of major state 2 exhibits significant main effects for stimuli (F(3, 472) = 5.419, p = 0.001), groups (F(1, 472) = 4.708, p = 0.031) and group: stimuli (F(3, 472) = 2.975, p = 0.031). The Tukey's post hoc test is employed for further analysis. In the case of major state 1, there are significant differences between full and calm (p = 0.049). In the case of major state 2, the interactions of the following stimuli are significant: fear-calm (p = 0.004) and full-calm (p = 0.007).

4.4 Energy Landscape Analysis Based on Participants

Figure S4 shows the distribution of four energy features under different stimuli, with the number of LM does not exhibit a normal distribution trend, while others approximately following a normal distribution.

All types of stimuli show that the number of LMs is greater in MDDs than in the HCs ( ${\chi}^2$ (1) = 22.368, p < 0.001) (Figure 5A). Additionally, there are significant differences between stimuli ( ${\chi}^2$ (3) = 18.780, p < 0.001).

For each combination of stimuli and participants, the average energy difference between LMs and GM is used as the second indicator (Figure 5B). HCs consistently exhibit a larger energy difference than MDDs (F(1, 472) = 4.521, p = 0.034). No significant main effects between stimuli (F(3, 472) = 0.674, p = 0.568) and group: stimuli (F(3, 472) = 0.416, p = 0.742).

The HCs uniformly have a larger standard deviation in basin size (F(1, 472) = 8.902, p = 0.003) (Figure 5C). No significant differences between stimuli (F(3, 472) = 1.441, p = 0.230). The interaction between group and stimuli is not statistically significant (F(3, 472) = 0.117, p = 0.950).

The duration of the GM shows significant differences in the group (F(1, 472) = 4.267, p = 0.039) and group: stimuli (F(3, 472) = 2.648, p = 0.048) (Figure 5D). No significant main effects between stimuli (F(3, 472) = 0.343, p = 0.794).

4.5 Comparison Between pMEM Features and Energy Features

For the depression detection model based on connectivity features and energy features, we compare the classification abilities of the two. We extract the accuracy, recall, precision, and F1 score to visualize the differences between the two models, as shown in Table 2.

TABLE 2. The comparative results using the SVM classifier.

			Acc	Pre	Rec	Spec	F1
Connectivity	Fear	Mean	72.46	74.41	74.31	74.46	0.72
	Fear	Std.	12.66	10.19	11.89	13.54	0.12
	Happy	Mean	65.12	67.82	68.43	67.38	0.63
	Happy	Std.	11.63	12.48	11.09	12.31	0.12
	Calm	Mean	58.92	62.48	61.62	62.86	0.59
	Calm	Std.	9.77	14.80	15.44	14.97	0.10
	Full	Mean	64.05	68.54	65.68	68.34	0.64
	Full	Std.	10.62	9.76	12.01	11.73	0.12
Energy	Fear	Mean	86.53	86.76	88.30	86.73	0.87
	Fear	Std.	8.71	7.85	9.47	10.54	0.08
	Happy	Mean	76.11	76.94	79.44	77.18	0.76
	Happy	Std.	10.01	12.23	10.63	9.81	0.10
	Calm	Mean	72.67	73.43	75.98	73.52	0.72
	Calm	Std.	13.91	9.08	11.38	12.22	0.13
	Full	Mean	81.30	81.44	82.64	82.45	0.81
	Full	Std.	8.17	10.58	11.69	12.38	0.11

Note: Bold values indicate the best classification performance in each column.
Abbreviations: Acc, accuracy; F1, F1 score; Pre, precision; Rec, recall; Spec, specificity.

The results indicate that the model based on energy features exhibits superior classification performance. All parameters show significant differences between models (accuracy: F(1, 792) = 228.389, p < 0.001, precision: F(1, 792) = 75.310, p < 0.001, recall: F(1, 792) = 126.927, p < 0.001, specificity: F(1, 792) = 85.044, p < 0.001, F1 score: F(1, 792) = 199.249, p < 0.001), and stimuli (accuracy: F(3, 792) = 35.820, p < 0.001, precision: F(3, 792) = 19.556, p < 0.001, recall: F(3, 792) = 16.093, p < 0.001, specificity: F(1, 792) = 16.333, F1 score: F(3, 792) = 36.084, p < 0.001). This suggests that on the basis of simulating brain activity with pMEM, conducting EL analysis is essential. By extracting features from EL to characterize LMs, transforming neurodynamics into attractor dynamics can significantly enhance the capabilities of the depression detection model.

Tukey's post hoc tests reveal significant differences among fear and calm (accuracy: p < 0.001, precision: p < 0.001, recall: p < 0.001, specificity: p < 0.001, F1 score: p < 0.001), full and calm (accuracy: p < 0.001, precision: p < 0.001, recall: p = 0.014, specificity: p < 0.001, F1 score: p < 0.001), full and fear (accuracy: p < 0.001, precision: p = 0.002, recall: p < 0.001, specificity: p = 0.028, F1 score: p < 0.001), happy and calm (accuracy: p = 0.016, precision: p = 0.049, recall: p = 0.021, specificity: p = 0.011, F1 score: p = 0.002), as well as happy and fear (accuracy: p < 0.001, precision: p < 0.001, recall: p < 0.001, specificity: p < 0.001, F1 score: p < 0.001) in four classification metrics. Among them, the fear stimulus segment exhibits the highest classification performance, with an accuracy of 86.53% and a recall of 88.30%. This may be related to the negative bias observed in MDDs.

5 Discussion

In this study, we divide the fNIRS data of each participant into four stimulus segments and construct EL for each stimulus of each participant to characterize the neural dynamics of MDDs. For all 480 ELs, we conduct analyses based on both major states and participants. The statistical results reveal that the number of LMs, basin size, duration of GM, and the energy difference between LMs and GM can serve as significant indicators for MDDs. We also attempt to relate energy features to the mechanisms of depression and find that these analysis results are highly correlated with existing studies on depressive symptoms, such as negative emotional bias, rumination, and restricted brain networks.

5.1 State-Level Analysis and Depressive Symptoms

Given that this study employs an affective stimulus paradigm, we initially explore whether there is a connection between ELs and emotional abnormalities. Current research suggests an association between the negative emotions in MDDs and abnormalities in local cerebral blood flow [59]. Additionally, both positive and negative thinking are associated with the activation of specific brain regions [60]. These results imply that the emotional symptoms in MDDs may serve as a plausible explanation for the findings of EL analysis.

Negative emotional bias is one of the endophenotypes of depression, referring to the tendency of MDDs to exhibit preferential attention to negative information and impaired inhibition [61]. Negative emotional bias leads to intensified processing of negative information in MDDs, and MDDs tend to interpret positive stimuli as negative experiences, thereby perpetuating depression [62]. Past research has already demonstrated a strong association between negative emotional bias and depression. When exposed to positive stimuli, MDDs show increased activation in the SFG region and decreased activation in the MFG region [63]. Conversely, when facing negative stimuli, the pattern is reversed, with decreased activation in the SFG region and increased activation in the MFG region [64]. We will further provide robust evidence for this association through EL features.

In Section 4.3, we extract major states 1 and 2, representing the activation of channels in the MFG and SFG, respectively. The duration in Figure 4 represents the time that the brain spends in the basins of the major states during activity, quantifying the influence of the major states on the brain's dynamic network. Using the fear stimuli as an example, let's illustrate the relationship between duration and regional activation. When MDDs listen to negative audio, the duration in the SFG region is longer than in the HCs. This indicates that the small ball randomly wandering in the EL is more likely to be captured by the SFG region and is difficult to break free from its control. The ball staying in the same state basin for an extended period prevents it from switching states frequently. This indicates a less active network in the SFG region, corresponding to a decreased activity frequency and reduced activation in the SFG area. Therefore, the level of regional activation is inversely proportional to the duration of the corresponding state in that region.

From this, we can further interpret the experimental results of Figure 4. Compared to HCs, MDDs show a longer duration of activation in the SFG region and a shorter duration in the MFG region during negative stimuli, which suggests reduced activation in the SFG region and increased activation in the MFG region. In the case of positive stimuli, the opposite results in duration imply that the activation levels are also opposite. They are consistent with previous research findings related to negative emotional bias.

In full stimuli and calm stimuli, MDDs exhibit longer durations in both brain regions, suggesting the brain is trapped in a single basin, unable to escape. This appears similar to the rumination behavior of MDDs, where they immerse themselves in the emotions of pain and worry, engaging in repetitive self-analysis [65]. In response to external stimuli, rumination is marked by reduced activity in the prefrontal cortex region [66], particularly the dlPFC (equivalent to MFG) [67]. Based on the previous conclusions, inactive networks imply longer durations; thus, the rumination characteristics align with the inter-group comparisons across full stimuli and calm stimuli.

5.2 Individual-Level Analysis and Depressive Symptoms

In this section, we attempt to interpret the results of the EL analysis based on participants in Section 4.4. Figure 5A indicates that MDDs generally have more LMs than HCs. LMs, as attractors that frequently appear in state transitions, can be considered as constraints within the network. The network constraints uncovered in this experiment might reveal a general lack of flexibility in the response systems of MDDs [68], indicating that the brain's degrees of freedom are fewer in MDDs. This discovery may be associated with the pathology of MDDs, specifically, structural alterations in the basal ganglia circuits, characterized by smaller volumes in the bilateral caudate nucleus and putamen [69, 70]. We hypothesize that the neurodegenerative process linked to the atrophy of the caudate nucleus might lead to the doping of many interfering LMs in addition to normally functioning LMs [71]. These additional LMs might result in information interruptions, impeding the normal transmission of information.

GM is the most influential attractor among all LMs, shaping the overall structure of the EL. The energy difference between GM and LMs suggests the degree of ease in transition between the two. The larger the energy difference, the more likely a state transition is to occur. We compute the average of all differences (Figure 5B). MDDs exhibit smaller energy differences, that is, fewer state transitions, implying lower degrees of freedom in their brains.

Similarly, GM possesses the largest basin in the EL, thereby exerting significant influence on the variation in the basin sizes of LMs. We calculate the standard deviation of the basin sizes for each LM to measure the variation in basin sizes, essentially quantifying the similarity between the basins of LMs and GM (Figure 5C). The basins in MDDs display smaller variations, signifying a greater similarity between LM and GM. This finding needs to be jointly explained with the number of LMs. In the definition of basin size, each state can only be assigned to one basin, and the size of each basin is equal to the number of states it contains. Due to the constant total number of states, when there are more LMs in the EL, their basins will be smaller. In conclusion, MDDs have mutually similar small basins, which possess stronger attractor characteristics [25]. This reveals that the ELs of MDDs have stronger constraints, consistent with the earlier conclusions.

Finally, we analyze the duration of the simulated random walk system in the GM. Except for positive stimuli, MDDs spend a longer time in the GM basin in the remaining three conditions (Figure 5D). This further confirms that MDDs are constrained by the strong GM basin, resulting in a prolonged stay of the system within the GM basin. On the contrary, HCs experience a large yet weak attraction from GM, allowing them to more easily escape the influence of GM. This result once again provides compelling evidence for the restricted and disrupted brain information transmission in MDDs.

5.3 Limitations and Challenges

This study discusses the neurodynamic changes in MDDs, but there are still limitations and challenges that need to be addressed. First, EL analysis requires binarization of the continuous fNIRS signals, which leads to a certain degree of information loss. Second, this study does not achieve a highly accurate pMEM model, with an average r-value of 0.851. This may be due to the low sampling rate of the fNIRS device, resulting in a limited number of usable data points per recording, which is insufficient to meet the requirements for fitting a more precise pMEM model. Finally, the computational cost of pMEM increases exponentially with the number of channels, so we are only able to select 7–10 channels for model fitting. Data from most channels are not included in the EL analysis, leading to data wastage, which may have further impacted the accuracy of the pMEM model. In future research, the effects of different thresholds on the experimental results can be discussed to select the optimal threshold and minimize the impact of binarization on the signals. Additionally, increasing the duration of the experimental paradigm could enhance the amount of information in each fNIRS recording, allowing for a higher-precision pMEM model.

6 Conclusion

In this study, we customize a data-driven EL model for fNIRS signals, enabling the observation of the brain dynamics in MDDs while eliminating artificial interference. There are significant differences in the duration of major states, the number of LMs, the energy difference between GM and LMs, the standard deviation of basin size, and the duration of GM between MDDs and HCs, which indicate that MDDs imply constrained and rigid characteristics in their brain dynamic networks. We further explore potential connections between experimental results and symptoms of depression such as rumination and negative emotional bias. In addition, we demonstrate that pMEM and EL features can be utilized for depression detection. These results suggest that EL might have potential applications in the clinical diagnosis of depression, providing a novel perspective for the study of its pathological mechanisms.

Author Contributions

Yushan Wu led the experiment design and drafted the manuscript. Shi Qiao, Jitao Zhong, and Lu Zhang assisted with experimental procedures and data collection. Bin Hu contributed to the research design and methodology. Juan Wang focused on data interpretation, particularly in psychiatric aspects. Hong Peng, the corresponding author, coordinated the research, interpreted the results, and finalized the manuscript.

Acknowledgments

The authors would like to thank Editor-in-chief and the referees for their suggestions to improve the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

We report how we determined our sample size, all data exclusions, all manipulations, and all measures in the study. Data are analyzed using R, version 4.3.1.

Open Research

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Supporting Information

References

1D. F. Santomauro, A. M. Mantilla, J. S. Herrera, et al., “Global Prevalence and Burden of Depressive and Anxiety Disorders in 204 Countries and Territories in 2020 Due to the Covid-19 Pandemic,” Lancet 398, no. 10312 (2021): 1700–1712.
10.1016/S0140-6736(21)02143-7
PubMed Web of Science® Google Scholar
2G. Y. Lim, W. W. Tam, L. Yanxia, C. S. Ho, M. W. Zhang, and R. C. Ho, “Prevalence of Depression in the Community From 30 Countries Between 1994 and 2014,” Scientific Reports 8, no. 1 (2018): 2861.
10.1038/s41598-018-21243-x
PubMed Web of Science® Google Scholar
3Q. Wang, H. Yang, and Y. Yanhong, “Facial Expression Video Analysis for Depression Detection in Chinese Patients,” Journal of Visual Communication and Image Representation 57 (2018): 228–233.
10.1016/j.jvcir.2018.11.003
Web of Science® Google Scholar
4J. C. Mundt, P. J. Snyder, M. S. Cannizzaro, K. Chappie, and D. S. Geralts, “Voice Acoustic Measures of Depression Severity and Treatment Response Collected via Interactive Voice Response (Ivr) Technology,” Journal of Neurolinguistics 20, no. 1 (2007): 50–64.
10.1016/j.jneuroling.2006.04.001
PubMed Web of Science® Google Scholar
5F. Ceccarelli and M. Mahmoud, “Multimodal Temporal Machine Learning for Bipolar Disorder and Depression Recognition,” Patrern Analysis and Applications 25, no. 3 (2022): 493–504.
Google Scholar
6J. Zhong, D. Wang, W. Hongtong, et al., “Filterable Sample Consensus Based on Angle Variance for Pupil Segmentation,” Digital Signal Processing 130 (2022): 103695.
10.1016/j.dsp.2022.103695
Web of Science® Google Scholar
7Y. Tao, M. Yang, W. Yushan, K. Lee, A. Kline, and H. Bin, “Depressive Semantic Awareness From Vlog Facial and Vocal Streams via Spatio-Temporal Transformer,” Digital Communications and Networks 10, no. 3 (20): 577–585
10.1016/j.dcan.2023.03.007
Google Scholar
8M. Yang, X. Feng, R. Ma, X. Li, and C. S. Mao, “Orthogonal-Moment-Based Attraction Measurement With Ocular Hints in Video-Watching Task,” IEEE Transactions on Computational Social Systems 10 (2023): 900–909.
10.1109/TCSS.2023.3268505
Web of Science® Google Scholar
9M. Yang, W. Yushan, Y. Tao, H. Xiping, and H. Bin, “Trial Selection Tensor Canonical Correlation Analysis (Tstcca) for Depression Recognition With Facial Expression and Pupil Diameter,” IEEE Journal of Biomedical and Health Informatics (2023), https://doi.org/10.1109/JBHI.2023.3322271.
10.1109/JBHI.2023.3322271
Google Scholar
10S. D. Raut and V. T. Humbe, “ Palm Vein Recognition System Based on Corner Point Detection,” in 2015 IEEE International WIE Conference on Electrical and Computer Engineering (WIECON ECE) (Dhaka, Bangladesh: IEEE, 2015), 499–502.
10.1109/WIECON-ECE.2015.7443978
Google Scholar
11A. Gallagher, F. Wallois, and H. Obrig, “Functional Near-Infrared Spectroscopy in Pediatric Clinical Research: Different Pathophysiologies and Promising Clinical Applications,” Neurophotonics 10, no. 2 (2023): 23517.
10.1117/1.NPh.10.2.023517
CAS PubMed Web of Science® Google Scholar
12F. Scholkmann, S. Kleiser, A. J. Metz, et al., “A Review on Continuous Wave Functional Near-Infrared Spectroscopy and Imaging Instrumentation and Methodology,” NeuroImage 85 (2014): 6–27.
10.1016/j.neuroimage.2013.05.004
PubMed Web of Science® Google Scholar
13J. Zhong, L. Guangzhi Ma, Q. W. Zhang, S. Qiao, H. Peng, and H. Bin, “Spatio-Temporal Scale Information Fusion of Functional Near-Infrared Spectroscopy Signal for Depression Detection,” Knowledge-Based Systems 283 (2024): 111165.
10.1016/j.knosys.2023.111165
Web of Science® Google Scholar
14X. Cui, S. Bray, and A. L. Reiss, “Functional Near Infrared Spectroscopy (Nirs) Signal Improvement Based on Negative Correlation Between Oxygenated and Deoxygenated Hemoglobin Dynamics,” NeuroImage 49, no. 4 (2010): 3039–3046.
10.1016/j.neuroimage.2009.11.050
CAS PubMed Web of Science® Google Scholar
15P. Fossati, “Circuit Based Anti-Correlation, Attention Orienting, and Major Depression,” CNS Spectrums 24, no. 1 (2019): 94–101.
10.1017/S1092852918001402
PubMed Web of Science® Google Scholar
16R. H. Kaiser, J. R. Andrews-Hanna, T. D. Wager, and D. A. Pizzagalli, “Large-Scale Network Dysfunction in Major Depressive Disorder: A Meta-Analysis of Resting-State Functional Connectivity,” JAMA Psychiatry 72, no. 6 (2015): 603–611.
10.1001/jamapsychiatry.2015.0071
PubMed Web of Science® Google Scholar
17J. P. Roiser and B. J. Sahakian, “Hot and Cold Cognition in Depression,” CNS Spectrums 18, no. 3 (2013): 139–149.
10.1017/S1092852913000072
PubMed Web of Science® Google Scholar
18J. Chao, S. Zheng, W. Hongtong, et al., “Fnirs Evidence for Distinguishing Patients With Major Depression and Healthy Controls,” IEEE Transactions on Neural Systems and Rehabilitation Engineering 29 (2021): 2211–2221.
10.1109/TNSRE.2021.3115266
PubMed Web of Science® Google Scholar
19S.-Y. Dong, J. K. Choi, Y. Park, et al., “Prefrontal Functional Connectivity During the Verbal Fluency Task in Patients With Major Depressive Disorder: A Functional Near-Infrared Spectroscopy Study,” Frontiers in Psychiatry 12 (2021): 659814.
10.3389/fpsyt.2021.659814
PubMed Web of Science® Google Scholar
20P. R. Regonia, M. Takamura, T. Nakano, et al., “Modeling Heterogeneous Brain Dynamics of Depression and Melancholia Using Energy Landscape Analysis. Frontiers,” Psychiatry 12 (2021): 780997.
Google Scholar
21T. Watanabe, S. Hirose, H. Wada, et al., “A Pairwise Maximum Entropy Model Accurately Describes Resting-State Human Brain Networks,” Nature Communications 4, no. 1 (2013): 1370.
10.1038/ncomms2388
PubMed Google Scholar
22T. Watanabe, S. Kan, T. Koike, et al., “Network-Dependent Modulation of Brain Activity During Sleep,” NeuroImage 98 (2014): 1–10.
10.1016/j.neuroimage.2014.04.079
PubMed Web of Science® Google Scholar
23T. Watanabe and G. Rees, “Brain Network Dynamics in High-Functioning Individuals With Autism,” Nature Communications 8, no. 1 (2017): 16048.
10.1038/ncomms16048
CAS PubMed Google Scholar
24T. Ezaki, M. Sakaki, T. Watanabe, and N. Masuda, “Age-Related Changes in the Ease of Dynamical Transitions in Human Brain Activity,” Human Brain Mapping 39, no. 6 (2018): 2673–2688.
10.1002/hbm.24033
PubMed Web of Science® Google Scholar
25J. Kang, C. Pae, and H.-J. Park, “Graph-Theoretical Analysis for Energy Landscape Reveals the Organization of State Transitions in the Resting-State Human Cerebral Cortex,” PLoS One 14, no. 9 (2019): e0222161.
10.1371/journal.pone.0222161
CAS PubMed Web of Science® Google Scholar
26T. Ezaki, T. Watanabe, M. Ohzeki, and N. Masuda, “Energy Landscape Analysis of Neuroimaging Data,” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 375, no. 2096 (2017): 20160287.
10.1098/rsta.2016.0287
PubMed Web of Science® Google Scholar
27A. Ashourvan, S. Gu, M. G. Mattar, J. M. Vettel, and D. S. Bassett, “The Energy Landscape Underpinning Module Dynamics in the Human Brain Connectome,” NeuroImage 157 (2017): 364–380.
10.1016/j.neuroimage.2017.05.067
CAS PubMed Web of Science® Google Scholar
28R. L. Buckner, F. M. Krienen, and B. T. Thomas Yeo, “Opportunities and Limitations of Intrinsic Functional Connectivity Mri,” Nature Neuroscience 16, no. 7 (2013): 832–837.
10.1038/nn.3423
PubMed Web of Science® Google Scholar
29E. S. Finn, D. Scheinost, D. M. Finn, X. Shen, X. Papademetris, and R. T. Constable, “Can Brain State Be Manipulated to Emphasize Individual Differences in Functional Connectivity?,” NeuroImage 160 (2017): 140–151.
10.1016/j.neuroimage.2017.03.064
PubMed Web of Science® Google Scholar
30L. Geerligs, M. Rubinov, R. N. Henson, et al., “State and Trait Components of Functional Connectivity: Individual Differences Vary With Mental State,” Journal of Neuroscience 35, no. 41 (2015): 13949–13961.
10.1523/JNEUROSCI.1324-15.2015
CAS PubMed Web of Science® Google Scholar
31G. Prete, P. Croce, F. Zappasodi, L. Tommasi, and P. Capotosto, “Exploring Brain Activity for Positive and Negative Emotions by Means of Eeg Microstates,” Scientific Reports 12, no. 1 (2022): 3404.
10.1038/s41598-022-07403-0
PubMed Web of Science® Google Scholar
32T. Matsubara, K. Matsuo, M. Nakashima, et al., “Prefrontal Activation in Response to Emotional Words in Pa Tients With Bipolar Disorder and Major Depressive Disorder,” NeuroImage 85 (2014): 489–497.
10.1016/j.neuroimage.2013.04.098
PubMed Web of Science® Google Scholar
33Y. Nishizawa, T. Kanazawa, Y. Kawabata, et al., “Fnirs Assessment During an Emotional Stroop Task Among Patients With Depression: Replication and Extension,” Psychiatry Investigation 16, no. 1 (2019): 80–86.
10.30773/pi.2018.11.12.2
PubMed Web of Science® Google Scholar
34J. L. Stewart, J. A. Coan, D. N. Towers, and J. J. B. Allen, “Frontal Eeg Asymmetry During Emotional Challenge Differentiates Individuals With and Without Lifetime Major Depressive Disorder,” Journal of Affective Disorders 129, no. 1–3 (2011): 167–174.
10.1016/j.jad.2010.08.029
PubMed Web of Science® Google Scholar
35A. S. Greene, S. Gao, D. Scheinost, and R. T. Constable, “Task-Induced Brain State Manipulation Improves Prediction of Individual Traits,” Nature Communications 9, no. 1 (2018): 2807.
10.1038/s41467-018-04920-3
PubMed Web of Science® Google Scholar
36D. Krzemiński, N. Masuda, K. Hamandi, K. D. Singh, B. Routley, and J. Zhang, “Energy Landscape of Resting Magnetoencephalography Reveals Fronto-Parietal Network Impairments in Epilepsy. Network,” Neuroscience 4, no. 2 (2020): 374–396.
Google Scholar
37D. Klepl, F. He, W. Min, M. De Marco, D. J. Blackburn, and P. G. Sarrigiannis, “Characterising Alzheimer's Disease With Eeg-Based Energy Landscape Analysis,” IEEE Journal of Biomedical and Health Informatics 26, no. 3 (2021): 992–1000.
10.1109/JBHI.2021.3105397
Google Scholar
38H. A. L. Kiers, “Towards a Standardized Notation and Terminology in Multiway Analysis,” Journal of Chemometrics: A Journal of the Chemometrics Society 14, no. 3 (2000): 105–122.
10.1002/1099-128X(200005/06)14:3<105::AID-CEM582>3.0.CO;2-I
CAS Web of Science® Google Scholar
39Z. Huang and Q. Wei, “Tensor Decomposition-Based Channel Selection for Motor Imagery-Based Brain-Computer Interfaces,” Cognitive Neurodynamics 18, no.3 (2024): 877–892.
10.1007/s11571-023-09940-4
PubMed Web of Science® Google Scholar
40Y. Lecrubier, D. V. Sheehan, E. Weiller, et al., “The Mini International Neuropsychiatric Interview (Mini). A Short Diagnostic Structured Interview: Reliability and Validity According to the Cidi,” European Psychiatry 12, no. 5 (1997): 224–231.
10.1016/S0924-9338(97)83296-8
Web of Science® Google Scholar
41M. F. Folstein, S. E. Folstein, and P. R. McHugh, “‘Mini-Mental State’: A Practical Method for Grading the Cognitive State of Patients for the Clinician,” Journal of Psychiatric Research 12, no. 3 (1975): 189–198.
10.1016/0022-3956(75)90026-6
CAS PubMed Web of Science® Google Scholar
42K. Kroenke, R. L. Spitzer, and J. B. W. Williams, “The Phq-9: Validity of a Brief Depression Severity Measure,” Journal of General Internal Medicine 16, no. 9 (2001): 606–613.
10.1046/j.1525-1497.2001.016009606.x
CAS PubMed Web of Science® Google Scholar
43M. Hamilton, “A Rating Scale for Depression,” Journal of Neurology, Neurosurgery, and Psychiatry 23, no. 1 (1960): 56–62.
10.1136/jnnp.23.1.56
CAS PubMed Web of Science® Google Scholar
44T. M. Rutkowski, T. Tanaka, A. Cichocki, D. Erickson, J. Cao, and D. P. Mandic, “Interactive Component Extraction From Feeg, Fnirs and Peripheral Biosignals for Affective Brain–Machine Interfacing Paradigms,” Computers in Human Behavior 27, no. 5 (2011): 1512–1518.
10.1016/j.chb.2010.10.016
Web of Science® Google Scholar
45J. Zhong, Z. Shan, X. Zhang, L. Haifeng, H. Peng, and H. Bin, “Robust Discriminant Feature Extraction for Automatic Depression Recognition,” Biomedical Signal Processing and Control 82 (2023): 104505.
10.1016/j.bspc.2022.104505
Web of Science® Google Scholar
46E. T. Rolls, M. Joliot, and N. Tzourio-Mazoyer, “Implementation of a New Parcellation of the Orbitofrontal Cortex in the Automated Anatomical Labeling Atlas,” NeuroImage 122 (2015): 1–5.
10.1016/j.neuroimage.2015.07.075
PubMed Web of Science® Google Scholar
47D. T. Delpy and M. Cope, “Quantification in Tissue Near–Infrared Spectroscopy,” Philosophical Transactions of the Royal Society of London. Series B: Biological Sciences 352, no. 1354 (1997): 649–659.
10.1098/rstb.1997.0046
CAS Web of Science® Google Scholar
48F. Klein and C. Kranczioch, “Signal Processing in Fnirs: A Case for the Removal of Systemic Activity for Single Trial Data,” Frontiers in Human Neuroscience 13 (2019): 331.
10.3389/fnhum.2019.00331
PubMed Web of Science® Google Scholar
49J. Zhong, D. Wenyan, L. Zhang, H. Peng, and H. Bin, “Feature Extraction Based on Sparse Graphs Embedding for Automatic Depression Detection,” Biomedical Signal Processing and Control 86 (2023): 105257.
10.1016/j.bspc.2023.105257
Web of Science® Google Scholar
50T. J. Huppert, S. G. Diamond, M. A. Franceschini, and D. A. Boas, “Homer: A Review of Time-Series Analysis Methods for Near-Infrared Spectroscopy of the Brain,” Applied Optics 48, no. 10 (2009): D280–D298.
10.1364/AO.48.00D280
PubMed Web of Science® Google Scholar
51M. A. Yücel, J. Selb, C. M. Aasted, et al., “Mayer Waves Reduce the Accuracy of Estimated Hemodynamic Response Functions in Functional Near-Infrared Spectroscopy,” Biomedical Optics Express 7, no. 8 (2016): 3078–3088.
10.1364/BOE.7.003078
CAS PubMed Web of Science® Google Scholar
52T. Watanabe, N. Masuda, F. Megumi, R. Kanai, and G. Rees, “Energy Landscape and Dynamics of Brain Activity During Human Bistable Perception,” Nature Communications 5, no. 1 (2014): 4765.
10.1038/ncomms5765
CAS PubMed Google Scholar
53N. Benjamin Erichson, K. Manohar, S. L. Brunton, and J. Nathan Kutz, “Randomized Cp Tensor Decomposition. Machine Learning,” Science and Technology 1, no. 2 (2020): 025012.
Google Scholar
54T. G. Kolda and B. W. Bader, “Tensor Decompositions and Ap Plications,” SIAM Review 51, no. 3 (2009): 455–500.
10.1137/07070111X
Google Scholar
55P. Royston, “Approximating the Shapiro-Wilk w-Test for Non Normality,” Statistics and Computing 2 (1992): 117–119.
10.1007/BF01891203
Google Scholar
56W. W. Daniel, “Kruskal–Wallis One-Way Analysis of Variance by Ranks,” Applied Nonparametric Statistics (1990): 226–234.
Google Scholar
57Z. Zhang, P. Huang, S. Li, et al., “Neural Mechanisms Underlying the Processing of Emotional Stimuli in Individuals With Depression: An Ale Meta Analysis Study,” Psychiatry Research 313 (2022): 114598.
10.1016/j.psychres.2022.114598
PubMed Web of Science® Google Scholar
58C. G. Beevers, P. C. Clasen, P. M. Enock, and D. M. Schnyer, “Attention Bias Modification for Major Depressive Disorder: Effects on Attention Bias, Resting State Connectivity, and Symptom Change,” Journal of Abnormal Psychology 124, no. 3 (2015): 463.
10.1037/abn0000049
PubMed Web of Science® Google Scholar
59W. C. Drevets, “Neuroimaging studies of mood disorders,” Biological Psychiatry 48, no. 8 (2000): 813–829.
10.1016/S0006-3223(00)01020-9
CAS PubMed Web of Science® Google Scholar
60S. Koseki, T. Noda, S. Yokoyama, et al., “The Relationship Between Positive and Negative Automatic Thought and Activity in the Prefrontal and Temporal Cortices: A Multi-Channel Near-Infrared Spectroscopy (Nirs) Study,” Journal of Affective Disorders 151, no. 1 (2013): 352–359.
10.1016/j.jad.2013.05.067
PubMed Web of Science® Google Scholar
61R. B. Price and R. Duman, “Neuroplasticity in Cognitive and Psychological Mechanisms of Depression: An Integrative Model,” Molecular Psychiatry 25, no. 3 (2020): 530–543.
10.1038/s41380-019-0615-x
PubMed Web of Science® Google Scholar
62D. Shamai-Leshem, M. Linetzky, and Y. Bar-Haim, “Attention Biases in Previously Depressed Individuals: A Meta-Analysis and Implications for Depression Recurrence,” Cognitive Therapy and Research 46, no. 6 (2022): 1033–1048.
10.1007/s10608-022-10331-y
Web of Science® Google Scholar
63A. Stuhrmann, T. Suslow, and U. Dannlowski, “Facial Emotion Processing in Major Depression: A Systematic Review of Neuroimaging Findings,” Biology of Mood & Anxiety Disorders 1, no. 1 (2011): 1–17.
10.1186/2045-5380-1-10
PubMed Google Scholar
64P. B. Fitzgerald, A. R. Laird, J. Maller, and Z. J. Daskalakis, “A Meta-Analytic Study of Changes in Brain Activation in Depression,” Human Brain Mapping 29, no. 6 (2008): 683–695.
10.1002/hbm.20426
PubMed Web of Science® Google Scholar
65C. Belzung, P. Willner, and P. Philippot, “Depression: From Psychopathology to Pathophysiology,” Current Opinion in Neurobiology 30 (2015): 24–30.
10.1016/j.conb.2014.08.013
CAS PubMed Web of Science® Google Scholar
66R. De Raedt and E. H. W. Koster, “Understanding Vulnerability for Depression From a Cognitive Neuroscience Perspective: A Reappraisal of Attentional Factors and a New Conceptual Framework,” Cognitive, Affective, & Behavioral Neuroscience 10 (2010): 50–70.
10.3758/CABN.10.1.50
PubMed Web of Science® Google Scholar
67M. A. Ferdek, C. M. van Rijn, and M. Wyczesany, “Depressive Rumination and the Emotional Control Circuit: An Eeg Localization and Effective Connectivity Study,” Cognitive, Affective, & Behavioral Neuroscience 16 (2016): 1099–1113.
10.3758/s13415-016-0456-x
PubMed Web of Science® Google Scholar
68T. B. Kashdan and J. Rottenberg, “Psychological Flexibility as a Fundamental Aspect of Health,” Clinical Psychology Review 30, no. 7 (2010): 865–878.
10.1016/j.cpr.2010.03.001
PubMed Web of Science® Google Scholar
69D. Bennabi, P. Vandel, C. Papaxanthis, et al., “Psychomotor Retardation in Depression: A Systematic Review of Diagnostic, Pathophysiologic, and Therapeutic Implications,” BioMed Research International 2013 (2013): 1–18.
10.1155/2013/158746
Web of Science® Google Scholar
70D. C. Steffens and K. Ranga Rama Krishnan, “Structural Neuroimaging and Mood Disorders: Recent Findings, Implications for Classification, and Future Directions,” Biological Psychiatry 43, no. 10 (1998): 705–712.
10.1016/S0006-3223(98)00084-5
CAS PubMed Web of Science® Google Scholar
71S. Naismith, I. Hickie, P. B. Ward, et al., “Caudate Nucleus Volumes and Genetic Determinants of Homocysteine Metabolism in the Prediction of Psychomotor Speed in Older Persons With Depression,” American Journal of Psychiatry 159, no. 12 (2002): 2096–2098.
10.1176/appi.ajp.159.12.2096
PubMed Web of Science® Google Scholar

Citing Literature

Volume30, Issue11

November 2024

e70139

FNIRS-Based Energy Landscape Analysis to Signify Brain Activity Dynamics of Individuals With Depression

ABSTRACT

Background

Method

Results

Conclusion

1 Introduction

2 Paradigm and Data

2.1 Participants

2.2 Paradigm

2.3 Data Acquisition and Preprocessing

3 Method

3.1 Channel Selection

3.2 Pairwise Maximum Entropy Model

3.3 Accuracy Index

3.4 Energy Landscape

3.5 Method Summary

4 Experiment and Results

4.1 Channel Selection

4.2 Accuracy Index of pMEM

4.3 Energy Landscape Analysis Based on Major States

4.4 Energy Landscape Analysis Based on Participants

4.5 Comparison Between pMEM Features and Energy Features

5 Discussion

5.1 State-Level Analysis and Depressive Symptoms

5.2 Individual-Level Analysis and Depressive Symptoms

5.3 Limitations and Challenges

6 Conclusion

Author Contributions

Acknowledgments

Conflicts of Interest

Open Research

Data Availability Statement

Supporting Information

References

Citing Literature

Figures

References

Related

Information