Parallel group ICA+ICA: Joint estimation of linked functional network variability and structural covariation with application to schizophrenia
Funding information Beijing Municipal Science and Technology Commission, Grant/Award Number: Z181100001518005; Natural Science Foundation of China, Grant/Award Numbers: 61773380, 81471367; NIH, Grant/Award Numbers: 1R01MH094524, P20GM103472, P30GM122734, R01EB005846, R56MH117107; National Science Foundation (NSF), Grant/Award Number: 1539067; Strategic Priority Research Program of the Chinese Academy of Sciences, Grant/Award Number: XDB03040100
Abstract
There is growing evidence that rather than using a single brain imaging modality to study its association with physiological or symptomatic features, the field is paying more attention to fusion of multimodal information. However, most current multimodal fusion approaches that incorporate functional magnetic resonance imaging (fMRI) are restricted to second-level 3D features, rather than the original 4D fMRI data. This trade-off is that the valuable temporal information is not utilized during the fusion step. Here we are motivated to propose a novel approach called “parallel group ICA+ICA” that incorporates temporal fMRI information from group independent component analysis (GICA) into a parallel independent component analysis (ICA) framework, aiming to enable direct fusion of first-level fMRI features with other modalities (e.g., structural MRI), which thus can detect linked functional network variability and structural covariations. Simulation results show that the proposed method yields accurate intermodality linkage detection regardless of whether it is strong or weak. When applied to real data, we identified one pair of significantly associated fMRI-sMRI components that show group difference between schizophrenia and controls in both modalities, and this linkage can be replicated in an independent cohort. Finally, multiple cognitive domain scores can be predicted by the features identified in the linked component pair by our proposed method. We also show these multimodal brain features can predict multiple cognitive scores in an independent cohort. Overall, results demonstrate the ability of parallel GICA+ICA to estimate joint information from 4D and 3D data without discarding much of the available information up front, and the potential for using this approach to identify imaging biomarkers to study brain disorders.
1 INTRODUCTION
Magnetic resonance imaging (MRI) has provided remarkable new insights into the structure and function of human brain. Acquisition of multimodal MRI from the same subject has been widely adopted in brain imaging researches, as different modalities represent different perspectives of the brain functional, structural, or anatomical properties. Moreover, there is growing evidence suggesting that instead of using a single brain imaging modality to study the association with physiologic or pathological properties, researchers are paying more attention to fusion of multimodal information, a method that can take advantage of multiple imaging techniques, and to uncover the latent relationships that might be missed from single modality imaging analysis (Calhoun & Sui, 2016; Sui, Huster, Yu, Segall, & Calhoun, 2014; Zhang, et al., 2011). For instance, multimodal fusion can tell us how brain structure and function are impacted by psychopathology, and which structural or functional aspects of pathology could drive human behavior or cognition (Sui et al., 2014).
However, most current multimodal fusion approaches (including joint independent component analysis (jICA; Calhoun et al., 2006), multi-set canonical correlation analysis (mCCA; Correa, Eichele, Adali, Li, & Calhoun, 2010), multiway partial least squares (N-PLS; Martinez-Montes, Valdes-Sosa, Miwakeichi, Goldman, & Cohen, 2004), parallel ICA (pICA; Liu et al., 2009), parallel ICA with reference (pICAR; Chen et al., 2013), mCCA+jICA (Sui et al., 2011), mCCAR+jICA (Qi et al., 2016; Qi et al., 2018), and linked ICA (Groves, Beckmann, Smith, & Woolrich, 2011), and so on) are restricted to second-level 3D fMRI features, for example, fractional amplitude of low frequency fluctuations (fALFF) or regional homogeneity (ReHo) for fMRI (subjects × imaging feature variables), rather than first-level 4D imaging features (subjects × voxels × time points). The main reason for using second-level features in multimodal fusion is to provide a simpler subspace in which to link the multimodal data (Calhoun & Adali, 2009). While this provides a powerful framework for capturing multimodal information, the trade-off is that some essential information may be lost. For example, in fMRI related multimodal fusion analysis, the temporal dynamic information was not included in the above fusion methods.
It is well known that the temporal variation in fMRI signal conveys important information (Schmithorst & Holland, 2004). Reducing the 4D fMRI data to 3D spatial maps (nontemporal features) prior to the data fusion step does not allow the fusion process to utilize the temporal information. On the other hand, group ICA (GICA; Calhoun, Adali, Pearlson, & Pekar, 2001) is an approach that operates on first-level 4D fMRI data for multiple subjects, which is able to extract both group and subject-specific independent components (ICs) as well as their time courses. Although there are other ICA related method that can deal with fMRI, such as probabilistic ICA (Beckmann & Smith, 2004) and noisy ICA (Cichocki, Douglas, & Amari, 1998; Griffanti et al., 2017), GICA has demonstrated great potential to deal with multi-subject fMRI data, so in this article our proposed method is based on GICA. GICA allows us to establish a correspondence of group ICs with subjects' ICs while fully leveraging the temporal information. In contrast, parallel ICA aims to simultaneously identify ICs of two modalities and the linkage between them by maximizing both intermodality correlation and intramodality independence. Building on the success of GICA to leverage temporal information as well as the flexible fusion framework of parallel ICA, we combine GICA and pICA in order to simultaneously analyze both first-level fMRI and structural MRI (sMRI) features, with the purpose of identifying linked functional spatial network variability and structural covariations, while retaining the original desirable properties of both pICA and GICA. Therefore, we propose to extend the pICA method and link brain structure to first-level functional MRI data via direct optimization of their associations, enabling us to reveal structural influence on coherent functional network variability.
Parallel GICA+ICA works by defining a variability matrix that measures the subject-level variations between group and back-reconstructed subject spatial maps within GICA, and maximizing the correlation of that measure with subject expression profiles from an ICA decomposition of a sMRI dataset. In order to achieve data fusion between structural (subjects × voxel-wise gray matter [GM] volume values) and functional data (subjects × voxels × time points), a subject-level summary feature that captures variability among subjects must be defined for fMRI, so that direct associations can be measured between both modalities. To that end, we note that the group-level ICs from GICA capture spatial co-activation patterns shared among all the subjects, each forming an independent brain network. In this sense, group ICs can be interpreted as a cohort common pattern representing a functional brain network template shared over all subjects. Therefore, it is worthwhile to examine how much the subject-specific ICs deviate from the shared common pattern and whether this deviation may serve as a summarized fMRI feature, which associates with structural data (Chen et al., 2018). Based on this feature, a novel parallel GICA+ICA approach that leverages the first-level temporal information from fMRI can be derived enabling the discovery of associations in the form of linked covariation between functional spatial network patterns and structural features. The key difference from pICA is that an intermediate subject-level variability matrix ( C1) is constructed for parallel GICA+ICA by calculating the L2-distance between group- and subject-specific spatial component maps, capturing subject component variability from a group template. This variability matrix is then utilized to allow multimodal associations to directly influence the GICA and ICA estimation iteratively, hence leveraging the full temporal information of the fMRI as well as spatial variance (Figure 1). Another advantage of parallel GICA+ICA compared with current existing multimodal fusion methods is that it also enables functional network connectivity (FNC) analysis for fMRI after identification of sMRI links from the fusion analysis as well as other post analysis such as spectra and dynamic information (Hutchison et al., 2013) for fMRI.

In psychopathological studies, mounting evidence shows that schizophrenia (SZ) is associated with abnormal FNC between different brain networks, for example, visual, auditory and default mode networks (Friedman, Hastie, & Tibshirani, 2008). Based on the proposed method, we can perform FNC analysis to identify abnormal connections between brain networks that are also associated with structural covariation. In this article, we aim to apply the proposed method to identify linked functional network variability and structural covariations, and ultimately predict cognitive scores based on the identified linked fMRI-sMRI features (GM volume). The Function Biomedical Informatics Research Network (fBIRN) phase III datasets (n = 311; Keator et al., 2016) were used as a discovery cohort and the Center for Biomedical Research Excellence dataset (Jorge Nocedal, 1999; COBRE, n = 177) were used as a replication cohort. Results show that the proposed method can extract linked fMRI-sMRI components pair in both simulation and real human brain data, and this linkage can be replicated in an independent COBRE cohort. Moreover, the identified linked fMRI-sMRI features can predict multiple cognitive scores of fBIRN cohort, which demonstrates the biomarker property of the multimodal features extracted by the proposed method. Furthermore, these identified linked fMRI-sMRI features can also predict multiple cognitive scores of an independent COBRE cohort, demonstrating the ability of parallel GICA+ICA to identify potential biomarkers and the wide utility of the proposed method for the study of brain disorders.
2 METHODS AND MATERIALS
The main idea of parallel GICA+ICA is straightforward. As shown in Figure 1, in order to retain the temporal nature of fMRI, we define a new variability matrix ( C1) to capture functional subject-wise variability by calculating the L2-distance between group-level ICs ( Sgroup, j) and subject-level ICs ( Ssubi, j) ( i = 1, 2, … , N, j = 1, 2, … , M, where N is subject number and M is component number), in correspondence to the subject expression profiles ( A2) from sMRI. In addition to maximizing the independence of each component for each modality, a regularization term is added to simultaneously maximize the correlation between functional ( C1) and structural ( A2) between-subject variabilities, as shown in Figure 1c.
2.1 Parallel group ICA+ICA






















2.2 Simulation


2.3 Real human brain data
For real human brain data, we used subjects collected from the fBIRN (Keator et al., 2016) phase III study, including 149 SZ patients (37.9 ± 11.5) and 162 age-gender matched HCs (37.0 ± 11.0). All participants are adults between 18 and 65 years old. The Structured Clinical Interview for DSM-IV (SCID; First, 2002) was used to diagnose patients by doctors. Furthermore, current or past psychiatric disorder or having a first-degree relative with an Axis-I psychotic illness HCs were excluded. The cognitive performance for both SZ and HC were measured by the Computerized Multiphasic Interactive Neurocognitive System (CMINDS; van Erp et al., 2015). There is no significant age (p = .49) and gender (p = .23) difference between SZ and HC for the fBIRN cohort.
The COBRE cohort consists of 94 SZs (35.6 ± 13.1) and 83 age-gender matched HCs (36.3 ± 12.5) and were assessed using a similar cognitive assessment battery, the Measurement and Treatment Research to Improve Cognition in Schizophrenia Consensus Cognitive Battery (MCCB; Green, Kern, & Heaton, 2004). Detailed cognitive information of fBIRN and COBRE subjects are presented in Supplementary Table SI and SII. There is also no significant age (p = .45) and gender (p = .91) difference for the COBRE cohort. Resting state functional MRI and structural MRI were obtained for both cohorts. Detailed imaging parameters and preprocessing steps can be found in Supplementary “Imaging parameters and preprocessing” section. Written informed consent was obtained from all participants under protocols approved by the Institutional Review Boards for both fBIRN and COBRE cohorts.
3 RESULTS
3.1 Simulation results
The proposed parallel GICA+ICA algorithm was compared with separate GICA and ICA on carefully designed simulated data. One important property is that whether the proposed method can detect the intermodality linkage of the target components accurately and significantly under both strong (r = .49) and weak (r = .28) intermodality associations. Figure 2a,b show the ability for estimating cross modality associations and corresponding significance for different fusion methods under 15 different noise levels with strong intermodality linkage. The green line in Figure 2b,d represents a significance level of p = .05. It is clear that the proposed method, parallel GICA+ICA (red), could detect more accurate intermodality linkage compared with separate GICA and ICA under strong real linkage. The estimation linkage accuracy trend goes down with high noise levels, which is also the same as in significance detection. Figure 2c,d show the boxplot of estimating intermodality associations and corresponding significance for different fusion methods under weak linkage. It is clear that parallel GICA+ICA outperforms separate GICA and ICA for weak linkage detection. More importantly, the association estimation of separate GICA and ICA decreases remarkably when the real correlation is weak (r = .28), when compared with Figure 2a,c, demonstrating the advantage of the proposed method in estimating association with weak linkage, the likely situation in real data. So collectively, the above results show that the proposed method provides more accurate detection of the intermodality associations for both strong and weak links, and importantly, the fusion approach does not appear to inflate the multimodal links artificially.

Besides the linkage estimation, we also compared the source and mixing matrix (or variability matrix) estimation averaged over all components, as shown in Figure 3. The estimation accuracy is defined as the correlation between the true mixing matrices/source(s) and the estimated mixing matrices/component(s). It is clear that parallel GICA+ICA can achieve high estimation accuracy of subject variability within the fMRI modality (Figure 3a), and comparable estimation accuracy for structural mixing matrix and source, and also the subject specific functional source. Finally, the estimation accuracy for different IC numbers to extract modality linkage (under strong linkage, r = .49) between target ICs is presented in Figure 4. It is evident that parallel GICA+ICA out performs separate GICA and ICA in intermodality linkage estimation.


When determining the value of λci and η i in Equations (13–16) that control the weight between the independence constraint and the association constraint, we tune them adaptively. The criteria is that when the maximum value in W i is bigger than the predefined weight maximum (1.0 × 108), then λci(η i) was update as λci(η i) = 0.95 ∙ λci(η i) to avoid the blown up of W i. This means that keeping the independence among components is always the first aim, the second is to maximize the cross modality correlation simultaneously.
3.2 Results of real human brain data
3.2.1 Linked components pair
We also applied the proposed approach on fBIRN multimodal datasets including 311 subjects (162 HCs, 149 SZs). The original preprocessed TCs ×voxels from resting-state fMRI, GM volume from sMRI were extracted and combined by parallel GICA+ICA to identify associated fMRI-sMRI components. The subject-level component was 50, which needs to be higher than the group level (Erhardt et al., 2011), and the group-level component was 20 based on MDL (Li, Adali, & Calhoun, 2007) criterion for fMRI modality. Then two-sample t-tests was performed on the mixing coefficients as well as the variability loadings of each IC for each modality to compare patients and controls.
Among the 20 derived ICs, the 3rd IC of fMRI, the 19th and the 3rd IC of sMRI were found to be the linked components pair (r = .24, p = 2.6e−05* between fMRI_IC3 and sMRI_IC19). * means FDR corrected for multiple comparisons. Note that this association still exist even after regressing out diagnosis (r = .21, p = .0002*). The mixing coefficients (or variability loadings) of the paired components show significant group difference (p = 7.0e−06*, p = .01) for fMRI_IC3 and sMRI_IC19. Figure 5a shows the Z-scored brain maps (visualized at |Z| > 2), Figure 5b presents the group difference of variability loadings for each modality, as well as the correlation between variability loadings and cognitive scores. So the red brain regions in sMRI denote a higher weights in HCs than SZs and the blue brain areas are opposite. While for fMRI component, the red regions indicate a higher variability compared with fMRI_IC3 in HCs than SZs and the blue regions are opposite. More importantly, this linkage can be replicated in independent cohort (COBRE), details can be found in Supplementary “Linkage replication” section.

In addition, we found that fMRI_IC3 also correlated with sMRI_IC3 (r = .14, p = .01; r = .12, p = .051 after regressing out diagnosis). Significant group difference (p = .001*) exist for the variability loadings of sMRI_IC3, which is also positively correlated with speed of processing (r = .349, p = 4.3e−08*), working memory (r = .344, p = 6.1e−08*), visual learning (r = .336, p = 1.5e−07*), verbal learning (r = .337, p = 1.5e−07*), and composite cognitive scores (r = .30, p = 4.1e−06*), as displayed in Figure 6. The well-replicated reduced GM volume in anterior cingulate cortex and insular in schizophrenia are observed in sMRI_IC3. Note that after regressing out group label, the correlation between loadings of sMRI_IC3 and cognitive scores still remain significant. Details are presented in Supplementary “Regression out diagnosis” section.

3.2.2 Modality specific group-discriminative IC
Apart from the linked components pair, we also identified one fMRI component IC16 (Figure 7) containing the left attention network that shows marginal group difference (p = .045), and the corresponding variability loadings is negatively correlated with attention subdomain (r = −.132, p = .045). More importantly, the time courses of fMRI_IC16 were also anti-correlated with attention scores (r = −.187, p = .004).

3.2.3 FNC analysis
One of the advantages of the proposed method compared with other fusion methods is that after the fusion analysis, we can also perform FNC analysis using time courses of the corresponding components. Collectively, 19 components were selected from 20 components, with one artefactual component excluded (Du et al., 2016). Figure 8a,b shows the mean FNC matrix for HCs and SZs. The black lines partition the FNC matrix into the 7 categories: visual (VIS), default-mode network (DMN), auditory (AUD), attention network (ATN), sensorimotor (SM), cerebellar (CB), and sub-cortical (SC). We also compared the group difference of FNC matrix between HCs and SZs. Values in FNC matrix are calculated as −log10(p) × sign(t). Results show that the AUD-VIS, AUD-ATN, DMN-SC are group discriminating FNCs when p was thresholded as p < .01, while the AUD-VIS still remains significant group difference when p was thresholded as p < .0001 (FDR corrected).

3.2.4 Predicting ability

The predictive accuracy is measured by correlation between the estimated cognitive scores and its true values, as well as the normalized root mean squared prediction error (NRMSE; https://en.wikipedia.org/wiki/Root-mean-square_deviation). Multiple cognitive domain prediction results are shown in Figure 9. The five features identified based on the proposed parallel GICA+ICA were able to predict working memory (r = .266, p = 4.5e−05; NRMSE = 0.19), verbal learning (r = .254, p = 8.0e−05; NRMSE = 0.21), and composite cognitive scores (r = .261, p = 6.7e−05; NRMSE = 0.15). More importantly, after regressing out diagnosis, the prediction still keeps significant for verbal learning (r = .157, p = .016) and composite cognitive domains (r = .15, p = .02), demonstrating the effectiveness of the proposed method in identifying linked multimodal biomarkers associated with brain disorders. Note that comparing with the prediction in Figure 9, only using imaging features without the time course information the prediction analysis was lower or not significant (r = .17, p = .008 for working memory; r = .20, p = .001 for verbal learning; r = .11, p = .08 for composite), likely due to the availability of the temporal information throughout the estimation process, again highlighting a benefit of our proposed method.

To test the predictability of the extracted five linked multimodal features by the proposed method on new unseen dataset, we then extracted the brain features in COBRE cohort through fBIRN identified masks and predict an independent cohort (COBRE)‘s multiple MCCB cognitive scores for working memory (r = .235, p = .001; NRMSE = 0.15), verbal learning (r = .25, p = 8.5e−04; NRMSE = 0.17), and composite cognitive scores (r = .256, p = 9.2e−04; NRMSE = 0.18), as displayed in Figure 10. Note that MCCB and CMINDS are similar but not identical cognition assessment batteries (van Erp et al., 2015); hence, the cross-cohort prediction is a strong evidence to demonstrate the predictability of the identified multimodal features detected by parallel GICA+ICA. Furthermore, after regressing out diagnosis, the prediction still keeps significant, details are presented in Supplementary “Regression out diagnosis” section.

4 DISCUSSION AND CONCLUSION
In this article, we proposed a novel temporal information included parallel GICA+ICA, by adding a constraint that maximize the correlation between two variability matrices from fMRI and sMRI, aiming to extract the associated functional network variability with sMRI covariations. Compared with traditional multimodal fusion methods (mCCA, jICA, pICA, pICAR, mCCA+jICA, MCCAR+jICA, linked ICA), parallel GICA+ICA can deal with first-level features (temporal information included) in fMRI related fusion analysis. Another advantage is that we can also perform FNC analysis based on the fMRI results of parallel GICA+ICA. To the best of our knowledge, this is the first established method to incorporate temporal information in multimodal fusion analysis, which provides a new perspective to interpret interrelated patterns between functional network variability and co-varied structural measures.
In simulations, we compared parallel GICA+ICA with separate GICA and ICA on the performance of detecting linked fMRI-sMRI components. Our results indicate that parallel GICA+ICA provides more accurate detection of intermodality linkage regardless of whether it's strong or weak, which means that it does not inflate the link artificially as well as achieves comparable accuracy on both source maps and mixing coefficients. More importantly, the association estimation of separate GICA and ICA decreases remarkably when the real correlation is weak (r = .28) compared with parallel GICA+ICA, as shown in Figure 2c,d, demonstrating the advantages of the proposed method in estimating association with weak linkage, which is always the case in real data.
In the real multimodal brain imaging fusion application, we combined data from gray matter volume and brain function from SZ patients and HCs. One linked component pair (fMRI_IC3-sMRI_IC19) was identified that show significant group difference between SZ and HC. More importantly, this linkage can be replicated in independent cohort. Subcortical regions including bilateral insular, striatum, thalamus, and hippocampus are identified in the linked fMRI_IC3. These regions have been widely reported as dysfunctional brain areas in SZ (Friedman et al., 2008), and also associate with cognitive deficits in SZ (Bush, Valera, & Seidman, 2005; Glahn et al., 2005). SMRI_IC3 is correlated with several cognitive domains, which is consistent with the prefrontal cortex detected in sMRI in our results being involved in multiple high-order cognitive functions including manipulating and maintaining information in problem solving (Minzenberg, Laird, Thelen, Carter, & Glahn, 2009), working memory (Aleman, Hijman, De Haan, & Kahn, 1999; Potkin et al., 2009), and decision making (Barch & Ceaser, 2012). We also identified one modality-specific fMRI component containing the left attention network, whose variability loadings show group difference and also correlated with attention domain. More importantly, the corresponding time courses of fMRI_IC16 are also correlated with attention domain. The identified correspondence between the identified brain areas and the correlation with attention demonstrates well the effectiveness of the proposed method. Apart from the traditional multimodal fusion analysis, we also performed FNC analysis, and identified one abnormal FNC pair between visual and auditory networks compared with SZ and HC, which is widely reported associated with the auditory hallucination impairment with SZ (Friedman et al., 2008; Liu et al., 2018).
Finally, the primary goal of brain imaging studies is to identify biomarkers that can predict individual cognition or symptoms scores (Gupta et al., 2015). Based on the identified multimodal brain features in the linked component pair, multiple cognitive scores can be predicted in fBRIN cohort. These features can also be applied to independent cohort (COBRE) to predict unseen subjects on similar but different cognitive measures. All the above results demonstrate the ability of parallel GICA+ICA in detecting associated multimodal components pair that contains potential biomarkers related with mental disorders, suggesting a wide utility in the brain imaging community (Carter, Heckers, Nichols, Pine, & Strother, 2008; Jiang et al., 2018).
In addition to sMRI, other modalities could also be fused with temporal information included fMRI based on the proposed method, such as fractional anisotropy from dMRI (Caprihan et al., 2011; Sui et al., 2018). Parallel GICA+ICA can be applied straightforwardly to study other brain disorders, such as bipolar disorder, depression (Etkin & Wager, 2007; Qi et al., 2018), and attention-deficit/hyperactivity disorder, suggesting a comprehensive application in brain disorder related imaging studies. Moreover, except for the distance that measuring the subject variability, other measures (such as correlation or covariance between group and subjects) can be directly incorporated into our proposed method as well. Furthermore, apart from static FNC analysis in the current study, dynamic FNC (Calhoun, Miller, Pearlson, & Adali, 2014) can also be calculated based on the parallel GICA+ICA results.
In summary, this study proposed a novel temporal information added multimodal fusion method, that is, parallel GICA+ICA, and verified its effectiveness in both simulation and real human brain imaging data. To the best of our knowledge, this is the first proposed method that can directly link first-level fMRI features with sMRI data, seeking for potential linked multimodal MRI markers in brain disorders. Based on the proposed parallel GICA+ICA, we identified one linked fMRI-sMRI pair that was indicated to be associated with major SZ deficits in multiple reports, which can also be used to predict multiple cognitive scores of fBIRN cohort, as well as an independent COBRE cohort, promising a wide usage of the proposed method in detecting potential linked multimodal biomarkers for psychosis.
ACKNOWLEDGMENTS
This work was supported by the National Institute of Health grants (R56MH117107, R01EB005846, 1R01MH094524, P20GM103472, P30GM122734), the National Science Foundation (1539067), China Natural Science Foundation (61773380), the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB03040100), and Beijing Municipal Science and Technology Commission (Z181100001518005).
CONFLICT OF INTEREST
The authors report no financial interests or potential conflict of interest.
DATA ACCESSIBILITY
The code of parallel GICA+ICA will soon be added in Fusion ICA Toolbox (FIT, http://mialab.mrn.org/software/fit).