Volume 44, Issue 13-14 e70145
RESEARCH ARTICLE

Statistical Inference for Tensor-Based Covariates in Cox Regression Modeling in Integrated Genome Studies

Chin-Chun Chen

Chin-Chun Chen

Department of Statistics, National Cheng Kung University, Tainan, Taiwan

Search for more papers by this author
Sheng-Mao Chang

Sheng-Mao Chang

Department of Statistics, National Taipei University, Taipei, Taiwan

Search for more papers by this author
Peng-Chan Lin

Peng-Chan Lin

Department of Oncology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan

Search for more papers by this author
Pei-Fang Su

Corresponding Author

Pei-Fang Su

Department of Statistics, National Cheng Kung University, Tainan, Taiwan

Correspondence: Pei-Fang Su ([email protected])

Search for more papers by this author
First published: 05 June 2025
Funding: This work was supported by the National Science and Technology Council (Grant No. 111-2628-M-006-002-MY3).

ABSTRACT

In the form of multidimensional arrays, modeling considering tensor covariates is capable of capturing important feature interactions or encompassing a broader range of data structures. In the study, we proposed a statistical inference based on tensor-based covariates for the Cox regression model in the integrated genome study. The proposed method stands out for reducing the parameter dimension within the partial likelihood function through the application of rank-R decomposition. We introduced the method of estimation and rank selection. Subsequently, variable-specific tensor tests are proposed to evaluate the significance of the covariance effect. Asymptotic normality is obtained for all the regression estimators. Simulation studies, including rank selection, estimation performance, and testing performance, demonstrate the effectiveness of the proposed algorithm. Overall, the proposed method provides readers with the necessary information to understand the genome effect of tensor-based covariates. To benefit readers, the R code for implementing the method is provided. Lastly, we introduce the proposed method using a colorectal cancer dataset, addressing signal detection by simultaneously gathering information across different kinds of omics platforms. This dataset integrates clinical information with multi-omics data, including copy number variation, methylation, and mRNA sequencing data, as tensor-based covariates, and evaluates relationships involving right-censored data.

Conflicts of Interest

The authors declare no conflicts of interest.

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.