Revealing relationships between genes and disease phenotypes is a critical problem in biomedical studies, and it is made harder by disease heterogeneity. Patients diagnosed with what appears to be the same disease may form multiple subgroups, each with its own set of important genes. It is therefore imperative to discover the latent subgroups and identify the subgroup-specific important genes. Several heterogeneity analysis methods have been proposed in the recent literature. Despite considerable success, most existing studies remain limited because they cannot accommodate data contamination and ignore the interconnections among genes. To address these shortcomings, we develop a robust structured heterogeneity analysis approach that identifies subgroups, selects important genes, and estimates their effects on the phenotype of interest. Possible data contamination is accommodated by employing the Huber loss function. A sparse overlapping group lasso penalty is imposed for regularized estimation and gene identification while taking into account the possibly overlapping cluster structure of genes. The approach adopts an iterative strategy similar in spirit to K-means clustering. Simulations demonstrate that the proposed approach outperforms alternatives in revealing the heterogeneity and selecting important genes for each subgroup. Analysis of the Cancer Cell Line Encyclopedia data leads to biologically meaningful findings with improved prediction and grouping stability.
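
The iterative strategy described above can be illustrated with a minimal sketch. It is not the authors' implementation. It alternates, K-means style, between refitting subgroup-specific coefficients under the Huber loss and reassigning each sample to the subgroup whose fit gives it the smallest Huber loss. For brevity a plain lasso penalty stands in for the sparse overlapping group lasso penalty, so gene cluster structure is ignored here. All function names, the threshold `delta=1.345`, the penalty level `lam`, and the random initialization are assumptions made for illustration.

```python
import numpy as np

def huber_loss(r, delta=1.345):
    """Elementwise Huber loss: quadratic for |r| <= delta, linear beyond."""
    a = np.abs(r)
    return np.where(a <= delta, 0.5 * r**2, delta * (a - 0.5 * delta))

def huber_grad(r, delta=1.345):
    """Derivative of the Huber loss with respect to the residual r."""
    return np.clip(r, -delta, delta)

def soft_threshold(b, t):
    """Proximal operator of t * ||b||_1 (plain lasso, an assumed stand-in)."""
    return np.sign(b) * np.maximum(np.abs(b) - t, 0.0)

def fit_subgroup(X, y, lam, delta=1.345, n_iter=200):
    """Proximal gradient descent on mean Huber loss + lam * ||beta||_1."""
    n, p = X.shape
    beta = np.zeros(p)
    if n == 0:  # empty subgroup: keep the zero vector
        return beta
    # Step size 1/L, where L = sigma_max(X)^2 / n bounds the curvature.
    step = n / (np.linalg.norm(X, 2) ** 2 + 1e-12)
    for _ in range(n_iter):
        grad = -X.T @ huber_grad(y - X @ beta, delta) / n
        beta = soft_threshold(beta - step * grad, step * lam)
    return beta

def robust_heterogeneity_sketch(X, y, K, lam=0.05, n_rounds=20, seed=0):
    """Alternate between refitting subgroup models and reassigning samples."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(K, size=len(y))  # random initial grouping (assumption)
    for _ in range(n_rounds):
        # Refit step: one sparse robust regression per current subgroup.
        betas = np.stack([
            fit_subgroup(X[labels == k], y[labels == k], lam) for k in range(K)
        ])
        # Assignment step: each sample goes to the subgroup with smallest loss.
        losses = huber_loss(y[:, None] - X @ betas.T)  # shape (n, K)
        new_labels = losses.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels, betas
```

Calling `robust_heterogeneity_sketch(X, y, K=2)` on an `(n, p)` design matrix returns a length-`n` label vector and a `(K, p)` coefficient matrix whose nonzero entries mark the genes selected for each subgroup. Like K-means, the alternation can stall at a poor local solution, so several random starts would be advisable in practice.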