Optimisation Models for Pathway Activity Inference in Cancer
被引:0
|
作者:
Chen, Yongnan
论文数: 0引用数: 0
h-index: 0
机构:
Kings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, EnglandKings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, England
Chen, Yongnan
[1
]
Liu, Songsong
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Sch Management, Harbin 150001, Peoples R ChinaKings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, England
Liu, Songsong
[2
]
Papageorgiou, Lazaros G.
论文数: 0引用数: 0
h-index: 0
机构:
UCL, Sargent Ctr Proc Syst Engn, Dept Chem Engn, Torrington Pl, London WC1E 7JE, EnglandKings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, England
Papageorgiou, Lazaros G.
[3
]
Theofilatos, Konstantinos
论文数: 0引用数: 0
h-index: 0
机构:
Kings Coll London, British Heart Fdn Ctr, Sch Cardiovasc & Metab Med & Sci, London SE1 7EH, EnglandKings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, England
Theofilatos, Konstantinos
[4
]
Tsoka, Sophia
论文数: 0引用数: 0
h-index: 0
机构:
Kings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, EnglandKings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, England
Tsoka, Sophia
[1
]
机构:
[1] Kings Coll London, Fac Nat Math & Engn Sci, Dept Informat, London WC2B 4BG, England
[2] Harbin Inst Technol, Sch Management, Harbin 150001, Peoples R China
[3] UCL, Sargent Ctr Proc Syst Engn, Dept Chem Engn, Torrington Pl, London WC1E 7JE, England
[4] Kings Coll London, British Heart Fdn Ctr, Sch Cardiovasc & Metab Med & Sci, London SE1 7EH, England
Simple Summary Subtype classification and prognostic prediction are key research targets in complex diseases such as cancer. In this work, an optimisation model was designed to infer the activity of biological pathways from gene expression values. The optimisation model enables the pathway activity values to separate the sample subtypes to the greatest extent, thereby improving sample classification accuracy. The proposed model was evaluated on cancer molecular subtype classification, robustness to noisy data and survival prediction, and allowed the identification of disease-important genes and pathways. Background: With advances in high-throughput technologies, there has been an enormous increase in data related to profiling the activity of molecules in disease. While such data provide more comprehensive information on cellular actions, their large volume and complexity pose difficulty in accurate classification of disease phenotypes. Therefore, novel modelling methods that can improve accuracy while offering interpretable means of analysis are required. Biological pathways can be used to incorporate a priori knowledge of biological interactions to decrease data dimensionality and increase the biological interpretability of machine learning models. Methodology: A mathematical optimisation model is proposed for pathway activity inference towards precise disease phenotype prediction and is applied to RNA-Seq datasets. The model is based on mixed-integer linear programming (MILP) mathematical optimisation principles and infers pathway activity as the linear combination of pathway member gene expression, multiplying expression values with model-determined gene weights that are optimised to maximise discrimination of phenotype classes and minimise incorrect sample allocation. Results: The model is evaluated on the transcriptome of breast and colorectal cancer, and exhibits solution results of good optimality as well as good prediction performance on related cancer subtypes. Two baseline pathway activity inference methods and three advanced methods are used for comparison. Sample prediction accuracy, robustness against noise expression data, and survival analysis suggest competitive prediction performance of our model while providing interpretability and insight on key pathways and genes. Overall, our work demonstrates that the flexible nature of mathematical programming lends itself well to developing efficient computational strategies for pathway activity inference and disease subtype prediction.
机构:
Queen Mary Univ London, Barts Canc Inst, Ctr Genom & Computat Biol, London EC1M 6BQ, EnglandQueen Mary Univ London, Barts Canc Inst, Ctr Genom & Computat Biol, London EC1M 6BQ, England
机构:
Ajou Univ, Dept Comp Engn, Suwon 16499, South KoreaAjou Univ, Dept Comp Engn, Suwon 16499, South Korea
Kim, Tae Rim
Jeong, Hyun-Hwan
论文数: 0引用数: 0
h-index: 0
机构:
Baylor Coll Med, Dept Mol & Human Genet, Houston, TX 77030 USA
Texas Childrens Hosp, Jan & Dan Duncan Neurol Res Inst, Houston, TX 77030 USAAjou Univ, Dept Comp Engn, Suwon 16499, South Korea
Jeong, Hyun-Hwan
Sohn, Kyung-Ah
论文数: 0引用数: 0
h-index: 0
机构:
Ajou Univ, Dept Comp Engn, Suwon 16499, South KoreaAjou Univ, Dept Comp Engn, Suwon 16499, South Korea