Multiresolution categorical regression for interpretable cell-type annotation

被引:0
|
作者
Molstad, Aaron J. [1 ]
Motwani, Keshav [2 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
[2] Univ Washington, Dept Biostat, Seattle, WA USA
基金
美国国家科学基金会;
关键词
categorical response regression; cell-type annotation; convex optimization multinomial logistic regression; multiresolution learning; single-cell RNA-seq;
D O I
10.1111/biom.13926
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In many categorical response regression applications, the response categories admit a multiresolution structure. That is, subsets of the response categories may naturally be combined into coarser response categories. In such applications, practitioners are often interested in estimating the resolution at which a predictor affects the response category probabilities. In this paper, we propose a method for fitting the multinomial logistic regression model in high dimensions that addresses this problem in a unified and data-driven way. Our method allows practitioners to identify which predictors distinguish between coarse categories but not fine categories, which predictors distinguish between fine categories, and which predictors are irrelevant. For model fitting, we propose a scalable algorithm that can be applied when the coarse categories are defined by either overlapping or nonoverlapping sets of fine categories. Statistical properties of our method reveal that it can take advantage of this multiresolution structure in a way existing estimators cannot. We use our method to model cell-type probabilities as a function of a cell's gene expression profile (i.e., cell-type annotation). Our fitted model provides novel biological insights which may be useful for future automated and manual cell-type annotation methodology.
引用
收藏
页码:3485 / 3496
页数:12
相关论文
共 50 条
  • [1] BINNED MULTINOMIAL LOGISTIC REGRESSION FOR INTEGRATIVE CELL-TYPE ANNOTATION
    Motwani, Keshav
    Bacher, Rhonda
    Molstad, Aaron j.
    ANNALS OF APPLIED STATISTICS, 2023, 17 (04): : 3426 - 3449
  • [2] Cell-type annotation with accurate unseen cell-type identification using multiple references
    Xiong, Yi-Xuan
    Wang, Meng-Guo
    Chen, Luonan
    Zhang, Xiao-Fei
    PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (06)
  • [3] Transformer for one stop interpretable cell type annotation
    Chen, Jiawei
    Xu, Hao
    Tao, Wanyu
    Chen, Zhaoxiong
    Zhao, Yuxuan
    Han, Jing-Dong J.
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [4] Transformer for one stop interpretable cell type annotation
    Jiawei Chen
    Hao Xu
    Wanyu Tao
    Zhaoxiong Chen
    Yuxuan Zhao
    Jing-Dong J. Han
    Nature Communications, 14
  • [5] scATAcat: cell-type annotation for scATAC-seq data
    Altay, Aybuge
    Vingron, Martin
    NAR GENOMICS AND BIOINFORMATICS, 2024, 6 (04)
  • [6] Hierarchical and automated cell-type annotation and inference of cancer cell of origin with Census
    Ghaddar, Bassel
    De, Subhajyoti
    BIOINFORMATICS, 2023, 39 (12)
  • [7] Methods for cell-type annotation on scRNA-seq data: A recent overview
    Lazaros, Konstantinos
    Vlamos, Panagiotis
    Vrahatis, Aristidis G.
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2023,
  • [8] A biology-driven deep generative model for cell-type annotation in cytometry
    Blampey, Quentin
    Bercovici, Nadege
    Dutertre, Charles-Antoine
    Pic, Isabelle
    Ribeiro, Joana Mourato
    Andre, Fabrice
    Cournede, Paul-Henry
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [9] Interpretable Hierarchical Bayesian Modeling of Cell-Type Distributions in COVID-19 Disease
    Parsons, Sarah
    Whitener, Nathan P.
    Bhandari, Sapan
    Khuri, Natalia
    2022 56TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2022, : 7 - 12
  • [10] Cell-type specific pallial circuits shape categorical tuning responses in the crow telencephalon
    Ditz, Helen M.
    Fechner, Julia
    Nieder, Andreas
    COMMUNICATIONS BIOLOGY, 2022, 5 (01)