Integrating scRNA-seq and scATAC-seq with inter-type attention heterogeneous graph neural networks

被引:0
|
作者
Cai, Lingsheng [1 ]
Ma, Xiuli [1 ]
Ma, Jianzhu [2 ,3 ]
机构
[1] Peking Univ, Sch Intelligence Sci & Technol, State Key Lab Gen Artificial Intelligence, Beijing 100871, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Inst AI Ind Res, Beijing 100084, Peoples R China
关键词
single-cell multi-omics integration; heterogeneous graphs; deep learning; modality prediction; RNA;
D O I
10.1093/bib/bbae711
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell multi-omics techniques, which enable the simultaneous measurement of multiple modalities such as RNA gene expression and Assay for Transposase-Accessible Chromatin (ATAC) within individual cells, have become a powerful tool for deciphering the intricate complexity of cellular systems. Most current methods rely on motif databases to establish cross-modality relationships between genes from RNA-seq data and peaks from ATAC-seq data. However, these approaches are constrained by incomplete database coverage, particularly for novel or poorly characterized relationships. To address these limitations, we introduce single-cell Multi-omics Integration (scMI), a heterogeneous graph embedding method that encodes both cells and modality features from single-cell RNA-seq and ATAC-seq data into a shared latent space by learning cross-modality relationships. By modeling cells and modality features as distinct node types, we design an inter-type attention mechanism to effectively capture long-range cross-modality interactions between genes and peaks. Benchmark results demonstrate that embeddings learned by scMI preserve more biological information and achieve comparable or superior performance in downstream tasks including modality prediction, cell clustering, and gene regulatory network inference compared to methods that rely on databases. Furthermore, scMI significantly improves the alignment and integration of unmatched multi-omics data, enabling more accurate embedding and improved outcomes in downstream tasks.
引用
收藏
页数:12
相关论文
共 42 条
  • [31] Adaptive Weight Sampling and Graph Transformer Neural Network Framework for Cell Type Annotation of Scrna-seq Data
    Yuan, Lin
    Sun, Shengguo
    Li, Zhujun
    Zhang, Shoukang
    Wang, Xingang
    Geng, Yushui
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 15 - 24
  • [32] Gene Regulatory Network Inference Using Convolutional Neural Networks from scRNA-seq Data
    Mao, Guo
    Pang, Zhengbin
    Zuo, Ke
    Liu, Jie
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (05) : 619 - 631
  • [33] A unified framework for cell-type-specific eQTL prioritization by integrating bulk and scRNA-seq data
    Yu, Xinyi
    Hu, Xianghong
    Wan, Xiaomeng
    Zhang, Zhiyong
    Wan, Xiang
    Cai, Mingxuan
    Yu, Tianwei
    Xiao, Jiashun
    AMERICAN JOURNAL OF HUMAN GENETICS, 2025, 112 (02)
  • [34] scMGCN: A Multi-View Graph Convolutional Network for Cell Type Identification in scRNA-seq Data
    Sun, Hongmin
    Qu, Haowen
    Duan, Kaifu
    Du, Wei
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (04)
  • [35] scZAG: Integrating ZINB-Based Autoencoder with Adaptive Data Augmentation Graph Contrastive Learning for scRNA-seq Clustering
    Zhang, Tianjiao
    Ren, Jixiang
    Li, Liangyu
    Wu, Zhenao
    Zhang, Ziheng
    Dong, Guanghui
    Wang, Guohua
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (11)
  • [36] A New Graph Autoencoder-Based Consensus-Guided Model for scRNA-seq Cell Type Detection
    Zhang, Dai-Jun
    Gao, Ying-Lian
    Zhao, Jing-Xiu
    Zheng, Chun-Hou
    Liu, Jin-Xing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2473 - 2483
  • [37] Identification of COVID-19-Associated DNA Methylation Variations by Integrating Methylation Array and scRNA-Seq Data at Cell-Type Resolution
    Wang, Guoliang
    Xiong, Zhuang
    Yang, Fei
    Zheng, Xinchang
    Zong, Wenting
    Li, Rujiao
    Bao, Yiming
    GENES, 2022, 13 (07)
  • [38] How regulatory sequences learn cell representations New computational method uses convolutional neural networks for cis-regulatory sequence analysis to analyze and cluster scATAC-seq data
    Aerts, Stein
    NATURE METHODS, 2022, 19 (09) : 1041 - 1043
  • [39] Integrating spatial transcriptomics and bulk RNA-seq: predicting gene expression with enhanced resolution through graph attention networks
    Baul, Sudipto
    Ahmed, Khandakar Tanvir
    Jiang, Qibing
    Wang, Guangyu
    Li, Qian
    Yong, Jeongsik
    Zhang, Wei
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (04)
  • [40] A Multi-omics approach to identify and validate shared genetic architecture in rheumatoid arthritis, multiple sclerosis, and type 1 diabetes: integrating GWAS, GEO, MSigDB, and scRNA-seq data
    Tailin Wang
    Qian He
    Kei Hang Katie Chan
    Functional & Integrative Genomics, 2025, 25 (1)