Integrating scRNA-seq and scATAC-seq with inter-type attention heterogeneous graph neural networks

被引:0
|
作者
Cai, Lingsheng [1 ]
Ma, Xiuli [1 ]
Ma, Jianzhu [2 ,3 ]
机构
[1] Peking Univ, Sch Intelligence Sci & Technol, State Key Lab Gen Artificial Intelligence, Beijing 100871, Peoples R China
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Tsinghua Univ, Inst AI Ind Res, Beijing 100084, Peoples R China
关键词
single-cell multi-omics integration; heterogeneous graphs; deep learning; modality prediction; RNA;
D O I
10.1093/bib/bbae711
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell multi-omics techniques, which enable the simultaneous measurement of multiple modalities such as RNA gene expression and Assay for Transposase-Accessible Chromatin (ATAC) within individual cells, have become a powerful tool for deciphering the intricate complexity of cellular systems. Most current methods rely on motif databases to establish cross-modality relationships between genes from RNA-seq data and peaks from ATAC-seq data. However, these approaches are constrained by incomplete database coverage, particularly for novel or poorly characterized relationships. To address these limitations, we introduce single-cell Multi-omics Integration (scMI), a heterogeneous graph embedding method that encodes both cells and modality features from single-cell RNA-seq and ATAC-seq data into a shared latent space by learning cross-modality relationships. By modeling cells and modality features as distinct node types, we design an inter-type attention mechanism to effectively capture long-range cross-modality interactions between genes and peaks. Benchmark results demonstrate that embeddings learned by scMI preserve more biological information and achieve comparable or superior performance in downstream tasks including modality prediction, cell clustering, and gene regulatory network inference compared to methods that rely on databases. Furthermore, scMI significantly improves the alignment and integration of unmatched multi-omics data, enabling more accurate embedding and improved outcomes in downstream tasks.
引用
收藏
页数:12
相关论文
共 42 条
  • [41] Late Breaking Results: Attention in Graph2Seq Neural Networks towards Push-Button Analog IC Placement
    Gusmao, Antonio
    Horta, Nuno
    Lourenco, Nuno
    Martins, Ricardo
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 1360 - 1361
  • [42] Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
    Liu, Xingyan
    Shen, Qunlun
    Zhang, Shihua
    GENOME RESEARCH, 2023, 33 (01) : 96 - 111