SpaCCC: Large Language Model-Based Cell-Cell Communication Inference for Spatially Resolved Transcriptomic Data

Times cited: 0
Authors
Ji, Boya [1]
Wang, Xiaoqi [2]
Qiao, Debin [3, 4]
Xu, Liwen [1]
Peng, Shaoliang [1]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[2] Northwestern Polytech Univ, Sch Comp Sci, Xian 710000, Peoples R China
[3] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou 450001, Peoples R China
[4] Zhengzhou Univ, Natl Supercomp Ctr Zhengzhou, Zhengzhou 450001, Peoples R China
Source
BIG DATA MINING AND ANALYTICS | 2024, Vol. 7, No. 4
Funding
National Natural Science Foundation of China;
Keywords
Accuracy; Large language models; Transcriptomics; Data visualization; Receivers; Spatial databases; Biology; Reliability; Spatial resolution; Signal resolution; Large Language Models (LLM); spatial transcriptome data; Cell-Cell Communications (CCCs); functional gene interaction networks; unified latent space;
DOI
10.26599/BDMA.2024.9020056
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Drawing parallels between linguistic constructs and cellular biology, Large Language Models (LLMs) have achieved success in diverse downstream applications for single-cell data analysis. However, methods that use LLMs to infer Ligand-Receptor (LR)-mediated cell-cell communications from spatially resolved transcriptomic data are still lacking. Here, we propose SpaCCC to facilitate the inference of spatially resolved cell-cell communications. SpaCCC relies on our fine-tuned single-cell LLM and a functional gene interaction network to embed ligand and receptor genes into a unified latent space. LR pairs that lie significantly closer together in the latent space are taken to be more likely to interact with each other. Molecular diffusion and permutation test strategies are then employed, respectively, to calculate the communication strength and to filter out communications with low specificity. Benchmarked on real single-cell spatial transcriptomic datasets, SpaCCC shows superior performance over other methods. SpaCCC also infers known LR pairs concealed by existing aggregative methods and identifies communication patterns for specific cell types and their signaling pathways. Furthermore, SpaCCC provides various cell-cell communication visualizations at both single-cell and cell-type resolution. In summary, SpaCCC provides a sophisticated and practical tool that allows researchers to decipher spatially resolved cell-cell communications, together with the related communication patterns and signaling pathways, from spatial transcriptome data. SpaCCC is free and publicly available at https://github.com/jiboyalab/SpaCCC.
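The idea that an LR pair is credible when its two genes lie unusually close together in the latent space can be illustrated with a small permutation-style test. The sketch below is not SpaCCC's implementation: the function names `euclidean` and `lr_distance_pvalue`, the choice of Euclidean distance, the null model (random pairs of distinct genes), and the toy embeddings are all assumptions made for illustration.

```python
import math
import random


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def lr_distance_pvalue(embeddings, ligand, receptor, n_perm=1000, seed=0):
    """Empirical p-value that a ligand-receptor pair is closer than random gene pairs.

    embeddings: dict mapping gene name -> list of floats (hypothetical latent vectors).
    Returns (observed distance, p-value).
    """
    rng = random.Random(seed)
    genes = list(embeddings)
    observed = euclidean(embeddings[ligand], embeddings[receptor])
    hits = 0
    for _ in range(n_perm):
        # Null model (an assumption): two distinct genes drawn uniformly at random.
        g1, g2 = rng.sample(genes, 2)
        if euclidean(embeddings[g1], embeddings[g2]) <= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return observed, (1 + hits) / (1 + n_perm)


# Toy data: 200 random 16-dimensional gene vectors plus one deliberately close pair.
toy_rng = random.Random(42)
toy = {f"gene{i}": [toy_rng.gauss(0, 1) for _ in range(16)] for i in range(200)}
toy["LIG"] = [toy_rng.gauss(0, 1) for _ in range(16)]
toy["REC"] = [x + toy_rng.gauss(0, 0.01) for x in toy["LIG"]]

observed, p_value = lr_distance_pvalue(toy, "LIG", "REC")
print(f"distance={observed:.4f}, p={p_value:.4f}")
```

A pair would be retained when its p-value falls below a chosen threshold. The threshold, the distance metric, and the null model are all design choices that the abstract does not specify.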
Pages: 1129-1147
Page count: 19