MiMultiCat: A Unified Cloud Platform for the Analysis of Microbiome Data with Multi-Categorical Responses

Authors
Kim, Jihun [1]
Jang, Hyojung [1]
Koh, Hyunwook [1]
Affiliations
[1] State Univ New York SUNY, Dept Appl Math & Stat, Incheon 21985, South Korea
Source
BIOENGINEERING-BASEL | 2024 / Vol. 11 / Issue 01
Funding
National Research Foundation, Singapore;
Keywords
microbiome data analysis; cloud computing; human microbiome; multi-categorical response; microbiome association testing; microbiome prediction modeling;
DOI
10.3390/bioengineering11010060
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
The field of the human microbiome is rapidly growing due to the recent advances in high-throughput sequencing technologies. Meanwhile, there have also been many new analytic pipelines, methods and/or tools developed for microbiome data preprocessing and analytics. They are usually focused on microbiome data with continuous (e.g., body mass index) or binary responses (e.g., diseased vs. healthy), yet multi-categorical responses that have more than two categories are also common in reality. In this paper, we introduce a new unified cloud platform, named MiMultiCat, for the analysis of microbiome data with multi-categorical responses. The two main distinguishing features of MiMultiCat are as follows: First, MiMultiCat streamlines a long sequence of microbiome data preprocessing and analytic procedures on user-friendly web interfaces; as such, it is easy to use for many people in various disciplines (e.g., biology, medicine, public health). Second, MiMultiCat performs both association testing and prediction modeling extensively. For association testing, MiMultiCat handles both ecological (e.g., alpha and beta diversity) and taxonomical (e.g., phylum, class, order, family, genus, species) contexts through covariate-adjusted or unadjusted analysis. For prediction modeling, MiMultiCat employs the random forest and gradient boosting algorithms that are well suited to microbiome data while providing nice visual interpretations. We demonstrate its use through the reanalysis of gut microbiome data on obesity with body mass index categories. MiMultiCat is freely available on our web server.
Pages: 14