Applications of Machine Learning for the Classification of Porcine Reproductive and Respiratory Syndrome Virus Sublineages Using Amino Acid Scores of ORF5 Gene

被引:5
|
作者
Kim, Jeonghoon [1 ]
Lee, Kyuyoung [2 ]
Rupasinghe, Ruwini [2 ]
Rezaei, Shahbaz [3 ]
Martinez-Lopez, Beatriz [2 ]
Liu, Xin [3 ]
机构
[1] Univ Calif Davis, Dept Math, Davis, CA 95616 USA
[2] Univ Calif Davis, Sch Vet Med, Ctr Anim Dis Modeling & Surveillance CADMS, Dept Med & Epidemiol, Davis, CA 95616 USA
[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
artificial intelligence; random forest; k-nearest neighbor; support vector machine; swine health; phylogenetic tree; multilayer perceptron; classification; IDENTIFICATION; DETERMINANTS; PATTERN;
D O I
10.3389/fvets.2021.683134
中图分类号
S85 [动物医学(兽医学)];
学科分类号
0906 ;
摘要
Porcine reproductive and respiratory syndrome is an infectious disease of pigs caused by PRRS virus (PRRSV). A modified live-attenuated vaccine has been widely used to control the spread of PRRSV and the classification of field strains is a key for a successful control and prevention. Restriction fragment length polymorphism targeting the Open reading frame 5 (ORF5) genes is widely used to classify PRRSV strains but showed unstable accuracy. Phylogenetic analysis is a powerful tool for PRRSV classification with consistent accuracy but it demands large computational power as the number of sequences gets increased. Our study aimed to apply four machine learning (ML) algorithms, random forest, k-nearest neighbor, support vector machine and multilayer perceptron, to classify field PRRSV strains into four clades using amino acid scores based on ORF5 gene sequence. Our study used amino acid sequences of ORF5 gene in 1931 field PRRSV strains collected in the US from 2012 to 2020. Phylogenetic analysis was used to labels field PRRSV strains into one of four clades: Lineage 5 or three clades in Linage 1. We measured accuracy and time consumption of classification using four ML approaches by different size of gene sequences. We found that all four ML algorithms classify a large number of field strains in a very short time (<2.5 s) with very high accuracy (>0.99 Area under curve of the Receiver of operating characteristics curve). Furthermore, the random forest approach detects a total of 4 key amino acid positions for the classification of field PRRSV strains into four clades. Our finding will provide an insightful idea to develop a rapid and accurate classification model using genetic information, which also enables us to handle large genome datasets in real time or semi-real time for data-driven decision-making and more timely surveillance.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Novel structural protein in porcine reproductive and respiratory syndrome virus encoded by an alternative ORF5 present in all arteriviruses
    Johnson, Craig R.
    Griggs, Theodor F.
    Gnanandarajah, Josephine
    Murtaugh, Michael P.
    JOURNAL OF GENERAL VIROLOGY, 2011, 92 : 1107 - 1116
  • [22] Positioning Quebec ORF5 sequences of porcine reproductive and respiratory syndrome virus (PRRSV) within Canada and worldwide diversity
    Lambert, Marie-Eve
    Delisle, Benjamin
    Arsenault, Julie
    Poljak, Zvonimir
    D'Allaire, Sylvie
    INFECTION GENETICS AND EVOLUTION, 2019, 74
  • [23] Pathogenicity of porcine reproductive and respiratory syndrome virus (ORF5 RFLP 1-7-4 viruses) in China
    Xie, Chang-Zhan
    Ha, Zhuo
    Zhang, Han
    Zhang, Ying
    Xie, Yu-Biao
    Zhang, He
    Nan, Fu-Long
    Wang, Zheng
    Zhang, Ping
    Xu, Wang
    Han, Ji-Cheng
    Wen, Shu-Bo
    Lu, Hui-Jun
    Jin, Ning-Yi
    TRANSBOUNDARY AND EMERGING DISEASES, 2020, 67 (05) : 2065 - 2072
  • [24] RT-PCR detection of porcine reproductive and respiratory syndrome virus based on the ORF5 gene in mainland China, 2012-2015
    Peng, Z.
    Zhao, T.
    Liang, W.
    Song, W.
    Gao, Z.
    Tang, X.
    Chen, H.
    Wu, B.
    ACTA VIROLOGICA, 2017, 61 (03) : 336 - 340
  • [25] Analysis of ORF5 sequences of Porcine Reproductive and Respiratory Syndrome virus (PRRSV) circulating within swine farms in Costa Rica
    Guzman, Monica
    Melendez, Ronald
    Jimenez, Carlos
    Piche, Marta
    Jimenez, Emily
    Leon, Bernal
    Cordero, Juan M.
    Ramirez-Carvajal, Lisbeth
    Uribe, Alberto
    Van Nes, Arie
    Stegeman, Arjan
    Romero, Juan Jose
    BMC VETERINARY RESEARCH, 2021, 17 (01)
  • [26] Phylogenetic analysis of ORF5 and ORF7 of porcine reproductive and respiratory syndrome (PRRS) virus and the frequency of wild-type PRRS virus in Mexico
    Martinez-Bautista, N. R.
    Sciutto-Conde, E.
    Cervantes-Torres, J.
    Segura-Velazquez, R.
    Mercado Garcia, M. C.
    Ramirez-Mendoza, H.
    Trujillo Ortega, M. E.
    Delgadillo Alvarez, J.
    Castillo-Juarez, H.
    Sanchez-Betancourt, J. I.
    TRANSBOUNDARY AND EMERGING DISEASES, 2018, 65 (04) : 993 - 1008
  • [27] Characterization of porcine reproductive and respiratory syndrome virus (ORF5 RFLP 1-7-4 viruses) in northern China
    Xie, Changzhan
    Ha, Zhuo
    Nan, Fulong
    Zhang, Ying
    Zhang, Han
    Li, Jinfeng
    Zhang, Ping
    Han, Jicheng
    Zhang, He
    Zhuang, Xinyu
    Zhang, Jinyong
    Lu, Huijun
    Jin, Ningyi
    MICROBIAL PATHOGENESIS, 2020, 140
  • [28] Analysis of Quasispecies Diversity and Mutations for Highly Pathogenic Porcine Reproductive and Respiratory Syndrome Virus in China Based on ORF5 Genes
    Zhao, Peng
    Ma, Cheng-Tai
    Cui, Zhi-Zhong
    JOURNAL OF ANIMAL AND VETERINARY ADVANCES, 2012, 11 (15): : 2741 - 2746
  • [29] Analysis of ORF5 sequences of Porcine Reproductive and Respiratory Syndrome virus (PRRSV) circulating within swine farms in Costa Rica
    Mónica Guzmán
    Ronald Meléndez
    Carlos Jiménez
    Marta Piche
    Emily Jiménez
    Bernal León
    Juan M. Cordero
    Lisbeth Ramirez-Carvajal
    Alberto Uribe
    Arie Van Nes
    Arjan Stegeman
    Juan José Romero
    BMC Veterinary Research, 17
  • [30] Genetic variation analysis of porcine reproductive and respiratory syndrome virus isolated in China from 2002 to 2007 based on ORF5
    Li, Yufeng
    Wang, Xinglong
    Jiang, Ping
    Wang, Xiaofeng
    Chen, Wen
    Wang, Xianwei
    Wang, Kewei
    VETERINARY MICROBIOLOGY, 2009, 138 (1-2) : 150 - 155