A Binary Classifier for the Prediction of EC Numbers of Enzymes

被引:35
|
作者
Cui, Hao [1 ]
Chen, Lei [1 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
基金
上海市自然科学基金;
关键词
Enzyme; EC number; support vector machine; protein-protein interaction; Weka; binary classification; five-fold cross-validation; AMINO-ACID-COMPOSITION; SUPPORT VECTOR MACHINE; PROTEIN-STRUCTURE; FAMILY CLASSES; GENES; ALGORITHM;
D O I
10.2174/1570164616666190126103036
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Identification of Enzyme Commission (EC) number of enzymes is quite important for understanding the metabolic processes that produce enough energy to sustain life. Previous studies mainly focused on predicting six main functional classes or sub-functional classes, i.e., the first two digits of the EC number. Objective: In this study, a binary classifier was proposed to identify the full EC number (four digits) of enzymes. Methods: Enzymes and their known EC numbers were paired as positive samples and negative samples were randomly produced that were as many as positive samples. The associations between any two samples were evaluated by integrating the linkages between enzymes and EC numbers. The classic machining learning algorithm, Support Vector Machine (SVM), was adopted as the prediction engine. Results: The five-fold cross-validation test on five datasets indicated that the overall accuracy, Matthews correlation coefficient and Fl-measure were about 0.786, 0.576 and 0.771, respectively, suggesting the utility of the proposed classifier. In addition, the effectiveness of the classifier was elaborated by comparing it with other classifiers that were based on other classic machine learning algorithms. Conclusion: The proposed classifier was quite effective for prediction of EC number of enzymes and was specially designed for dealing with the problem addressed in this study by testing it on five datasets containing randomly produced samples.
引用
收藏
页码:383 / 391
页数:9
相关论文
共 50 条
  • [41] GETTING MEASURE OF BINARY NUMBERS
    MOORE, JW
    MACHINE DESIGN, 1974, 46 (11) : 114 - 117
  • [42] A BINARY SYSTEM FOR COMPLEX NUMBERS
    PENNEY, W
    JOURNAL OF THE ACM, 1965, 12 (02) : 247 - &
  • [43] Classifier Adaptation at Prediction Time
    Royer, Amelie
    Lampert, Christoph H.
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1401 - 1409
  • [44] HARDWARE REALIZATION OF FAST CLASSIFIER FOR BINARY AND NON-BINARY PATTERNS
    SCHMID, D
    NACHRICHTENTECHNISCHE ZEITSCHRIFT, 1970, 23 (04): : 185 - &
  • [45] Decomposition of Binary Mixtures of DMC/EC, EMC/EC, and DEC/EC on Potassium Surfaces; GC, XPS, and Calculation
    Wildersinn, Leonie
    Stottmeister, Daniel
    Jeschull, Fabian
    Gross, Axel
    Hofmann, Andreas
    ACS APPLIED MATERIALS & INTERFACES, 2025, 17 (06) : 10055 - 10072
  • [46] Statistical criterion for comparison of binary classifier accuracy
    Strebkov, E. V.
    Zheltukhin, V. S.
    Borodaev, I. A.
    12TH INTERNATIONAL CONFERENCE - MESH METHODS FOR BOUNDARY: VALUE PROBLEMS AND APPLICATIONS, 2019, 1158
  • [47] Evaluation of diversity measures for binary classifier ensembles
    Narasinihamurthy, A
    MULTIPLE CLASSIFIER SYSTEMS, 2005, 3541 : 267 - 277
  • [48] KDV classifier: a novel approach for binary classification
    Krishna Gopal Sharma
    Yashpal Singh
    Multimedia Tools and Applications, 2022, 81 : 42241 - 42259
  • [49] Finite State Machine Based Binary Classifier
    Tetik, Yusuf Engin
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [50] Binary Spectrum Feature for Improved Classifier Performance
    Ulapane, Nalika
    Thiyagarajan, Karthick
    Kodagoda, Sarath
    16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020), 2020, : 1117 - 1122