Machine Learning for Biomedical Literature Triage

Cited by: 22
Authors
Almeida, Hayda [1]
Meurs, Marie-Jean [2]
Kosseim, Leila [1]
Butler, Greg [1, 2]
Tsang, Adrian [2]
Affiliations
[1] Concordia Univ, Dept Comp Sci & Software Engn, Montreal, PQ, Canada
[2] Concordia Univ, Ctr Struct & Funct Genom, Montreal, PQ, Canada
Source
PLOS ONE | 2014, Volume 9, Issue 12
Keywords
SUPPORT VECTOR MACHINES; IMBALANCED DATA; TEXT
DOI
10.1371/journal.pone.0115892
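Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract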
This paper presents a machine learning system for supporting the first task of the biological literature manual curation process, called triage. We compare the performance of various classification models, by experimenting with dataset sampling factors and a set of features, as well as three different machine learning algorithms (Naive Bayes, Support Vector Machine and Logistic Model Trees). The results show that the most fitting model to handle the imbalanced datasets of the triage classification task is obtained by using domain relevant features, an under-sampling technique, and the Logistic Model Trees algorithm.
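The abstract names under-sampling as one ingredient of the best-performing model but does not say how the sampling was done. The sketch below is only an illustration of random under-sampling in general, not the authors' procedure: the function name `undersample`, the `ratio` parameter, and the label strings are all hypothetical. It keeps every example of the smallest class and randomly caps each larger class at `ratio` times that size.

```python
import random


def undersample(docs, labels, ratio=1.0, seed=0):
    """Randomly under-sample larger classes (illustrative sketch only).

    Every class is capped at `ratio` times the size of the smallest class.
    `ratio` is a hypothetical stand-in for a dataset sampling factor.
    Returns a shuffled list of (doc, label) pairs.
    """
    rng = random.Random(seed)

    # Group documents by their class label.
    by_label = {}
    for doc, label in zip(docs, labels):
        by_label.setdefault(label, []).append(doc)

    # The smallest class determines the cap for all other classes.
    minority_size = min(len(group) for group in by_label.values())
    cap = max(1, int(minority_size * ratio))

    sampled = []
    for label, group in by_label.items():
        # Keep small classes whole; draw a random subset of large ones.
        kept = group if len(group) <= cap else rng.sample(group, cap)
        sampled.extend((doc, label) for doc in kept)

    rng.shuffle(sampled)
    return sampled
```

Calling `undersample(docs, labels, ratio=2.0)` on a collection with 10 relevant and 90 irrelevant documents would keep all 10 relevant ones and a random 20 of the irrelevant ones. Any classifier, such as the three compared in the paper, would then be trained on the reduced set.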
Pages: 21