Neurosurgical literature classification - Evaluation of three automated methods and time trend analysis of the literature

被引:0
|
作者
Eftekhar, Shayan [1 ]
Eftekhar, Behzad [2 ,3 ,4 ]
机构
[1] Univ Queensland, Brisbane, Australia
[2] Univ Sydney, Fac Med & Hlth, Sydney Med Sch, Dept Neurosurg,Nepean Clin Sch, Sydney, Australia
[3] Macquarie Univ, Australian Sch Adv Med, Dept Neurosurg, Sydney, Australia
[4] Univ Sydney, Nepean Hosp, Dept Neurosurg, Sydney, Australia
关键词
Neurosurgical literature; Text classification; Similarity -based methods; Human vs machine learning performance; Lbl2Vec;
D O I
10.1016/j.heliyon.2024.e26831
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Automated supervised text classification methods require preclassified training data. Their application in scenarios that a large amount of preclassified data is not accessible is challenging. Neurosurgical literature classification into subspecialties is an example of this situation. We have introduced an automated similarity-based text classification method, evaluated it along with two other automated methods and applied the introduced method in neurosurgical literature classification. Methods: Performance of an introduced similarity-based text classification method along with two other automated methods (Lbl2Vec and keyword counting-based methods) was compared with performance of two senior neurosurgery registrars in classification of neurosurgical literature to 5 subspecialties. The Kappa-statistic measure of interrater agreement, overall marginal homogeneity using the Stuart-Maxwell test, marginal homogeneity relative to individual categories using McNemar tests and the sensitivity and specificity of each of the three methods were calculated. The introduced method was used to classify 211617 neurosurgical publications indexed in Pubmed to different subspecialties based on keywords extracted from subspecialty sections of a neurosurgery textbook. Results: The introduced similarity-based method showed the highest agreement with the registrars (raw agreement and Kappa value) followed by the Lbl2Vec and the counting-based method. Classifications of the English neurosurgical publications indexed in Pubmed into categories of Oncology, Vascular, Spine and functional using the introduced similarity-based method were more reliable (closer to the registrars' classifications) than Cranial trauma. The classifications and future forecast showed highest publications in Oncology, followed by Cranial trauma, Vascular, spine and functional neurosurgery. Conclusion: The classification of the English neurosurgical publications indexed in Pubmed to different subspecialties, using the introduced method, shows that Oncology and tumour has been the main battleground for the neurosurgeons over years and probably in the near future. The performance of the introduced classification method in comparison with the human performance shows its potential application in the situations that enough preclassified data are not accessible for automated text classification.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Classification of Evaluation Methods for the Effective Assessment of Simulation Games: Results from a Literature Review
    Faizan, Nilufer
    Loeffler, Alexander
    Heininger, Robert
    Utesch, Matthias
    Krcmar, Helmut
    INTERNATIONAL JOURNAL OF ENGINEERING PEDAGOGY, 2019, 9 (01): : 19 - 33
  • [22] Distributed Maintenance: A Literature Analysis and Classification
    Attajer, Ali
    Darmoul, Saber
    Riane, Fouad
    Bouras, Abdelghani
    IFAC PAPERSONLINE, 2019, 52 (13): : 619 - 624
  • [23] The Analysis of the Late Ming Dynasty Literature Ideological Trend
    Wang, Ruoming
    2013 THE FOURTH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND EDUCATION APPLICATION (ICEA 2013), PT 1, 2013, 30 : 453 - 457
  • [24] Lean Service: A literature analysis and classification
    Suarez-Barraza, Manuel F.
    Smith, Tricia
    Dahlgaard-Park, Su Mi
    TOTAL QUALITY MANAGEMENT & BUSINESS EXCELLENCE, 2012, 23 (3-4) : 359 - 380
  • [25] Forensics in the cloud: A literature analysis and classification
    Agbedanu, Promise R.
    Wang, Pengwei
    Nortey, Richard N.
    Odartey, Lamptey K.
    5TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2019), 2019, : 124 - 132
  • [26] A topic trend analysis on COVID-19 literature
    Urru, Sara
    Sciannameo, Veronica
    Lanera, Corrado
    Salaris, Silvano
    Gregori, Dario
    Berchialla, Paola
    DIGITAL HEALTH, 2022, 8
  • [27] Literature classification for semi-automated updating of biological knowledgebases
    Olsen, Lars Ronn
    Kudahl, Ulrich Johan
    Winther, Ole
    Brusic, Vladimir
    BMC GENOMICS, 2013, 14
  • [28] Integrated access to legal literature through automated semantic classification
    Francesconi, E.
    Peruginelli, G.
    ARTIFICIAL INTELLIGENCE AND LAW, 2009, 17 (01) : 31 - 49
  • [29] TRENDS IN NEUROIMAGING METHODS: LARGE-SCALE AUTOMATED ANALYSIS OF THE FMRI LITERATURE.
    Carp, Joshua
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2013, : 201 - 202
  • [30] A systematic literature review of automated clinical coding and classification systems
    Stanfill, Mary H.
    Williams, Margaret
    Fenton, Susan H.
    Jenders, Robert A.
    Hersh, William B.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (06) : 646 - 651