Neurosurgical literature classification - Evaluation of three automated methods and time trend analysis of the literature

被引:0
|
作者
Eftekhar, Shayan [1 ]
Eftekhar, Behzad [2 ,3 ,4 ]
机构
[1] Univ Queensland, Brisbane, Australia
[2] Univ Sydney, Fac Med & Hlth, Sydney Med Sch, Dept Neurosurg,Nepean Clin Sch, Sydney, Australia
[3] Macquarie Univ, Australian Sch Adv Med, Dept Neurosurg, Sydney, Australia
[4] Univ Sydney, Nepean Hosp, Dept Neurosurg, Sydney, Australia
关键词
Neurosurgical literature; Text classification; Similarity -based methods; Human vs machine learning performance; Lbl2Vec;
D O I
10.1016/j.heliyon.2024.e26831
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Automated supervised text classification methods require preclassified training data. Their application in scenarios that a large amount of preclassified data is not accessible is challenging. Neurosurgical literature classification into subspecialties is an example of this situation. We have introduced an automated similarity-based text classification method, evaluated it along with two other automated methods and applied the introduced method in neurosurgical literature classification. Methods: Performance of an introduced similarity-based text classification method along with two other automated methods (Lbl2Vec and keyword counting-based methods) was compared with performance of two senior neurosurgery registrars in classification of neurosurgical literature to 5 subspecialties. The Kappa-statistic measure of interrater agreement, overall marginal homogeneity using the Stuart-Maxwell test, marginal homogeneity relative to individual categories using McNemar tests and the sensitivity and specificity of each of the three methods were calculated. The introduced method was used to classify 211617 neurosurgical publications indexed in Pubmed to different subspecialties based on keywords extracted from subspecialty sections of a neurosurgery textbook. Results: The introduced similarity-based method showed the highest agreement with the registrars (raw agreement and Kappa value) followed by the Lbl2Vec and the counting-based method. Classifications of the English neurosurgical publications indexed in Pubmed into categories of Oncology, Vascular, Spine and functional using the introduced similarity-based method were more reliable (closer to the registrars' classifications) than Cranial trauma. The classifications and future forecast showed highest publications in Oncology, followed by Cranial trauma, Vascular, spine and functional neurosurgery. Conclusion: The classification of the English neurosurgical publications indexed in Pubmed to different subspecialties, using the introduced method, shows that Oncology and tumour has been the main battleground for the neurosurgeons over years and probably in the near future. The performance of the introduced classification method in comparison with the human performance shows its potential application in the situations that enough preclassified data are not accessible for automated text classification.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Literature classification for semi-automated updating of biological knowledgebases
    Lars Rønn Olsen
    Ulrich Johan Kudahl
    Ole Winther
    Vladimir Brusic
    BMC Genomics, 14
  • [32] A Systematic Literature Review on Machine Learning for Automated Requirements Classification
    Manuel Perez-Verdejo, J.
    Sanchez-Garcia, Angel J.
    Octavio Ocharan-Hernandez, Jorge
    2020 8TH EDITION OF THE INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION (CONISOFT 2020), 2020, : 21 - 28
  • [33] A literature Review of Financial Analysis: Status and Development Trend
    Yang, Fangwen
    Liu, Yeling
    2015 3RD INTERNATIONAL CONFERENCE ON SOCIAL SCIENCES RESEARCH (SSR 2015), 2015, 13 : 102 - 105
  • [34] Trends in Classification Literature: Analysis of Literature Published during 2000 to 2009
    Kumbhar, Rajendra
    DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2012, 32 (02): : 179 - 185
  • [35] An evaluation of three automated methods for the measurement of cortisol
    Blick, K
    Fry, H
    Chaney, C
    Watson, L
    Hagan, J
    Passey, R
    Gillum, R
    CLINICAL CHEMISTRY, 1996, 42 (06) : 382 - 382
  • [36] Variables, mechanisms, and simulations : can the three methods be synthesized ? A critical analysis of the literature
    Manzo, G
    REVUE FRANCAISE DE SOCIOLOGIE, 2005, 46 (01): : 37 - +
  • [37] Variables, mechanisms, and simulations: Can the three methods be synthesized? A critical analysis of the literature
    Manzo, Gianluca
    REVUE FRANCAISE DE SOCIOLOGIE, 2007, 48 : 35 - 71
  • [38] Ecodesign Methods for Mechatronic Systems: A Literature Review and Classification
    Mcrschak, Simon
    Hehenberger, Peter
    PROCEEDINGS OF THE 2019 20TH INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM 2019), 2019,
  • [39] Electric load forecasting: literature survey and classification of methods
    Alfares, HK
    Nazeeruddin, M
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2002, 33 (01) : 23 - 34
  • [40] METHODS FOR TIME -VARYING EXPOSURE - A LITERATURE REVIEW
    Linder, M.
    Vago, E.
    Bahmanyar, S.
    Heeg, B.
    Myers, D.
    Zhang, M.
    Andersen, M.
    VALUE IN HEALTH, 2016, 19 (07) : A382 - A382