Neurosurgical literature classification - Evaluation of three automated methods and time trend analysis of the literature

被引:0
|
作者
Eftekhar, Shayan [1 ]
Eftekhar, Behzad [2 ,3 ,4 ]
机构
[1] Univ Queensland, Brisbane, Australia
[2] Univ Sydney, Fac Med & Hlth, Sydney Med Sch, Dept Neurosurg,Nepean Clin Sch, Sydney, Australia
[3] Macquarie Univ, Australian Sch Adv Med, Dept Neurosurg, Sydney, Australia
[4] Univ Sydney, Nepean Hosp, Dept Neurosurg, Sydney, Australia
关键词
Neurosurgical literature; Text classification; Similarity -based methods; Human vs machine learning performance; Lbl2Vec;
D O I
10.1016/j.heliyon.2024.e26831
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Automated supervised text classification methods require preclassified training data. Their application in scenarios that a large amount of preclassified data is not accessible is challenging. Neurosurgical literature classification into subspecialties is an example of this situation. We have introduced an automated similarity-based text classification method, evaluated it along with two other automated methods and applied the introduced method in neurosurgical literature classification. Methods: Performance of an introduced similarity-based text classification method along with two other automated methods (Lbl2Vec and keyword counting-based methods) was compared with performance of two senior neurosurgery registrars in classification of neurosurgical literature to 5 subspecialties. The Kappa-statistic measure of interrater agreement, overall marginal homogeneity using the Stuart-Maxwell test, marginal homogeneity relative to individual categories using McNemar tests and the sensitivity and specificity of each of the three methods were calculated. The introduced method was used to classify 211617 neurosurgical publications indexed in Pubmed to different subspecialties based on keywords extracted from subspecialty sections of a neurosurgery textbook. Results: The introduced similarity-based method showed the highest agreement with the registrars (raw agreement and Kappa value) followed by the Lbl2Vec and the counting-based method. Classifications of the English neurosurgical publications indexed in Pubmed into categories of Oncology, Vascular, Spine and functional using the introduced similarity-based method were more reliable (closer to the registrars' classifications) than Cranial trauma. The classifications and future forecast showed highest publications in Oncology, followed by Cranial trauma, Vascular, spine and functional neurosurgery. Conclusion: The classification of the English neurosurgical publications indexed in Pubmed to different subspecialties, using the introduced method, shows that Oncology and tumour has been the main battleground for the neurosurgeons over years and probably in the near future. The performance of the introduced classification method in comparison with the human performance shows its potential application in the situations that enough preclassified data are not accessible for automated text classification.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] CLASSIFICATION LITERATURE IN INDIA - ANALYSIS AND TREND
    KAULA, PN
    PRASAD, HN
    INTERNATIONAL CLASSIFICATION, 1981, 8 (03): : 145 - 148
  • [2] Time trend analysis of database and registry use in the neurosurgical literature: evidence for the advance of registry science
    Asher, Anthony L.
    El Sammak, Sally
    Michalopoulos, Giorgos D.
    Yolcu, Yagiz U.
    Alexander, A. Yohan
    Knightly, John J.
    Foley, Kevin T.
    Shaffrey, Christopher I.
    Harbaugh, Robert E.
    Rose, Geoffrey A.
    Coric, Domagoj
    Bisson, Erica F.
    Glassman, Steven D.
    Mummaneni, Praveen V.
    Bydon, Mohamad
    JOURNAL OF NEUROSURGERY, 2022, 136 (06) : 1804 - 1809
  • [3] Entrepreneurship research in service industries: a literature classification and trend analysis
    Devece, Carlos A.
    Palacios-Marques, Daniel
    Fernandez, Rocio
    INTERNATIONAL ENTREPRENEURSHIP AND MANAGEMENT JOURNAL, 2011, 7 (04) : 479 - 493
  • [4] Entrepreneurship research in service industries: a literature classification and trend analysis
    Carlos A. Devece
    Daniel Palacios-Marques
    Rocio Fernandez
    International Entrepreneurship and Management Journal, 2011, 7 : 479 - 493
  • [5] AUTOMATED LITERATURE ANALYSIS
    KOIO, M
    NAUCHNO-TEKHNICHESKAYA INFORMATSIYA SERIYA 2-INFORMATSIONNYE PROTSESSY I SISTEMY, 1967, (06): : 15 - &
  • [6] Analysis of Automated Tools for FAIRness Evaluation: A Literature Perspective
    Azevedo, Leonardo Guerreiro
    Banaggia, Gabriel
    Tesolin, Julio
    Cerqueira, Renato
    SEMANTIC WEB: ESWC 2024 SATELLITE EVENTS, PT II, 2025, 15345 : 149 - 166
  • [7] An automated classification pipeline for tables in pharmacokinetic literature
    Victoria C. Smith
    Ferran Gonzalez Hernandez
    Thanaporn Wattanakul
    Palang Chotsiri
    José Antonio Cordero
    Maria Rosa Ballester
    Màrius Duran
    Olga Fanlo Escudero
    Watjana Lilaonitkul
    Joseph F. Standing
    Frank Kloprogge
    Scientific Reports, 15 (1)
  • [8] A Literature Review on Automated Writing Evaluation
    杨曼
    海外英语, 2018, (03) : 227 - 228+232
  • [9] Suicide in the Philippines: time trend analysis (1974-2005) and literature review
    Maria Theresa Redaniel
    May Antonnette Lebanan-Dalida
    David Gunnell
    BMC Public Health, 11
  • [10] Suicide in the Philippines: time trend analysis (1974-2005) and literature review
    Redaniel, Maria Theresa
    Antonnette Lebanan-Dalida, May
    Gunnell, David
    BMC PUBLIC HEALTH, 2011, 11