Neurosurgical literature classification - Evaluation of three automated methods and time trend analysis of the literature

被引：0

作者：

Eftekhar, Shayan ^{[1
]}

Eftekhar, Behzad ^{[2
,3
,4
]}

机构：

[1] Univ Queensland, Brisbane, Australia

[2] Univ Sydney, Fac Med & Hlth, Sydney Med Sch, Dept Neurosurg,Nepean Clin Sch, Sydney, Australia

[3] Macquarie Univ, Australian Sch Adv Med, Dept Neurosurg, Sydney, Australia

[4] Univ Sydney, Nepean Hosp, Dept Neurosurg, Sydney, Australia

来源：

HELIYON | 2024年 / 10卷 / 05期

关键词：

Neurosurgical literature; Text classification; Similarity -based methods; Human vs machine learning performance; Lbl2Vec;

D O I：

10.1016/j.heliyon.2024.e26831

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Background: Automated supervised text classification methods require preclassified training data. Their application in scenarios that a large amount of preclassified data is not accessible is challenging. Neurosurgical literature classification into subspecialties is an example of this situation. We have introduced an automated similarity-based text classification method, evaluated it along with two other automated methods and applied the introduced method in neurosurgical literature classification. Methods: Performance of an introduced similarity-based text classification method along with two other automated methods (Lbl2Vec and keyword counting-based methods) was compared with performance of two senior neurosurgery registrars in classification of neurosurgical literature to 5 subspecialties. The Kappa-statistic measure of interrater agreement, overall marginal homogeneity using the Stuart-Maxwell test, marginal homogeneity relative to individual categories using McNemar tests and the sensitivity and specificity of each of the three methods were calculated. The introduced method was used to classify 211617 neurosurgical publications indexed in Pubmed to different subspecialties based on keywords extracted from subspecialty sections of a neurosurgery textbook. Results: The introduced similarity-based method showed the highest agreement with the registrars (raw agreement and Kappa value) followed by the Lbl2Vec and the counting-based method. Classifications of the English neurosurgical publications indexed in Pubmed into categories of Oncology, Vascular, Spine and functional using the introduced similarity-based method were more reliable (closer to the registrars' classifications) than Cranial trauma. The classifications and future forecast showed highest publications in Oncology, followed by Cranial trauma, Vascular, spine and functional neurosurgery. Conclusion: The classification of the English neurosurgical publications indexed in Pubmed to different subspecialties, using the introduced method, shows that Oncology and tumour has been the main battleground for the neurosurgeons over years and probably in the near future. The performance of the introduced classification method in comparison with the human performance shows its potential application in the situations that enough preclassified data are not accessible for automated text classification.

引用

页数：8

共 50 条

[31] Literature classification for semi-automated updating of biological knowledgebases
Lars Rønn Olsen
Ulrich Johan Kudahl
Ole Winther
Vladimir Brusic
BMC Genomics, 14
[32] A Systematic Literature Review on Machine Learning for Automated Requirements Classification
Manuel Perez-Verdejo, J.
Sanchez-Garcia, Angel J.
Octavio Ocharan-Hernandez, Jorge
2020 8TH EDITION OF THE INTERNATIONAL CONFERENCE IN SOFTWARE ENGINEERING RESEARCH AND INNOVATION (CONISOFT 2020), 2020, : 21 - 28
[33] A literature Review of Financial Analysis: Status and Development Trend
Yang, Fangwen
Liu, Yeling
2015 3RD INTERNATIONAL CONFERENCE ON SOCIAL SCIENCES RESEARCH (SSR 2015), 2015, 13 : 102 - 105
[34] Trends in Classification Literature: Analysis of Literature Published during 2000 to 2009
Kumbhar, Rajendra
DESIDOC JOURNAL OF LIBRARY & INFORMATION TECHNOLOGY, 2012, 32 (02): : 179 - 185
[35] An evaluation of three automated methods for the measurement of cortisol
Blick, K
Fry, H
Chaney, C
Watson, L
Hagan, J
Passey, R
Gillum, R
CLINICAL CHEMISTRY, 1996, 42 (06) : 382 - 382
[36] Variables, mechanisms, and simulations : can the three methods be synthesized ? A critical analysis of the literature
Manzo, G
REVUE FRANCAISE DE SOCIOLOGIE, 2005, 46 (01): : 37 - +
[37] Variables, mechanisms, and simulations: Can the three methods be synthesized? A critical analysis of the literature
Manzo, Gianluca
REVUE FRANCAISE DE SOCIOLOGIE, 2007, 48 : 35 - 71
[38] Ecodesign Methods for Mechatronic Systems: A Literature Review and Classification
Mcrschak, Simon
Hehenberger, Peter
PROCEEDINGS OF THE 2019 20TH INTERNATIONAL CONFERENCE ON RESEARCH AND EDUCATION IN MECHATRONICS (REM 2019), 2019,
[39] Electric load forecasting: literature survey and classification of methods
Alfares, HK
Nazeeruddin, M
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2002, 33 (01) : 23 - 34
[40] METHODS FOR TIME -VARYING EXPOSURE - A LITERATURE REVIEW
Linder, M.
Vago, E.
Bahmanyar, S.
Heeg, B.
Myers, D.
Zhang, M.
Andersen, M.
VALUE IN HEALTH, 2016, 19 (07) : A382 - A382

← 1 2 3 4 5 →