Enhancing random forest classification with NLP in DAMEH: A system for DAta Management in eHealth Domain

被引:8
|
作者
Amato, Flora [1 ]
Coppolino, Luigi [2 ]
Cozzolino, Giovanni [1 ]
Mazzeo, Giovanni [1 ]
Moscato, Francesco [3 ]
Nardone, Roberto [4 ]
机构
[1] Univ Naples Federico II, DIETI, Naples, Italy
[2] Univ Naples Parthenope, DI, Naples, Italy
[3] Univ Salerno, DIEM, Fisciano, Italy
[4] Univ Mediterranea Reggio Calabria, DIIES, Reggio Di Calabria, Italy
关键词
Big data processing; E-health; Machine learning; Random forests; Multi-classification schema; FEATURE-SELECTION; ARCHITECTURE;
D O I
10.1016/j.neucom.2020.08.091
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of pervasive IoT devices in Smart Cities, have increased the Volume of data produced in many and many field. Interesting and very useful applications grow up in number in E-health domain, where smart devices are used in order to manage huge amount of data, in highly distributed environments, in order to provide smart services able to collect data to fill medical records of patients. The problem here is to gather data, to produce records and to analyze medical records depending on their contents. Since data gathering involve very different devices (not only wearable medical sensors, but also environmental smart devices, like weather, pollution and other sensors) it is very difficult to classify data depending their contents, in order to enable better management of patients. Data from smart devices couple with medical records written in natural language: we describe here an architecture that is able to determine best features for classification, depending on existent medical records. The architecture is based on pre filtering phase based on Natural Language Processing, that is able to enhance Machine learning classification based on Random Forests. We carried on experiments on about 5000 medical records from real (anonymized) case studies from various health-care organizations in Italy. We show accuracy of the presented approach in terms of Accuracy-Rejection curves. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:79 / 91
页数:13
相关论文
共 50 条
  • [41] Vacant Parking Lot Detection System Using Random Forest Classification
    Raj, Suthapalli Uday
    Manikanta, Mummidi Veera
    Sai, Paduchuri Sesha
    Leo, M. Judith
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 454 - 458
  • [42] Development of automatic classification system for leukocyte images using Random Forest
    Tomiyama, Shinnosuke
    Sakata-Yanagimoto, Mamiko
    Chiba, Shigeru
    Aikawa, Naoyuki
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2018, 101 (11) : 13 - 19
  • [43] Enhancing skin lesion Classification: A machine learning approach using KNN, XGBoost, and Random Forest
    Hussain, S. K. Rhaber
    Powar, Omkar S.
    2024 CONTROL INSTRUMENTATION SYSTEM CONFERENCE, CISCON 2024, 2024,
  • [44] Enhancing breast cancer screening with urinary biomarkers and Random Forest supervised classification: A comprehensive investigation
    Alladio, Eugenio
    Trapani, Fulvia
    Castellino, Lorenzo
    Massano, Marta
    Di Corcia, Daniele
    Salomone, Alberto
    Berrino, Enrico
    Ponzone, Riccardo
    Marchio, Caterina
    Sapino, Anna
    Vincenti, Marco
    JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2024, 244
  • [45] Random Forest Based Multiclass Classification Approach for Highly Skewed Particle Data
    Kuzu, Serpil Yalcin
    JOURNAL OF SCIENTIFIC COMPUTING, 2023, 95 (01)
  • [46] Hyperspectral data classification using spline curve fitting and random forest classifier
    Mehdi M. Molabashi
    S. Abolfazl Hosseini
    S. Ali Hosseiny
    Earth Science Informatics, 2025, 18 (2)
  • [47] Lithological classification and analysis using Hyperion hyperspectral data and Random Forest method
    Ke YuanChu
    Shi ZhongKui
    Li PeiJun
    Zhang XiYa
    ACTA PETROLOGICA SINICA, 2018, 34 (07) : 2181 - 2188
  • [48] A Clustering Approach for Feature Selection in Microarray Data Classification Using Random forest
    Aydadenta, Husna
    Adiwijaya
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (05): : 1167 - 1175
  • [49] Random forest for big data classification in the internet of things using optimal features
    Lakshmanaprabu, S. K.
    Shankar, K.
    Ilayaraja, M.
    Nasir, Abdul Wahid
    Vijayakumar, V.
    Chilamkurti, Naveen
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (10) : 2609 - 2618
  • [50] ExtractingRuleRF in Educational Data Classification: from a Random Forest to Interpretable Refined Rules
    Lu Thi Kim Phung
    Vo Thi Ngoc Chau
    Nguyen Hua Phung
    2015 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND APPLICATIONS (ACOMP), 2015, : 20 - 27