Model Evaluation Improvements for Multiclass Classification in Diagnosis Prediction

Cited by: 0
Authors
Coroiu, Adriana Mihaela [1]
Affiliations
[1] Babes Bolyai Univ, Cluj Napoca, Romania
Keywords
Multiclass classification; Evaluation model; Features selection;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We live in an age in which we are overwhelmed by the amount of available data, and that amount is growing exponentially. Making sense of all these data is a challenge in its own right. Moreover, dealing with different types of data requires new approaches in the field of exploratory analysis. Extracting relevant information, discovering relations between data, and generalizing to new data therefore remain continuous challenges. Exploratory data analysis has become an important area of concern for domains such as education, healthcare, biology, economics, geography, geology, history, and agriculture. This paper focuses on medicine and psychology, where machine learning is being investigated as a way to improve the diagnosis and treatment of a patient. The paper, which presents work in progress, discusses an approach to a relevant supervised learning method from the field of machine learning: classification. Several aspects are considered: preprocessing of the input data; selection of the model applied to the data; evaluation of the model; improvement of model performance; selection of the most relevant features to include in the model; and learning a model that performs well on new data [1]. The metrics computed to evaluate model performance are also highlighted. The data sets used in our analysis contain mixed data from the medical field (kidney and lung disease: pulmonary-renal syndrome) and are suitable for multiclass classification. The selected models are ensembles of decision trees, namely Random Forest and Gradient Boosted Regression Trees. Model evaluation, model improvement, and feature selection ultimately lead to models that generalize to new data with high accuracy. All of this adds value in fields such as medicine and psychology, where a physician or a psychologist may use the discovered patterns and information as input when treating a patient.
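The abstract mentions computed evaluation metrics for multiclass models without naming them. As a rough illustration only, the sketch below computes accuracy and per-class precision, recall, and F1 (plus their macro average) in plain Python. The choice of metrics, the function names, and the three class labels are assumptions made for this example and are not taken from the paper.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def per_class_metrics(y_true, y_pred):
    """Return {label: {"precision", "recall", "f1"}} using one-vs-rest counts."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    report = {}
    for label in sorted(set(y_true) | set(y_pred)):
        pred_total = tp[label] + fp[label]
        true_total = tp[label] + fn[label]
        precision = tp[label] / pred_total if pred_total else 0.0
        recall = tp[label] / true_total if true_total else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = {"precision": precision, "recall": recall, "f1": f1}
    return report


def macro_f1(report):
    """Unweighted mean of the per-class F1 scores."""
    return sum(m["f1"] for m in report.values()) / len(report)


# Hypothetical labels and predictions, invented purely for illustration.
y_true = ["kidney", "kidney", "lung", "lung", "both", "both"]
y_pred = ["kidney", "lung", "lung", "lung", "both", "kidney"]

report = per_class_metrics(y_true, y_pred)
print("accuracy:", round(accuracy(y_true, y_pred), 3))
for label, scores in report.items():
    print(label, {name: round(value, 3) for name, value in scores.items()})
print("macro F1:", round(macro_f1(report), 3))
```

Macro averaging weights every class equally, which can matter when some diagnoses are rare; whether the paper reports macro, micro, or weighted averages is not stated in the abstract.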
Pages: 782-783
Number of pages: 2