Intelligent medical heterogeneous big data set balanced clustering using deep learning

被引:21
|
作者
Li, Xiaofeng [1 ]
Jiao, Hongshuang [2 ]
Li, Dong [3 ]
机构
[1] Heilongjiang Int Univ, Dept Informat Engn, Harbin 150025, Peoples R China
[2] Heilongjiang Int Univ, Off Acad Res, Harbin 150025, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Smart medical; Heterogeneous big data; Deep neural network; Data balanced clustering; ALGORITHM; OPTIMIZATION;
D O I
10.1016/j.patrec.2020.08.027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to address the clustering problem of intelligent medical data, the data sets were not preprocessed using the traditional method, leading to a large amount of calculation, low efficiency, and large data cluster center offset distance. We proposed a balanced clustering algorithm for intelligent medical heterogeneous big data set using deep learning. Firstly, a deep neural network model based on incremental updating was constructed, and adaptive training and adjustment were made according to data scale, and the multi-layer feature learning of heterogeneous big data sets of intelligent medical care. Secondly, under-sampling preprocessing was carried out on the data set so that the data of the heterogeneous big data set was in a balanced state, and on this basis, clustering calculation of the heterogeneous big data was conducted. Then, the clustering center was set according to the kernel density estimation results, and the data cluster center was updated iteratively until convergence by combining the data features obtained from deep learning and euclidean distance calculation, so as to complete the balanced clustering of the heterogeneous big data set of intelligent medical treatment. The results show that the proposed algorithm has the advantages of small data cluster center offset distance, short clustering time, low energy consumption, high Macro-F1 value and NMI value, and the accuracy of clustering can be as high as 95%, the calculational cost is low, which has certain advantages. (C) 2020 Elsevier Ltd. All rights reserved. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:548 / 555
页数:8
相关论文
共 50 条
  • [31] Quantifying Uncertainty in Internet of Medical Things and Big-Data Services Using Intelligence and Deep Learning
    Al-Turjman, Fadi
    Zahmatkesh, Hadi
    Mostarda, Leonardo
    IEEE ACCESS, 2019, 7 : 115749 - 115759
  • [32] Efficient automated error detection in medical data using deep-learning and label-clustering
    Nguyen, T. V.
    Diakiw, S. M.
    VerMilyea, M. D.
    Dinsmore, A. W.
    Perugini, M.
    Perugini, D.
    Hall, J. M. M.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [33] Efficient automated error detection in medical data using deep-learning and label-clustering
    T. V. Nguyen
    S. M. Diakiw
    M. D. VerMilyea
    A. W. Dinsmore
    M. Perugini
    D. Perugini
    J. M. M. Hall
    Scientific Reports, 13 (1)
  • [34] Intrusion Detection Using Big Data and Deep Learning Techniques
    Faker, Osama
    Dogdu, Erdogan
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 86 - 93
  • [35] Predicting Infectious Disease Using Deep Learning and Big Data
    Chae, Sangwon
    Kwon, Sungjun
    Lee, Donghyun
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2018, 15 (08)
  • [36] Environment Classification for Lrban Big Data Using Deep Learning
    Hossain, M. Shamim
    Muhammad, Ghulam
    IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (11) : 44 - 50
  • [37] Health Analysis of Footballer Using Big Data and Deep Learning
    Yang, Tao
    Yuan, Guoliang
    Yan, Jing
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [38] Understanding Emotions in Text Using Deep Learning and Big Data
    Chatterjee, Ankush
    Gupta, Umang
    Chinnakotla, Manoj Kumar
    Srikanth, Radhakrishnan
    Galley, Michel
    Agrawal, Puneet
    COMPUTERS IN HUMAN BEHAVIOR, 2019, 93 : 309 - 317
  • [39] Big Data Image and Video Analysis Using Deep Learning
    Rao, Chunchu Srinivasa
    Naramala, Venkateswara Rao
    Sitharamulu, V.
    Devi, Jujjuri. Rama
    Paladugu, Rama Krishna
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (04): : 824 - 828
  • [40] Optimizing data processing for edge-enabled IoT devices using deep learning based heterogeneous data clustering approach
    Sudhakar M.
    Anne K.R.
    Measurement: Sensors, 2024, 31