Improved Dirichlet mixture model clustering algorithm for medical data anomaly detection

Authors
Wu, Lili [1, 2]
Ali, Majid Khan Majahar [3 ]
Shan, Fam Pei [3 ]
Tian, Ying [4 ]
Tao, Li [3 ]
Affiliations
[1] Xinzhou Teachers Univ, Dept Comp Sci, Xinzhou 034000, Peoples R China
[2] Univ Sains Malaysia USM, Sch Math Sci, George Town 11800, Malaysia
[3] USM, Sch Math Sci, George Town 11800, Malaysia
[4] Taiyuan Univ Technol, Dept Math, Taiyuan 030024, Peoples R China
Keywords
over-diagnosis; anomaly expenses; anomaly detection; DPMM; CBLOF
DOI
10.1504/IJBIC.2024.10064803
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To identify over-diagnosis and anomaly expenses in the healthcare service process, a local outlier mining clustering algorithm (ILOF-DPMM) is proposed that combines the clustering-based local outlier factor (CBLOF) algorithm with the Dirichlet process mixture model (DPMM). Patients' hospitalisation records are extracted from the medical record homepage, the factors influencing hospitalisation costs are classified by disease type, and the random forest method is used to reduce the feature dimension for each disease type. With this feature extraction and dimensionality reduction, the algorithm clusters medical insurance expense data effectively. When the LOF value of each data point is calculated, a weighted method based on the similarity of discrete and continuous features detects abnormal data points in the data set more accurately and can also handle newly arriving data in real time, which improves detection accuracy and efficiency.
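The abstract does not give the weighting formula or the exact scoring rule, so the following is only a minimal sketch of the general idea under stated assumptions: a Gower-style weighted distance over continuous and discrete features, fed into a simplified CBLOF-style score. The function names, the equal default weights, the `alpha` coverage threshold, the mean/mode cluster prototype, and the toy records are all hypothetical and are not taken from the paper. Cluster labels are supplied by hand here; in the described method they would come from the DPMM clustering step. The original CBLOF definition also multiplies by cluster size, which this sketch omits for simplicity.

```python
from collections import Counter, defaultdict


def mixed_distance(a, b, cont_idx, disc_idx, ranges, w_cont=0.5, w_disc=0.5):
    """Weighted distance between two records with mixed feature types.

    Continuous features contribute a range-normalised absolute difference,
    discrete features a mismatch rate (a Gower-style assumption).
    """
    cont = (sum(abs(a[i] - b[i]) / ranges[i] for i in cont_idx) / len(cont_idx)
            if cont_idx else 0.0)
    disc = (sum(a[i] != b[i] for i in disc_idx) / len(disc_idx)
            if disc_idx else 0.0)
    return w_cont * cont + w_disc * disc


def prototype(rows, cont_idx, disc_idx):
    """Cluster representative: mean of continuous, mode of discrete features."""
    proto = list(rows[0])
    for i in cont_idx:
        proto[i] = sum(r[i] for r in rows) / len(rows)
    for i in disc_idx:
        proto[i] = Counter(r[i] for r in rows).most_common(1)[0][0]
    return proto


def cblof_style_scores(data, labels, cont_idx, disc_idx, alpha=0.8):
    """Distance-only CBLOF-style score; higher means more anomalous."""
    # Normalise each continuous feature by its observed range (1.0 if constant).
    ranges = {i: (max(r[i] for r in data) - min(r[i] for r in data)) or 1.0
              for i in cont_idx}
    clusters = defaultdict(list)
    for row, lab in zip(data, labels):
        clusters[lab].append(row)
    # "Large" clusters: the biggest ones that together cover alpha of the data.
    large, covered = [], 0
    for lab in sorted(clusters, key=lambda c: len(clusters[c]), reverse=True):
        if covered >= alpha * len(data):
            break
        large.append(lab)
        covered += len(clusters[lab])
    protos = {lab: prototype(clusters[lab], cont_idx, disc_idx) for lab in large}
    scores = []
    for row, lab in zip(data, labels):
        if lab in protos:
            # Member of a large cluster: distance to its own prototype.
            d = mixed_distance(row, protos[lab], cont_idx, disc_idx, ranges)
        else:
            # Member of a small cluster: distance to the nearest large cluster.
            d = min(mixed_distance(row, p, cont_idx, disc_idx, ranges)
                    for p in protos.values())
        scores.append(d)
    return scores


# Toy records: (total cost, length of stay in days, department).
data = [
    (1000.0, 3, "cardio"), (1100.0, 4, "cardio"),
    (1050.0, 3, "cardio"), (980.0, 3, "cardio"),
    (5000.0, 10, "onco"), (5200.0, 11, "onco"),
    (5100.0, 10, "onco"), (4900.0, 9, "onco"),
    (20000.0, 30, "cardio"),  # unusually expensive stay
]
labels = [0, 0, 0, 0, 1, 1, 1, 1, 2]  # hand-assigned stand-in for DPMM output
scores = cblof_style_scores(data, labels, cont_idx=[0, 1], disc_idx=[2])
print(max(range(len(data)), key=scores.__getitem__))
```

In this toy run the single-member cluster is treated as small, so its record is scored against the nearest large cluster and receives the highest score. Scoring a new record would only require its distance to the stored prototypes, which is one plausible reading of the real-time capability mentioned in the abstract.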
Pages: 11 - 21
Number of pages: 12
Related papers
50 in total
  • [41] A Dirichlet Mixture Model of Hawkes Processes for Event Sequence Clustering
    Xu, Hongteng
    Zha, Hongyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [42] A Dirichlet Process Mixture Model for Spherical Data
    Straub, Julian
    Chang, Jason
    Freifeld, Oren
    Fisher, John W., III
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 38, 2015, 38 : 930 - 938
  • [43] Clustering Algorithm Based on Outlier Detection for Anomaly Intrusion Detection
    Yin, Shang-Nan
    Kang, Ho-Seok
    Kim, Sung-Ryul
    JOURNAL OF INTERNET TECHNOLOGY, 2016, 17 (02): : 291 - 299
  • [44] Online Data Clustering Using Variational Learning of a Hierarchical Dirichlet Process Mixture of Dirichlet Distributions
    Fan, Wentao
    Bouguila, Nizar
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, 2014, 8505 : 18 - 32
  • [45] Medical CT Edge Detection Algorithm based on Improved Fuzzy Clustering Analysis
    Sun, Shiling
    Yan, Shuxun
    Wang, Ying
    Li, Yun
    2014 FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND ENGINEERING APPLICATIONS (ISDEA), 2014, : 607 - 610
  • [46] Improved Collaborative Algorithm Based on Spatial-spectral Joint Clustering for Hyperspectral Anomaly Detection
    Ma Shi-xin
    Liu Chun-tong
    Li Hong-cai
    He Zhen-xin
    Wang Hao
    ACTA PHOTONICA SINICA, 2019, 48 (01)
  • [47] DIRICHLET PROCESS MIXTURE MODELS FOR CLUSTERING I-VECTOR DATA
    Seshadri, Shreyas
    Remes, Ulpu
    Rasanen, Okko
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5470 - 5474
  • [48] Ocean Data Anomaly Detection Algorithm Based on Improved k-medoids
    Jiang Hua
    Wu Yao
    Lyu Kuilin
    Wang Huijiao
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 196 - 201
  • [49] Outlier Dirichlet Mixture Mechanism: Adversarial Statistical Learning for Anomaly Detection in the Fog
    Moustafa, Nour
    Choo, Kim-Kwang Raymond
    Radwan, Ibrahim
    Camtepe, Seyit
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (08) : 1975 - 1987
  • [50] Mixture model clustering of uncertain data
    Hamdan, H
    Govaert, G
    FUZZ-IEEE 2005: PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS: BIGGEST LITTLE CONFERENCE IN THE WORLD, 2005, : 879 - 884