Online Variational Learning of Dirichlet Process Mixtures of Scaled Dirichlet Distributions

被引:9
|
作者
Manouchehri, Narges [1 ]
Nguyen, Hieu [1 ]
Koochemeshkian, Pantea [2 ]
Bouguila, Nizar [1 ]
Fan, Wentao [3 ]
机构
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ H3G 1M8, Canada
[2] Concordia Univ, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
[3] Huaqiao Univ, Dept Comp Sci & Technol, Xiamen, Peoples R China
基金
加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
Infinite mixture models; Dirichlet process mixtures of scaled Dirichlet distributions; Online variational learning; Spam categorization; Diabetes; Hepatitis; UNSUPERVISED SELECTION; MODEL;
D O I
10.1007/s10796-020-10027-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data clustering as an unsupervised method has been one of the main attention-grabbing techniques and a large class of tasks can be formulated by this method. Mixture models as a branch of clustering methods have been used in various fields of research such as computer vision and pattern recognition. To apply these models, we need to address some problems such as finding a proper distribution that properly fits data, defining model complexity and estimating the model parameters. In this paper, we apply scaled Dirichlet distribution to tackle the first challenge and propose a novel online variational method to mitigate the other two issues simultaneously. The effectiveness of the proposed work is evaluated by four challenging real applications, namely, text and image spam categorization, diabetes and hepatitis detection.
引用
收藏
页码:1085 / 1093
页数:9
相关论文
共 50 条
  • [31] Clustering consistency with Dirichlet process mixtures
    Ascolani, F.
    Lijoi, A.
    Rebaudo, G.
    Zanella, G.
    BIOMETRIKA, 2023, 110 (02) : 551 - 558
  • [32] A Hierarchical Pitman-Yor mixture of Scaled Dirichlet Distributions
    Baghdadi, Ali
    Manouchehri, Narges
    Bouguila, Nizar
    2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, : 168 - 173
  • [33] Infinite Scaled Dirichlet Mixture Models for Spam Filtering Via Bayesian and Variational Bayes Learning
    Aldosari, Fand
    Bourouis, Sami
    Bouguila, Nizar
    Sallay, Hassen
    Khayyat, Khalid M. Jamil
    IEEE 2018 INTERNATIONAL CONGRESS ON CYBERMATICS / 2018 IEEE CONFERENCES ON INTERNET OF THINGS, GREEN COMPUTING AND COMMUNICATIONS, CYBER, PHYSICAL AND SOCIAL COMPUTING, SMART DATA, BLOCKCHAIN, COMPUTER AND INFORMATION TECHNOLOGY, 2018, : 1841 - 1847
  • [34] Data Clustering Using Variational Learning of Finite Scaled Dirichlet Mixture Models with Component Splitting
    Hieu Nguyen
    Maanicshah, Kamal
    Azam, Muhammad
    Bouguila, Nizar
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2019), PT II, 2019, 11663 : 117 - 128
  • [35] Collapsed Variational Dirichlet Process Mixture Models
    Kurihara, Kenichi
    Welling, Max
    Teh, Yee Whye
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2796 - 2801
  • [36] MIXTURES OF DIRICHLET DISTRIBUTIONS AND ESTIMATION IN CONTINGENCY-TABLES
    ALBERT, JH
    GUPTA, AK
    ANNALS OF STATISTICS, 1982, 10 (04): : 1261 - 1268
  • [37] FLEXIBLE ONLINE MULTIVARIATE REGRESSION WITH VARIATIONAL BAYES AND THE MATRIX-VARIATE DIRICHLET PROCESS
    Ong, Victor Meng Hwee
    Nott, David J.
    Choi, Taeryon
    Jasra, Ajay
    FOUNDATIONS OF DATA SCIENCE, 2019, 1 (02): : 129 - 156
  • [38] Bayesian Nonparametric Models of Circular Variables Based on Dirichlet Process Mixtures of Normal Distributions
    Gabriel Nuñez-Antonio
    María Concepción Ausín
    Michael P. Wiper
    Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 47 - 64
  • [39] Clustering distributions with the marginalized nested Dirichlet process
    Zuanetti, Daiane Aparecida
    Muller, Peter
    Zhu, Yitan
    Yang, Shengjie
    Ji, Yuan
    BIOMETRICS, 2018, 74 (02) : 584 - 594
  • [40] Bayesian Nonparametric Models of Circular Variables Based on Dirichlet Process Mixtures of Normal Distributions
    Nunez-Antonio, Gabriel
    Concepcion Ausin, Maria
    Wiper, Michael P.
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2015, 20 (01) : 47 - 64