Neighborhood Information-Based Method for Multivariate Association Mining

被引:2
|
作者
Cheng, Honghong [1 ,2 ]
Qian, Yuhua [3 ]
Guo, Yingjie [3 ]
Zheng, Keyin [3 ]
Zhang, Qingfu [4 ,5 ]
机构
[1] Shanxi Univ Finance & Econ, Sch Informat, Taiyuan 030012, Shanxi, Peoples R China
[2] Shanxi Univ, Inst Big Data Sci & Ind, Taiyuan 030006, Shanxi, Peoples R China
[3] Shanxi Univ, Inst Big Data Sci & Ind, Sch Comp & Informat Technol, Key Lab Comp Intelligence & China Informat Proc,Mi, Taiyuan 030006, Shanxi, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Entropy; Spirals; Noise measurement; Mutual information; Knowledge engineering; Data mining; Data engineering; Association mining; multivariate association measure; distribution-free; nonparametric; neighborhood information; ATTRIBUTE REDUCTION;
D O I
10.1109/TKDE.2022.3178090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most current data is multivariable, exploring and identifying valuable information in these datasets has far-reaching impacts. In particular, discovering meaningful hidden association patterns in multivariate plays an important role. Plenty of measures for multivariate association have been proposed, yet it is still an open research challenge for effectively capturing association patterns among three or more variables, especially the scenario without any prior knowledge about those relationships. To do so, we desire a distribution-free, association type-independent and non-parametrical measure. For practical applications, such a measure should comparable, interpretable, scalable, intuitive, reliability, and robust. However, no exiting measures fulfill all of these desiderata. In this paper, taking advantage of the neighborhood information of a sample, we propose MNA, a maximal neighborhood multivariate association measure that satisfies all the above criteria. Extensive experiments on synthetic and real data show it outperforms state-of-the-art multivariate association measures.
引用
收藏
页码:6126 / 6135
页数:10
相关论文
共 50 条
  • [21] Information-based trade
    Bond, Philip
    Eraslan, Huelya
    JOURNAL OF ECONOMIC THEORY, 2010, 145 (05) : 1675 - 1703
  • [22] Identification of Multiview Gene Modules Using Mutual Information-Based Hypograph Mining
    Bhadra, Tapas
    Mallik, Saurav
    Bandyopadhyay, Sanghamitra
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 49 (06): : 1119 - 1130
  • [23] A mutual-information-based mining method for marine abnormal association rules
    Xue Cunjin
    Song Wanjiao
    Qin Lijuan
    Dong Qing
    Wen Xiaoyang
    COMPUTERS & GEOSCIENCES, 2015, 76 : 121 - 129
  • [24] INFORMATION STRUCTURE AS INFORMATION-BASED PARTITION
    Tomioka, Satoshi
    ACTA LINGUISTICA HUNGARICA, 2008, 55 (3-4) : 309 - 317
  • [25] Joint mutual information-based input variable selection for multivariate time series modeling
    Han, Min
    Ren, Weijie
    Liu, Xiaoxin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 37 : 250 - 257
  • [26] A Fuzzy Mutual Information-based Feature Selection Method for Classification
    Hogue, N.
    Ahmed, H. A.
    Bhattacharyya, D. K.
    Kalita, J. K.
    FUZZY INFORMATION AND ENGINEERING, 2016, 8 (03) : 355 - 384
  • [27] Mutual information-based method for selecting informative feature sets
    Herman, Gunawan
    Zhang, Bang
    Wang, Yang
    Ye, Getian
    Chen, Fang
    PATTERN RECOGNITION, 2013, 46 (12) : 3315 - 3327
  • [28] A Novel Neighboring Information-Based Method for Motion Object Detection
    Wang, Bingshu
    Hu, Xuefeng
    Zhu, Wenqian
    Zhao, Yong
    PROCEEDINGS OF THE MEDITERRANEAN CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGIES 2015 (MEDCT 2015), VOL 2, 2016, 381 : 669 - 674
  • [29] Information-Based Dichotomization: A Method for Multiclass Support Vector Machines
    Songsiri, Patoomsiri
    Kijsirikul, Boonserm
    Phetkaew, Thimaporn
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 3284 - +
  • [30] Gradient intensity: A new mutual information-based registration method
    Shams, Ramtin
    Sadeghi, Parastoo
    Kennedy, Rodney A.
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3249 - +