Neighborhood Information-Based Method for Multivariate Association Mining

Cited by: 2
Authors
Cheng, Honghong [1, 2]
Qian, Yuhua [3]
Guo, Yingjie [3]
Zheng, Keyin [3]
Zhang, Qingfu [4, 5]
Affiliations
[1] Shanxi Univ Finance & Econ, Sch Informat, Taiyuan 030012, Shanxi, Peoples R China
[2] Shanxi Univ, Inst Big Data Sci & Ind, Taiyuan 030006, Shanxi, Peoples R China
[3] Shanxi Univ, Inst Big Data Sci & Ind, Sch Comp & Informat Technol, Key Lab Comp Intelligence & China Informat Proc,Mi, Taiyuan 030006, Shanxi, Peoples R China
[4] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[5] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Entropy; Spirals; Noise measurement; Mutual information; Knowledge engineering; Data mining; Data engineering; Association mining; multivariate association measure; distribution-free; nonparametric; neighborhood information; ATTRIBUTE REDUCTION;
DOI
10.1109/TKDE.2022.3178090
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most current data are multivariate, and exploring and identifying valuable information in such datasets has far-reaching impact. In particular, discovering meaningful hidden association patterns in multivariate data plays an important role. Many measures of multivariate association have been proposed, yet effectively capturing association patterns among three or more variables remains an open research challenge, especially when no prior knowledge about those relationships is available. This calls for a distribution-free, association-type-independent, nonparametric measure. For practical applications, such a measure should also be comparable, interpretable, scalable, intuitive, reliable, and robust. However, no existing measure fulfills all of these desiderata. In this paper, taking advantage of the neighborhood information of a sample, we propose MNA, a maximal neighborhood multivariate association measure that satisfies all the above criteria. Extensive experiments on synthetic and real data show that it outperforms state-of-the-art multivariate association measures.
Pages: 6126-6135
Number of pages: 10