Unsupervised Classification under Uncertainty: The Distance-Based Algorithm

被引:1
|
作者
Ghanaiem, Alaa [1 ]
Kagan, Evgeny [2 ]
Kumar, Parteek [3 ]
Raviv, Tal [1 ]
Glynn, Peter [4 ]
Ben-Gal, Irad [1 ]
机构
[1] Tel Aviv Univ, Dept Ind Engn, IL-69978 Ramat Aviv, Israel
[2] Ariel Univ, Fac Engn, Dept Ind Engn & Management, IL-40700 Ariel, Israel
[3] Thapar Inst Engn & Technol, Dept Comp Sci & Engn, Patiala 147004, India
[4] Stanford Univ, Inst Computat & Math Engn, Dept Management Sci & Engn, Stanford, CA 94305 USA
关键词
classification; uncertainty; collective choice; likelihood;
D O I
10.3390/math11234784
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
This paper presents a method for unsupervised classification of entities by a group of agents with unknown domains and levels of expertise. In contrast to the existing methods based on majority voting ("wisdom of the crowd") and their extensions by expectation-maximization procedures, the suggested method first determines the levels of the agents' expertise and then weights their opinions by their expertise level. In particular, we assume that agents will have relatively closer classifications in their field of expertise. Therefore, the expert agents are recognized by using a weighted Hamming distance between their classifications, and then the final classification of the group is determined from the agents' classifications by expectation-maximization techniques, with preference to the recognized experts. The algorithm was verified and tested on simulated and real-world datasets and benchmarked against known existing algorithms. We show that such a method reduces incorrect classifications and effectively solves the problem of unsupervised collaborative classification under uncertainty, while outperforming other known methods.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Approximate is Enough: Distance-Based Validation for Geospatial Classification
    Li, Yangping
    Hu, Tianming
    CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 447 - 456
  • [42] Distance-based margin support vector machine for classification
    Chen, Yan-Cheng
    Su, Chao-Ton
    APPLIED MATHEMATICS AND COMPUTATION, 2016, 283 : 141 - 152
  • [43] Distance-based non-deterministic semantics for reasoning with uncertainty
    Arieli, Ofer
    Zamansky, Anna
    LOGIC JOURNAL OF THE IGPL, 2009, 17 (04) : 325 - 350
  • [44] Grid distance-based improving accuracy clustering algorithm
    Pang-Chunjiang
    Cheng-Weixiang
    Niu-Weihua
    2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2008, : 877 - 880
  • [45] A Novel Scattering Distance-Based Mobile Positioning Algorithm
    Zhaounia, Mohamed
    Landolsi, Mohamed Adnan
    Bouallegue, Ridha
    2009 GLOBAL INFORMATION INFRASTRUCTURE SYMPOSIUM (GIIS 2009), 2009, : 306 - +
  • [46] A distance-based parameter free algorithm for curve reconstruction
    Zeng, Yong
    Nguyen, Thanh An
    Yan, Baiquan
    Li, Shuren
    COMPUTER-AIDED DESIGN, 2008, 40 (02) : 210 - 222
  • [47] AnomalyDetect: An Online Distance-Based Anomaly Detection Algorithm
    Huo, Wunjun
    Wang, Wei
    Li, Wen
    WEB SERVICES - ICWS 2019, 2019, 11512 : 63 - 79
  • [48] A distance-based algorithm for clustering database user sessions
    Yao, QS
    An, AJ
    Huang, XJ
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2005, 3488 : 562 - 572
  • [49] Software dependability analysis under neutrosophic environment using optimized Elman recurrent neural network-based classification algorithm and Mahalanobis distance-based ranking algorithm
    Chatterjee, Subhashis
    Saha, Deepjyoti
    ANNALS OF OPERATIONS RESEARCH, 2024, 340 (01) : 83 - 115
  • [50] A comparison of distance-based and classification-based analyses of habitat use
    Conner, LM
    Smith, MD
    Burger, LW
    ECOLOGY, 2003, 84 (02) : 526 - 531