Asymptotic analysis of estimators on multi-label data

被引:0
|
作者
Streich, Andreas P. [1 ]
Buhmann, Joachim M. [2 ]
机构
[1] Phonak AG, Sci & Technol Grp, CH-8712 Stafa, Switzerland
[2] ETH, Dept Comp Sci, CH-8092 Zurich, Switzerland
关键词
Generative model; Asymptotic analysis; Multi-label classification; Consistency; DECONVOLUTION; CONSISTENCY;
D O I
10.1007/s10994-014-5457-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification extends the standard multi-class classification paradigm by dropping the assumption that classes have to be mutually exclusive, i.e., the same data item might belong to more than one class. Multi-label classification has many important applications in e.g. signal processing, medicine, biology and information security, but the analysis and understanding of the inference methods based on data with multiple labels are still underdeveloped. In this paper, we formulate a general generative process for multi-label data, i.e. we associate each label (or class) with a source. To generate multi-label data items, the emissions of all sources in the label set are combined. In the training phase, only the probability distributions of these (single label) sources need to be learned. Inference on multi-label data requires solving an inverse problem, models of the data generation process therefore require additional assumptions to guarantee well-posedness of the inference procedure. Similarly, in the prediction (test) phase, the distributions of all single-label sources in the label set are combined using the combination function to determine the probability of a label set. We formally describe several previously presented inference methods and introduce a novel, general-purpose approach, where the combination function is determined based on the data and/or on a priori knowledge of the data generation mechanism. This framework includes cross-training and new source training (also named label power set method) as special cases. We derive an asymptotic theory for estimators based on multi-label data and investigate the consistency and efficiency of estimators obtained by several state-of-the-art inference techniques. Several experiments confirm these findings and emphasize the importance of a sufficiently complex generative model for real-world applications.
引用
收藏
页码:373 / 409
页数:37
相关论文
共 50 条
  • [1] Asymptotic analysis of estimators on multi-label data
    Andreas P. Streich
    Joachim M. Buhmann
    Machine Learning, 2015, 99 : 373 - 409
  • [2] On the Stratification of Multi-label Data
    Sechidis, Konstantinos
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2011, 6913 : 145 - 158
  • [3] LAIM discretization for multi-label data
    Cano, Alberto
    Maria Luna, Jose
    Gibaja, Eva L.
    Ventura, Sebastian
    INFORMATION SCIENCES, 2016, 330 : 370 - 384
  • [4] UnTangle Map: Visual Analysis of Probabilistic Multi-Label Data
    Cao, Nan
    Lin, Yu-Ru
    Gotz, David
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (02) : 1149 - 1163
  • [5] Assessing the Multi-labelness of Multi-label Data
    Park, Laurence A. F.
    Guo, Yi
    Read, Jesse
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 11907 : 164 - 179
  • [6] Prototype selection for multi-label data based on label correlation
    Haikun Li
    Min Fang
    Hang Li
    Peng Wang
    Neural Computing and Applications, 2024, 36 : 2121 - 2130
  • [7] Imbalance multi-label data learning with label specific features
    Rastogi, Reshma
    Mortaza, Sayed
    NEUROCOMPUTING, 2022, 513 : 395 - 408
  • [8] Imbalance multi-label data learning with label specific features
    Rastogi, Reshma
    Mortaza, Sayed
    Neurocomputing, 2022, 513 : 395 - 408
  • [9] An Analysis of Chaining in Multi-Label Classification
    Dembczynski, Krzysztof
    Waegeman, Willem
    Huellermeir, Eyke
    20TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2012), 2012, 242 : 294 - +
  • [10] Multi-label Linear Discriminant Analysis
    Wang, Hua
    Ding, Chris
    Huang, Heng
    COMPUTER VISION - ECCV 2010, PT VI, 2010, 6316 : 126 - 139