Discriminative cluster refinement: Improving object category recognition given limited training data

Cited by: 0
Authors
Yang, Liu [1]
Jin, Rong [1]
Pantofaru, Caroline [2]
Sukthankar, Rahul [2, 3]
Affiliations
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[2] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
[3] Intel Res, Pittsburgh, PA 15213 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
A popular approach to problems in image classification is to represent the image as a bag of visual words and then employ a classifier to categorize the image. Unfortunately, a significant shortcoming of this approach is that the clustering and classification are disconnected. Since the clustering into visual words is unsupervised, the representation does not necessarily capture the aspects of the data that are most useful for classification. More seriously, the semantic relationship between clusters is lost, causing the overall classification performance to suffer. We introduce "discriminative cluster refinement" (DCR), a method that explicitly models the pairwise relationships between different visual words by exploiting their co-occurrence information. The assigned class labels are used to identify the co-occurrence patterns that are most informative for object classification. DCR employs a maximum-margin approach to generate an optimal kernel matrix for classification. One important benefit of DCR is that it integrates smoothly into existing bag-of-words information retrieval systems by employing the set of visual words generated by any clustering method. While DCR could improve a broad class of information retrieval systems, this paper focuses on object category recognition. We present a direct comparison with a state-of-the-art method on the PASCAL 2006 database and show that cluster refinement results in a significant improvement in classification accuracy given a small number of training examples.
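The building blocks named in the abstract (bag-of-visual-words histograms, co-occurrence between visual words, and a kernel that lets different words interact) can be illustrated with a small sketch. This is not the DCR algorithm: the abstract says DCR learns its kernel with a maximum-margin procedure that uses class labels, and that step is not reproduced here. As a stand-in, the sketch assumes the raw, unsupervised co-occurrence count matrix plays the role of the word-relationship matrix M in k(x, y) = x^T M y. That count matrix is a sum of outer products of word-presence indicator vectors, so it is positive semidefinite and the resulting k is a valid kernel. All function names and the toy data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def bow_histogram(word_ids, vocab_size):
    """Count how often each visual word occurs in one image."""
    counts = Counter(word_ids)
    return [counts.get(w, 0) for w in range(vocab_size)]


def cooccurrence_matrix(images, vocab_size):
    """For each pair of visual words, count the images that contain both.

    The diagonal holds the number of images containing each word.
    """
    matrix = [[0] * vocab_size for _ in range(vocab_size)]
    for word_ids in images:
        present = sorted(set(word_ids))
        for w in present:
            matrix[w][w] += 1
        for a, b in combinations(present, 2):
            matrix[a][b] += 1
            matrix[b][a] += 1
    return matrix


def word_relation_kernel(hist_x, hist_y, relation):
    """Evaluate k(x, y) = x^T M y for a word-relationship matrix M."""
    return sum(
        hist_x[i] * relation[i][j] * hist_y[j]
        for i in range(len(hist_x))
        for j in range(len(hist_y))
    )


# Toy data (hypothetical): each image is a list of visual-word indices,
# as produced by assigning local descriptors to cluster centers.
images = [[0, 0, 1], [1, 2], [0, 1, 2, 2]]
vocab_size = 3

histograms = [bow_histogram(img, vocab_size) for img in images]
cooc = cooccurrence_matrix(images, vocab_size)

# Assumption: unsupervised co-occurrence counts stand in for the
# learned, label-aware matrix described in the abstract.
identity = [[int(i == j) for j in range(vocab_size)] for i in range(vocab_size)]
plain_similarity = word_relation_kernel(histograms[0], histograms[1], identity)
related_similarity = word_relation_kernel(histograms[0], histograms[1], cooc)
```

With the identity matrix the kernel reduces to the ordinary dot product of histograms, in which different visual words never interact. Replacing it with a matrix that has nonzero off-diagonal entries lets images that share related, but not identical, words score as more similar, which is the kind of inter-cluster relationship the abstract says plain bag-of-words loses.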
Pages: 2303 / +
Page count: 2