Discriminative cluster refinement: Improving object category recognition given limited training data

Cited by: 0
Authors
Yang, Liu [1]
Jin, Rong [1]
Pantofaru, Caroline [2]
Sukthankar, Rahul [2, 3]
Affiliations
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[2] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
[3] Intel Res, Pittsburgh, PA 15213 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
A popular approach to problems in image classification is to represent the image as a bag of visual words and then employ a classifier to categorize the image. Unfortunately, a significant shortcoming of this approach is that the clustering and classification are disconnected. Since the clustering into visual words is unsupervised, the representation does not necessarily capture the aspects of the data that are most useful for classification. More seriously, the semantic relationship between clusters is lost, causing the overall classification performance to suffer. We introduce "discriminative cluster refinement" (DCR), a method that explicitly models the pairwise relationships between different visual words by exploiting their co-occurrence information. The assigned class labels are used to identify the co-occurrence patterns that are most informative for object classification. DCR employs a maximum-margin approach to generate an optimal kernel matrix for classification. One important benefit of DCR is that it integrates smoothly into existing bag-of-words information retrieval systems by employing the set of visual words generated by any clustering method. While DCR could improve a broad class of information retrieval systems, this paper focuses on object category recognition. We present a direct comparison with a state-of-the-art method on the PASCAL 2006 database and show that cluster refinement results in a significant improvement in classification accuracy given a small number of training examples.
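The abstract does not give the DCR formulation, so the following is only a minimal Python sketch of the ingredients it names: bag-of-visual-words histograms, co-occurrence counts between visual words, and a kernel that lets different words interact. The function names, the toy data, and the choice of the raw co-occurrence matrix as the word-similarity matrix `M` are all assumptions made for illustration. The supervised, maximum-margin step that DCR uses to learn the kernel from class labels is not reproduced here.

```python
from collections import Counter

def bow_histogram(word_ids, vocab_size):
    """Count how often each visual word occurs in one image."""
    counts = Counter(word_ids)
    return [counts.get(w, 0) for w in range(vocab_size)]

def cooccurrence(histograms):
    """C[i][j] = number of images in which words i and j both appear."""
    vocab_size = len(histograms[0])
    C = [[0] * vocab_size for _ in range(vocab_size)]
    for h in histograms:
        present = [w for w, c in enumerate(h) if c > 0]
        for i in present:
            for j in present:
                C[i][j] += 1
    return C

def word_aware_kernel(h1, h2, M):
    """k(h1, h2) = h1^T M h2. With M = identity this is the plain linear
    bag-of-words kernel, in which distinct words never interact."""
    n = len(h1)
    return sum(h1[i] * M[i][j] * h2[j] for i in range(n) for j in range(n))

# Hypothetical toy data: each image is a list of visual-word ids
# drawn from an assumed 4-word vocabulary.
images = [[0, 0, 1], [1, 2], [0, 1, 3]]
V = 4
H = [bow_histogram(img, V) for img in images]

# Unsupervised stand-in for the word-similarity matrix (an assumption;
# DCR instead selects informative co-occurrence patterns using labels).
C = cooccurrence(H)
identity = [[1 if i == j else 0 for j in range(V)] for i in range(V)]

k_plain = word_aware_kernel(H[0], H[1], identity)
k_cooc = word_aware_kernel(H[0], H[1], C)
```

Because `C` is a sum of outer products of binary presence vectors, it is positive semidefinite, so `word_aware_kernel` with `M = C` is a valid kernel. In a full system, kernel values of this kind would be collected into a kernel matrix and passed to a maximum-margin classifier.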
Pages: 2303 / +
Page count: 2