WEIGHTED BAG OF VISUAL WORDS FOR OBJECT RECOGNITION

被引:0
|
作者
San Biagio, Marco [1 ]
Bazzani, Loris [1 ,2 ]
Cristani, Marco [1 ,2 ]
Murino, Vittorio [1 ,2 ]
机构
[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis, Via Morego 30, I-16163 Genoa, Italy
[2] Univ Verona, Dept Informat, I-37134 Verona, Italy
关键词
object recognition; dictionary learning; visual saliency; feature weighting;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Bag of Visual words (BoV) is one of the most successful strategy for object recognition, used to represent an image as a vector of counts using a learned vocabulary. This strategy assumes that the representation is built using patches that are either densely extracted or sampled from the images using feature detectors. However, the dense strategy captures also the noisy background information, whereas the feature detection strategy can lose important parts of the objects. In this paper we propose a solution in-between these two strategies, by densely extracting patches from the image, and weighting them accordingly to their salience. Intuitively, highly salient patches have an important role in describing an object, while those with low saliency are still taken with low emphasis, instead of discarding them. We embed this idea in the word encoding mechanism adopted in the BoV approaches. The technique is successfully applied to vector quantization and Fisher vector, on Caltech-101 and Caltech-256.
引用
收藏
页码:2734 / 2738
页数:5
相关论文
共 50 条
  • [21] Fusing Color and Shape for Bag-of-Words Based Object Recognition
    van de Weijer, Joost
    Khan, Fahad Shahbaz
    COMPUTATIONAL COLOR IMAGING, CCIW 2013, 2013, 7786 : 25 - 34
  • [22] BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION
    Singh, Jaspreet
    Singh, Chandan
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [23] An Effective Bag-of-visual-word Scheme for Object Recognition
    Zhang, Chunxiao
    Wen, Gaojin
    Lin, Zhaorong
    Yao, Na
    Shang, Zhiming
    Zhong, Can
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 417 - 421
  • [24] 3-dimensional bag of visual words framework on action recognition
    Wang S.
    Yang Y.
    Wei R.
    Wu Q.J.
    Yang, Yimin (yyang48@lakeheadu.ca), 2020, Tech Science Press (63): : 1081 - 1091
  • [25] Bag-of-visual-words model for artificial pornographic images recognition
    Li Fang-fang
    Luo Si-wei
    Liu Xi-yao
    Zou Bei-ji
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2016, 23 (06) : 1383 - 1389
  • [26] Bag-of-visual-words model for artificial pornographic images recognition
    Fang-fang Li
    Si-wei Luo
    Xi-yao Liu
    Bei-ji Zou
    Journal of Central South University, 2016, 23 : 1383 - 1389
  • [27] A new bag of visual words encoding method for human action recognition
    Cortes, Xavier
    Conte, Donatello
    Cardot, Hubert
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2480 - 2485
  • [28] Exploiting Visual Saliency and Bag-of-Words for Road Sign Recognition
    Xu, Dan
    Xu, Wei
    Tang, Zhenmin
    Liu, Fan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (09): : 2473 - 2482
  • [29] Bag-of-visual-words model for artificial pornographic images recognition
    李芳芳
    罗四伟
    刘熙尧
    邹北骥
    JournalofCentralSouthUniversity, 2016, 23 (06) : 1383 - 1389
  • [30] 3-Dimensional Bag of Visual Words Framework on Action Recognition
    Wang, Shiqi
    Yang, Yimin
    Wei, Ruizhong
    Wu, Qingming Jonathan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): : 1081 - 1091