WEIGHTED BAG OF VISUAL WORDS FOR OBJECT RECOGNITION

被引:0
|
作者
San Biagio, Marco [1 ]
Bazzani, Loris [1 ,2 ]
Cristani, Marco [1 ,2 ]
Murino, Vittorio [1 ,2 ]
机构
[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis, Via Morego 30, I-16163 Genoa, Italy
[2] Univ Verona, Dept Informat, I-37134 Verona, Italy
关键词
object recognition; dictionary learning; visual saliency; feature weighting;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Bag of Visual words (BoV) is one of the most successful strategy for object recognition, used to represent an image as a vector of counts using a learned vocabulary. This strategy assumes that the representation is built using patches that are either densely extracted or sampled from the images using feature detectors. However, the dense strategy captures also the noisy background information, whereas the feature detection strategy can lose important parts of the objects. In this paper we propose a solution in-between these two strategies, by densely extracting patches from the image, and weighting them accordingly to their salience. Intuitively, highly salient patches have an important role in describing an object, while those with low saliency are still taken with low emphasis, instead of discarding them. We embed this idea in the word encoding mechanism adopted in the BoV approaches. The technique is successfully applied to vector quantization and Fisher vector, on Caltech-101 and Caltech-256.
引用
收藏
页码:2734 / 2738
页数:5
相关论文
共 50 条
  • [41] EXPANDED BAG OF WORDS REPRESENTATION FOR OBJECT CLASSIFICATION
    Liu, Tinglin
    Liu, Jing
    Liu, Qinshan
    Lu, Hanqing
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 297 - 300
  • [42] Class Representative Visual Words for Category-Level Object Recognition
    Lopez Sastre, Roberto Javier
    Tuytelaars, Tinne
    Maldonado Bascon, Saturnino
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PROCEEDINGS, 2009, 5524 : 184 - +
  • [43] Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice
    Peng, Xiaojiang
    Wang, Limin
    Wang, Xingxing
    Qiao, Yu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 150 : 109 - 125
  • [44] Region Matching Techniques for Spatial Bag of Visual Words Based Image Category Recognition
    Viitaniemi, Ville
    Laaksonen, Jorma
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 531 - 540
  • [45] Recognition of Indian Sign Language Using ORB with Bag of Visual Words by Kinect Sensor
    Gangrade, Jayesh
    Bharti, Jyoti
    Mulye, Anchit
    IETE JOURNAL OF RESEARCH, 2022, 68 (04) : 2953 - 2967
  • [46] Human Action Recognition using Spatial-Temporal Analysis and Bag of Visual Words
    Naidoo, Denver
    Tapamo, Jules-Raymond
    Walingo, Tom
    2018 14TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS), 2018, : 697 - 702
  • [47] Traffic Sign Recognition Based on Prevailing Bag of Visual Words Representation on Feature Descriptors
    Virupakshappa, Kushal
    Han, Yan
    Oruklu, Erdal
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2015, : 489 - 493
  • [48] An object detection and classification method for underwater visual images based on the bag-of-words model
    Zhang, Tianchi
    Li, Qian
    Liu, Xing
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART M-JOURNAL OF ENGINEERING FOR THE MARITIME ENVIRONMENT, 2023, 237 (02) : 487 - 497
  • [49] Bag of Visual Words Method based on PLSA and Chi-Square Model for Object Category
    Zhao, Yongwei
    Peng, Tianqiang
    Li, Bicheng
    Ke, Shengcai
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2015, 9 (07): : 2633 - 2648
  • [50] Using Bag of Visual Words and Spatial Pyramid Matching for Object Classification along with Applications for RIS
    Vyas, Kushal
    Vora, Yash
    Vastani, Raj
    TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 457 - 464