WEIGHTED BAG OF VISUAL WORDS FOR OBJECT RECOGNITION

被引：0

作者：

San Biagio, Marco ^{[1
]}

Bazzani, Loris ^{[1
,2
]}

Cristani, Marco ^{[1
,2
]}

Murino, Vittorio ^{[1
,2
]}

机构：

[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis, Via Morego 30, I-16163 Genoa, Italy

[2] Univ Verona, Dept Informat, I-37134 Verona, Italy

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2014年

关键词：

object recognition; dictionary learning; visual saliency; feature weighting;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Bag of Visual words (BoV) is one of the most successful strategy for object recognition, used to represent an image as a vector of counts using a learned vocabulary. This strategy assumes that the representation is built using patches that are either densely extracted or sampled from the images using feature detectors. However, the dense strategy captures also the noisy background information, whereas the feature detection strategy can lose important parts of the objects. In this paper we propose a solution in-between these two strategies, by densely extracting patches from the image, and weighting them accordingly to their salience. Intuitively, highly salient patches have an important role in describing an object, while those with low saliency are still taken with low emphasis, instead of discarding them. We embed this idea in the word encoding mechanism adopted in the BoV approaches. The technique is successfully applied to vector quantization and Fisher vector, on Caltech-101 and Caltech-256.

引用

页码：2734 / 2738

页数：5

共 50 条

[21] Fusing Color and Shape for Bag-of-Words Based Object Recognition
van de Weijer, Joost
Khan, Fahad Shahbaz
COMPUTATIONAL COLOR IMAGING, CCIW 2013, 2013, 7786 : 25 - 34
[22] BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION
Singh, Jaspreet
Singh, Chandan
2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
[23] An Effective Bag-of-visual-word Scheme for Object Recognition
Zhang, Chunxiao
Wen, Gaojin
Lin, Zhaorong
Yao, Na
Shang, Zhiming
Zhong, Can
2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 417 - 421
[24] 3-dimensional bag of visual words framework on action recognition
Wang S.
Yang Y.
Wei R.
Wu Q.J.
Yang, Yimin (yyang48@lakeheadu.ca), 2020, Tech Science Press (63): : 1081 - 1091
[25] Bag-of-visual-words model for artificial pornographic images recognition
Li Fang-fang
Luo Si-wei
Liu Xi-yao
Zou Bei-ji
JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2016, 23 (06) : 1383 - 1389
[26] Bag-of-visual-words model for artificial pornographic images recognition
Fang-fang Li
Si-wei Luo
Xi-yao Liu
Bei-ji Zou
Journal of Central South University, 2016, 23 : 1383 - 1389
[27] A new bag of visual words encoding method for human action recognition
Cortes, Xavier
Conte, Donatello
Cardot, Hubert
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2480 - 2485
[28] Exploiting Visual Saliency and Bag-of-Words for Road Sign Recognition
Xu, Dan
Xu, Wei
Tang, Zhenmin
Liu, Fan
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (09): : 2473 - 2482
[29] Bag-of-visual-words model for artificial pornographic images recognition
李芳芳
罗四伟
刘熙尧
邹北骥
JournalofCentralSouthUniversity, 2016, 23 (06) : 1383 - 1389
[30] 3-Dimensional Bag of Visual Words Framework on Action Recognition
Wang, Shiqi
Yang, Yimin
Wei, Ruizhong
Wu, Qingming Jonathan
CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (03): : 1081 - 1091

← 1 2 3 4 5 →