Scale-space multi-view bag of words for scene categorization

被引：24

作者：

Giveki, Davar ^{[1
]}

机构：

[1] Malayer Univ, Dept Comp Engn, POB 65719-95863, Malayer, Iran

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2021年 / 80卷 / 01期

关键词：

Scene categorization; Bag of words; Scale-space features; Feature fusion; TF-IDF weighting; OF-VISUAL-WORDS; NEURAL-NETWORK; SPARSE REPRESENTATION; IMAGE CLASSIFICATION; FEATURES; MODEL; RETRIEVAL;

D O I：

10.1007/s11042-020-09759-9

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

As a widely-used method in the image categorization tasks, the Bag-of-Words (BoW) method still suffers from many limitations such as overlooking spatial information. In this paper, we propose four improvements to the BoW method to consider spatial and semantic information as well as information from multiple views. In particular, our contributions are: (a) encoding spatial information based on a combination of wavelet transform image scaling and a new image partitioning scheme, (b) proposing a spatial-information- and content-aware visual word dictionary generation approach, (c) developing a content-aware feature weighting approach to considers the significance of the features for different semantics, (d) proposing a novel weighting strategy to fuse color information when discriminative shape features are lacking. We call our method Scale-Space Multi-View Bag of Words (SSMV-BoW). We conducted extensive experiments to evaluate our SSMV-BoW and compare it to the state-of-the-art scene categorization methods. For our experiments, we use four publicly available and widely used scene categorization benchmark datasets. Results demonstrate that our SSMV-BoW outperforms the methods using both hand-crafted and deep learning features. In addition, ablation studies show that all four improvements contribute to the performance of our SSMV-BoW.

引用

页码：1223 / 1245

页数：23

共 50 条

[1] Scale-space multi-view bag of words for scene categorization
Davar Giveki
Multimedia Tools and Applications, 2021, 80 : 1223 - 1245
[2] Perceptually learning multi-view sparse representation for scene categorization
Yin, Weibin
Xu, Dongsheng
Wang, Zheng
Zhao, Zhijun
Chen, Chao
Yao, Yiyang
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 59 - 63
[3] Improving bag-of-words scheme for scene categorization
Li, Qun
Zhang, Hong-Gang
Guo, Jun
Bhanu, Bir
An, Le
Li, Q. (liqun@bupt.edu.cn), 1600, Beijing University of Posts and Telecommunications (19): : 166 - 171
[4] Hierarchical Bag-of-Words Model for Joint Multi-View Object Representation and Classification
Fu, Xiang
Purushotham, Sanjay
Xu, Daru
Li, Jian
Kuo, C. -C. Jay
2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
[5] Beyond Bag-of-Words: combining generative and discriminative models for scene categorization
Li, Zhen
Yap, Kim-Hui
MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (03) : 1033 - 1050
[6] Beyond Bag-of-Words: combining generative and discriminative models for scene categorization
Zhen Li
Kim-Hui Yap
Multimedia Tools and Applications, 2014, 71 : 1033 - 1050
[7] Multi-view Remote Sensing Image Scene Classification by Fusing Multi-scale Attention
Shi Y.
Zhou W.
Shao Z.
Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (03): : 366 - 375
[8] Deep scene-scale material estimation from multi-view indoor captures
Prakash, Siddhant
Rainer, Gilles
Bousseau, Adrien
Drettakis, George
COMPUTERS & GRAPHICS-UK, 2022, 109 : 15 - 29
[9] Multi-view representation learning in multi-task scene
Run-kun Lu
Jian-wei Liu
Si-ming Lian
Xin Zuo
Neural Computing and Applications, 2020, 32 : 10403 - 10422
[10] Multi-view representation learning in multi-task scene
Lu, Run-kun
Liu, Jian-wei
Lian, Si-ming
Zuo, Xin
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (14): : 10403 - 10422

← 1 2 3 4 5 →