Scale-space multi-view bag of words for scene categorization

Cited by: 24
Authors
Giveki, Davar [1]
Affiliations
[1] Malayer Univ, Dept Comp Engn, POB 65719-95863, Malayer, Iran
Keywords
Scene categorization; Bag of words; Scale-space features; Feature fusion; TF-IDF weighting; OF-VISUAL-WORDS; NEURAL-NETWORK; SPARSE REPRESENTATION; IMAGE CLASSIFICATION; FEATURES; MODEL; RETRIEVAL
DOI
10.1007/s11042-020-09759-9
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Although the Bag-of-Words (BoW) method is widely used in image categorization tasks, it still suffers from many limitations, such as overlooking spatial information. In this paper, we propose four improvements to the BoW method that take into account spatial and semantic information as well as information from multiple views. In particular, our contributions are: (a) encoding spatial information based on a combination of wavelet transform image scaling and a new image partitioning scheme, (b) proposing a spatial-information- and content-aware visual word dictionary generation approach, (c) developing a content-aware feature weighting approach that considers the significance of the features for different semantics, and (d) proposing a novel weighting strategy to fuse color information when discriminative shape features are lacking. We call our method Scale-Space Multi-View Bag of Words (SSMV-BoW). We conducted extensive experiments to evaluate SSMV-BoW and compare it to state-of-the-art scene categorization methods. For our experiments, we used four publicly available and widely used scene categorization benchmark datasets. Results demonstrate that SSMV-BoW outperforms methods using both hand-crafted and deep learning features. In addition, ablation studies show that all four improvements contribute to the performance of SSMV-BoW.
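The limitation named at the start of the abstract, that plain BoW overlooks spatial information, can be illustrated with a minimal sketch. This is not the SSMV-BoW method and is not taken from the paper; it is a generic baseline BoW histogram, assuming random vectors as stand-ins for local descriptors and for a learned codebook. The names `bow_histogram`, `codebook`, `image_a`, and `image_b` are hypothetical, and the row order of each descriptor array is used here as a stand-in for spatial layout.

```python
import numpy as np

def bow_histogram(descriptors, codebook):
    """Assign each descriptor to its nearest visual word and count occurrences."""
    # Squared Euclidean distances, shape (n_descriptors, n_words)
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)
    return np.bincount(words, minlength=len(codebook)).astype(float)

rng = np.random.default_rng(0)
codebook = rng.normal(size=(8, 16))     # 8 hypothetical visual words (assumption)
image_a = rng.normal(size=(50, 16))     # 50 stand-in local descriptors (assumption)
image_b = image_a[rng.permutation(50)]  # same descriptors, layout shuffled

hist_a = bow_histogram(image_a, codebook)
hist_b = bow_histogram(image_b, codebook)

# The histogram only counts words, so rearranging the descriptors changes nothing.
same = np.array_equal(hist_a, hist_b)
print(same)
```

Because the two histograms are identical, a classifier fed only these counts cannot distinguish the two layouts. Schemes that encode spatial information, such as the partitioning-based one the abstract describes, are aimed at this gap.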
Pages: 1223-1245
Page count: 23
Related papers
50 in total
  • [1] Scale-space multi-view bag of words for scene categorization
    Davar Giveki
    Multimedia Tools and Applications, 2021, 80 : 1223 - 1245
  • [2] Perceptually learning multi-view sparse representation for scene categorization
    Yin, Weibin
    Xu, Dongsheng
    Wang, Zheng
    Zhao, Zhijun
    Chen, Chao
    Yao, Yiyang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 : 59 - 63
  • [3] Improving bag-of-words scheme for scene categorization
    Li, Qun
    Zhang, Hong-Gang
    Guo, Jun
    Bhanu, Bir
    An, Le
    Li, Q. (liqun@bupt.edu.cn), 1600, Beijing University of Posts and Telecommunications (19) : 166 - 171
  • [4] Hierarchical Bag-of-Words Model for Joint Multi-View Object Representation and Classification
    Fu, Xiang
    Purushotham, Sanjay
    Xu, Daru
    Li, Jian
    Kuo, C. -C. Jay
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [5] Beyond Bag-of-Words: combining generative and discriminative models for scene categorization
    Li, Zhen
    Yap, Kim-Hui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (03) : 1033 - 1050
  • [6] Beyond Bag-of-Words: combining generative and discriminative models for scene categorization
    Zhen Li
    Kim-Hui Yap
    Multimedia Tools and Applications, 2014, 71 : 1033 - 1050
  • [7] Multi-view Remote Sensing Image Scene Classification by Fusing Multi-scale Attention
    Shi Y.
    Zhou W.
    Shao Z.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (03) : 366 - 375
  • [8] Deep scene-scale material estimation from multi-view indoor captures
    Prakash, Siddhant
    Rainer, Gilles
    Bousseau, Adrien
    Drettakis, George
    COMPUTERS & GRAPHICS-UK, 2022, 109 : 15 - 29
  • [9] Multi-view representation learning in multi-task scene
    Run-kun Lu
    Jian-wei Liu
    Si-ming Lian
    Xin Zuo
    Neural Computing and Applications, 2020, 32 : 10403 - 10422
  • [10] Multi-view representation learning in multi-task scene
    Lu, Run-kun
    Liu, Jian-wei
    Lian, Si-ming
    Zuo, Xin
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (14) : 10403 - 10422