Perceptual multi-channel visual feature fusion for scene categorization

Cited by: 14
Authors
Sun, Xiao [1]
Liu, Zhenguang [2]
Hu, Yuxing [3]
Zhang, Luming [1]
Zimmermann, Roger [2]
Affiliations
[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Anhui, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Tsinghua Univ, Sch Aerosp Engn, Beijing, Peoples R China
Keywords
Image kernel; Feature fusion; Scene categorization; Perception; MACHINE; CLASSIFICATION; MODEL;
DOI
10.1016/j.ins.2017.10.051
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Effectively recognizing sceneries from a variety of categories is an indispensable but challenging technique in computer vision and intelligent systems. In this work, we propose a novel image kernel based on human gaze shifting, aiming to discover the mechanism by which humans perceive visually/semantically salient regions within a scenery. More specifically, we first design a weakly supervised embedding algorithm which projects the local image features (i.e., graphlets in this work) onto a pre-defined semantic space. We thereby describe each graphlet by multiple visual features at both the low level and the high level. It is generally acknowledged that humans attend to only a few regions within a scenery. Thus we formulate a sparsity-constrained graphlet ranking algorithm which incorporates visual clues at both the low level and the high level. According to human visual perception, the top-ranked graphlets are either visually or semantically salient. We sequentially connect them into a path which mimics human gaze shifting. Lastly, a so-called gaze shifting kernel (GSK) is calculated based on the paths learned from a collection of scene images, and a kernel SVM is employed to predict the scene categories. Comprehensive experiments on a series of well-known scene image sets show the competitiveness and robustness of our GSK. We also demonstrate the high consistency of the predicted path with real human gaze shifting paths. (C) 2017 Published by Elsevier Inc.
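The last steps of the abstract (rank regions, link the top-ranked ones into a path, compare paths with a kernel) can be illustrated with a minimal sketch. This is not the authors' GSK: the abstract does not give its formula, so the code assumes that per-region saliency scores are already available, takes a plain top-k selection in place of the sparsity-constrained ranking, and swaps in an average RBF similarity between same-position path nodes as a stand-in kernel. The names `top_k_path`, `path_kernel`, `gram_matrix`, and the parameter `gamma` are all hypothetical.

```python
import math

def top_k_path(scores, features, k):
    """Return the feature vectors of the k highest-scoring regions, best first.

    Assumption: `scores` are precomputed saliency scores; the paper's
    sparsity-constrained ranking is NOT implemented here.
    """
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [features[i] for i in ranked[:k]]

def rbf(u, v, gamma=1.0):
    """Gaussian (RBF) similarity between two equal-length feature vectors."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * squared_distance)

def path_kernel(p, q, gamma=1.0):
    """Stand-in path kernel: mean RBF similarity of same-position nodes.

    A sum of RBF kernels over aligned positions is itself a valid
    (positive semi-definite) kernel, so it can be used with a kernel SVM.
    """
    if len(p) != len(q):
        raise ValueError("paths must have the same number of nodes")
    return sum(rbf(u, v, gamma) for u, v in zip(p, q)) / len(p)

def gram_matrix(paths, gamma=1.0):
    """Pairwise kernel values for a list of equal-length paths."""
    return [[path_kernel(p, q, gamma) for q in paths] for p in paths]

# Toy usage: two images, three regions each, 2-D region features.
path_a = top_k_path([0.1, 0.9, 0.5], [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]], k=2)
path_b = top_k_path([0.7, 0.2, 0.8], [[1.0, 1.0], [5.0, 5.0], [2.0, 2.5]], k=2)
K = gram_matrix([path_a, path_b])
```

A Gram matrix of this kind could then be handed to any SVM implementation that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`.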
Pages: 37-48
Number of pages: 12