Deep feature voting: a semantic-driven and local context-aware approach for image classification

Cited by: 0
Authors
Xu, Ye [1]
Duan, Lihua [1]
Huang, Conggui [1]
Huang, Chongpeng [1]
Affiliations
[1] Wuxi Inst Technol, Sch IoT Technol, 1600 Gaolang West Rd, Wuxi 214121, Jiangsu, Peoples R China
Keywords
Image classification; Deep learning model; Deep feature; Voting; Decision tree; NEURAL-NETWORKS; FEATURE FUSION; CNN; MODELS;
DOI
10.1007/s11042-023-17881-7
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
When pre-trained deep learning models are applied to new image classification tasks with insufficient training samples, methods based on the Bag-of-Deep-Visual-Words (BoDVW) model have achieved higher classification accuracy across various tasks than directly using the new classification layer of the pre-trained model. These methods perform a sequence of operations on the input image (deep feature extraction, feature encoding, and feature pooling) to obtain an image representation vector, which is then fed into a classifier. However, they ignore two crucial aspects, the high-level semantic characteristics of deep features and their local context within the feature space, which limits classification performance. To address this issue, we propose a new image classification method with a unique workflow. Specifically, our method identifies low-entropy local regions in the feature space by constructing multiple decision trees from the set of labelled deep features extracted from the training images. For a given image, the voting vector of each deep feature is calculated from the category label distributions of the low-entropy local regions in which the feature is located. This vector reflects the degree of support that the feature provides for the hypothesis that it belongs to each category. The voting vectors of all features are aggregated over image regions of different sizes and positions to obtain the representation vector of the image. The representation vectors of testing images are then fed into Support Vector Machines (SVMs), trained on the representation vectors of the training images, to predict their categories.
Experimental results on six public datasets show that our method improves classification accuracy by 0.07% to 3.6% (0.8% on average) over two BoDVW methods, and by 0.1% to 10.69% (2.69% on average) over directly using the new classification layer of the pre-trained model. These results demonstrate the effectiveness of considering the high-level semantic characteristics of deep features and their local context within the feature space for image classification. Importantly, the unique workflow of our method opens up new avenues for improving classification performance. These include increasing the number of local regions in which deep features originate primarily from one or a few image categories, improving the accuracy of low-entropy local region identification, and developing an end-to-end deep learning model based on this workflow. While maintaining classification accuracy comparable to recent works, our method offers notable potential for the advancement of the image classification field.
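The voting and aggregation steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not confirm: the local regions are taken to be decision-tree leaves whose label counts are already known, a region counts as "low-entropy" when its Shannon entropy falls under a hypothetical threshold `max_entropy`, the voting vector is the plain average of the label distributions of the qualifying regions, and "image regions of different sizes and positions" is modelled as a small spatial grid pyramid. All function and parameter names are hypothetical. The decision-tree construction and the final SVM stage are omitted.

```python
import math


def entropy(counts):
    """Shannon entropy (in bits) of a list of per-category label counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def voting_vector(region_counts, n_classes, max_entropy):
    """Average the label distributions of the low-entropy regions
    (assumed here to be tree leaves) that contain one deep feature.

    region_counts: one list of per-category counts per tree.
    max_entropy:   hypothetical threshold; the abstract gives no value.
    """
    vote = [0.0] * n_classes
    used = 0
    for counts in region_counts:
        if entropy(counts) <= max_entropy:
            total = sum(counts)
            for k in range(n_classes):
                vote[k] += counts[k] / total
            used += 1
    # A feature that falls in no low-entropy region casts an all-zero vote.
    return [v / used for v in vote] if used else vote


def image_representation(features, n_classes, grid_levels=(1, 2)):
    """Sum voting vectors inside each cell of a g x g grid for every level g,
    then concatenate the cells in row-major order (an assumed pooling scheme).

    features: list of ((x, y), vote) pairs, with x and y normalized to [0, 1].
    """
    rep = []
    for g in grid_levels:
        cells = [[0.0] * n_classes for _ in range(g * g)]
        for (x, y), vote in features:
            col = min(int(x * g), g - 1)
            row = min(int(y * g), g - 1)
            cell = cells[row * g + col]
            for k in range(n_classes):
                cell[k] += vote[k]
        for cell in cells:
            rep.extend(cell)
    return rep


if __name__ == "__main__":
    # One feature seen by three trees; only the first and third leaves are low-entropy.
    v = voting_vector([[9, 1], [5, 5], [10, 0]], n_classes=2, max_entropy=0.5)
    rep = image_representation([((0.1, 0.1), v), ((0.9, 0.9), [0.0, 1.0])], n_classes=2)
    print(v, len(rep))
```

With two categories and grid levels 1 and 2, the representation has (1 + 4) x 2 = 10 entries. In the workflow described above, such vectors would be computed for both training and testing images and passed to an SVM.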
Pages: 58607-58643
Number of pages: 37