In the context of addressing new image classification tasks with insufficient training samples via pre-trained deep learning models, methods based on the Bag-of-Deep-Visual-Words (BoDVW) model have achieved higher classification accuracy across various image classification tasks than directly using the new classification layer of the pre-trained model. These methods perform a sequence of operations on the input image - deep feature extraction, feature encoding, and feature pooling - to obtain an image representation vector, which is then fed into classifiers. However, they ignore two crucial aspects: the high-level semantic characteristics of deep features and their local context within the feature space, which limits classification performance. To address this issue, we propose a new image classification method with a unique workflow. Specifically, our method identifies low-entropy local regions in the feature space by constructing multiple decision trees on the set of labelled deep features built from training images. For a given image, the voting vector of each deep feature is calculated from the category label distributions of the low-entropy local regions in which the feature falls; this vector reflects how strongly the feature supports the hypothesis that the image belongs to each category. The voting vectors of all features are aggregated over image regions of different sizes and positions to obtain the representation vector of the image. The representation vectors of testing images are fed into Support Vector Machines (SVMs), trained on those of training images, to predict their categories. Experimental results on six public datasets show that our method achieves higher classification accuracy by 0.07% to 3.6% (averaging 0.8%) compared to two BoDVW methods, and by 0.1% to 10.69% (averaging 2.69%) compared to directly using the new classification layer of the pre-trained model. These results demonstrate the effectiveness of considering the high-level semantic characteristics of deep features and their local context within the feature space for image classification. Importantly, the unique workflow of our method opens up new avenues for improving classification performance, including increasing the number of local regions in which deep features originate primarily from one or a few image categories, improving the accuracy of low-entropy local region identification, and developing an end-to-end deep learning model based on this workflow. While maintaining classification accuracy comparable to recent works, our method offers notable potential for advancing the field of image classification.
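
To make the workflow concrete, the following is a minimal sketch under simplifying assumptions: deep features are treated as labelled points, the leaf nodes of decision trees stand in for low-entropy local regions, and per-feature voting vectors are averaged globally into an image representation before SVM classification. The function names, the entropy threshold, and the use of scikit-learn's DecisionTreeClassifier leaves as local regions are illustrative assumptions, not the exact implementation described in the paper.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.svm import SVC

# Illustrative sketch: leaf nodes of decision trees fitted on bootstrap samples
# stand in for low-entropy local regions of the deep-feature space (an
# assumption, not the paper's exact procedure).

def fit_trees(features, labels, n_trees=5, max_leaf_nodes=256, seed=0):
    """Fit several decision trees on labelled deep features (one row per feature)."""
    rng = np.random.default_rng(seed)
    trees = []
    for _ in range(n_trees):
        idx = rng.choice(len(features), size=len(features), replace=True)
        tree = DecisionTreeClassifier(max_leaf_nodes=max_leaf_nodes,
                                      random_state=int(rng.integers(1_000_000)))
        tree.fit(features[idx], labels[idx])
        trees.append(tree)
    return trees

def leaf_label_distributions(tree, features, labels, n_classes):
    """Per-leaf category label distributions and entropies, computed from training features."""
    leaves = tree.apply(features)
    dists, entropies = {}, {}
    for leaf in np.unique(leaves):
        counts = np.bincount(labels[leaves == leaf], minlength=n_classes).astype(float)
        p = counts / counts.sum()
        dists[leaf] = p
        entropies[leaf] = -np.sum(p[p > 0] * np.log(p[p > 0]))
    return dists, entropies

def image_representation(trees, stats, img_features, n_classes, entropy_thr=1.0):
    """Average the voting vectors of an image's deep features over low-entropy leaves."""
    votes = np.zeros((len(img_features), n_classes))
    for tree, (dists, entropies) in zip(trees, stats):
        leaves = tree.apply(img_features)
        for i, leaf in enumerate(leaves):
            if entropies[leaf] <= entropy_thr:   # keep only low-entropy regions
                votes[i] += dists[leaf]
    return votes.mean(axis=0)  # global pooling only; the paper also pools over sub-regions

# Hypothetical usage, assuming X_train / X_test are lists of per-image deep-feature arrays:
#   trees = fit_trees(all_train_features, all_train_feature_labels)
#   stats = [leaf_label_distributions(t, all_train_features, all_train_feature_labels, n_classes)
#            for t in trees]
#   reps_train = np.stack([image_representation(trees, stats, f, n_classes) for f in X_train])
#   clf = SVC(kernel="linear").fit(reps_train, y_train)
```

In the full method, the per-feature voting vectors would additionally be pooled over image regions of different sizes and positions and the resulting vectors concatenated, rather than averaged once over the whole image as in this sketch.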