Scene Description for Visually Impaired People with Multi-Label Convolutional SVM Networks

被引：10

作者：

Bazi, Yakoub ^{[1
]}

Alhichri, Haikel ^{[1
]}

Alajlan, Naif ^{[1
]}

Melgani, Farid ^{[2
]}

机构：

[1] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11543, Saudi Arabia

[2] Univ Trento, Dept Informat Engn & Comp Sci, Via Sommarive 9, I-38123 Trento, Italy

来源：

APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 23期

关键词：

visually impaired (VI); computer vision; deep learning; multi-label convolutional support vector machine (M-CSVM); OBJECT DETECTION; RECOGNITION; AID;

D O I：

10.3390/app9235062

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In this paper, we present a portable camera-based method for helping visually impaired (VI) people to recognize multiple objects in images. This method relies on a novel multi-label convolutional support vector machine (CSVM) network for coarse description of images. The core idea of CSVM is to use a set of linear SVMs as filter banks for feature map generation. During the training phase, the weights of the SVM filters are obtained using a forward-supervised learning strategy unlike the backpropagation algorithm used in standard convolutional neural networks (CNNs). To handle multi-label detection, we introduce a multi-branch CSVM architecture, where each branch will be used for detecting one object in the image. This architecture exploits the correlation between the objects present in the image by means of an opportune fusion mechanism of the intermediate outputs provided by the convolution layers of each branch. The high-level reasoning of the network is done through binary classification SVMs for predicting the presence/absence of objects in the image. The experiments obtained on two indoor datasets and one outdoor dataset acquired from a portable camera mounted on a lightweight shield worn by the user, and connected via a USB wire to a laptop processing unit are reported and discussed.

引用

页数：13

共 50 条

[41] Multi-Label Image Classification Based on Object Detection and Dynamic Graph Convolutional Networks
Liu, Xiaoyu
Hu, Yong
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (03): : 4413 - 4432
[42] Organ detection in thorax abdomen CT using multi-label convolutional neural networks
Mamani, Gabriel Efrain Humpire
Setio, Arnaud Arindra Adiyoso
van Ginneken, Bram
Jacobs, Colin
MEDICAL IMAGING 2017: COMPUTER-AIDED DIAGNOSIS, 2017, 10134
[43] Deep Recurrent Architecture based Scene Description Generator for Visually Impaired
Chharia, Aviral
Upadhyay, Rahul
2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 136 - 141
[44] Comparison of Representations of Named Entities for Multi-label Document Classification with Convolutional Neural Networks
Pivovarova, Lidia
Yangarber, Roman
REPRESENTATION LEARNING FOR NLP, 2018, : 64 - 68
[45] A BASELINE FOR MULTI-LABEL IMAGE CLASSIFICATION USING AN ENSEMBLE OF DEEP CONVOLUTIONAL NEURAL NETWORKS
Wang, Qian
Jia, Ning
Breckon, Toby P.
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 644 - 648
[46] Multi-Label Graph Convolutional Network Representation Learning
Shi, Min
Tang, Yufei
Zhu, Xingquan
Liu, Jianxun
IEEE TRANSACTIONS ON BIG DATA, 2022, 8 (05) : 1169 - 1181
[47] PageCNNs: Convolutional Neural Networks for Multi-label Chinese Webpage Classification with Multi-information Fusion
Zheng, Jiawei
Chen, Junying
Cai, Yi
WEB AND BIG DATA, PT III, APWEB-WAIM 2023, 2024, 14333 : 204 - 219
[48] An Improved Multi-label Classification Based on Label Ranking and Delicate Boundary SVM
Chen, Benhui
Gu, Weifeng
Hu, Jinglu
2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
[49] Combining binary-SVM and pairwise label constraints for multi-label classification
Gu, Weifeng
Chen, Benhui
Hu, Jinglu
IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
[50] Scene-Aware Label Graph Learning for Multi-Label Image Classification
Zhu, Xuelin
Liu, Jian
Liu, Weijia
Ge, Jiawei
Liu, Bo
Cao, Jiuxin
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 1473 - 1482

← 1 2 3 4 5 →