Arabic Captioning for Images of Clothing Using Deep Learning

Cited by: 2

Authors:

Al-Malki, Rasha Saleh [1]

Al-Aama, Arwa Yousuf [1]

Affiliations:

[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Comp Sci Dept, Jeddah 21589, Saudi Arabia

Source:

SENSORS | 2023 / Vol. 23 / Issue 08

Keywords:

deep learning; image captioning; transfer learning; image attributes

DOI:

10.3390/s23083783

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry]

Discipline classification codes:

070302; 081704

Abstract:

Fashion is one of the many fields in which image captioning is applied. For e-commerce websites holding tens of thousands of images of clothing, automated item descriptions are highly desirable. This paper addresses captioning images of clothing in the Arabic language using deep learning. Image captioning systems build on Computer Vision and Natural Language Processing techniques because they require both visual and textual understanding. Many approaches have been proposed to build such systems. The most widely used are deep learning methods, which use an image model to analyze the visual content of the image and a language model to generate the caption. Generating captions in English using deep learning algorithms has received great attention from many researchers, but there is still a gap in generating captions in Arabic because public datasets are often not available in the Arabic language. In this work, we created an Arabic dataset for captioning images of clothing, which we named "ArabicFashionData"; this model is the first for captioning images of clothing in the Arabic language. Moreover, we classified the attributes of the images of clothing and used them as inputs to the decoder of our image captioning model to enhance Arabic caption quality. In addition, we used the attention mechanism. Our approach achieved a BLEU-1 score of 88.52. The experimental findings are encouraging and suggest that, with a bigger dataset, the attributes-based image captioning model can achieve excellent results for Arabic image captioning.
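For readers unfamiliar with the BLEU-1 metric reported in the abstract, the following is a minimal sketch of the standard sentence-level definition (clipped unigram precision multiplied by a brevity penalty). It is not the authors' evaluation code: the function name `bleu1`, the whitespace tokenization, and the Arabic example captions are all hypothetical, and the abstract does not say which tool or averaging scheme produced the 88.52 figure. The function returns a value in [0, 1]; scores such as 88.52 are conventionally that value scaled by 100.

```python
import math
from collections import Counter


def bleu1(candidate, references):
    """Sentence-level BLEU-1 for one tokenized candidate caption.

    candidate:  list of tokens produced by the captioning model
    references: list of tokenized reference captions (at least one)
    Returns a float in [0, 1].
    """
    if not candidate:
        return 0.0

    cand_counts = Counter(candidate)

    # For each token, the largest count seen in any single reference.
    max_ref_counts = Counter()
    for ref in references:
        for tok, n in Counter(ref).items():
            max_ref_counts[tok] = max(max_ref_counts[tok], n)

    # Clip each candidate token count by its maximum reference count.
    clipped = sum(min(n, max_ref_counts[tok]) for tok, n in cand_counts.items())
    precision = clipped / len(candidate)

    # Brevity penalty: use the reference length closest to the candidate
    # length (ties resolved toward the shorter reference).
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda n: (abs(n - c), n))
    brevity_penalty = 1.0 if c >= r else math.exp(1 - r / c)

    return brevity_penalty * precision


# Hypothetical captions, tokenized on whitespace (an assumption; real
# Arabic pipelines often apply morphological preprocessing first).
reference = "فستان أحمر طويل بأكمام قصيرة".split()
candidate = "فستان أحمر طويل".split()
print(round(100 * bleu1(candidate, [reference]), 2))
```

In this example every candidate token appears in the reference, so the unigram precision is 1 and the score is determined entirely by the brevity penalty for the shorter caption.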


Pages: 17
