Arabic Captioning for Images of Clothing Using Deep Learning

Cited by: 2
Authors
Al-Malki, Rasha Saleh [1]
Al-Aama, Arwa Yousuf [1]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Comp Sci Dept, Jeddah 21589, Saudi Arabia
Keywords
deep learning; image captioning; transfer learning; image attributes
DOI
10.3390/s23083783
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Fashion is one of the many fields in which image captioning is applied. For e-commerce websites holding tens of thousands of images of clothing, automated item descriptions are highly desirable. This paper addresses captioning images of clothing in the Arabic language using deep learning. Image captioning systems build on Computer Vision and Natural Language Processing techniques because they require both visual and textual understanding. Many approaches have been proposed to build such systems. The most widely used are deep learning methods, which use an image model to analyze the visual content of the image and a language model to generate the caption. Caption generation in English using deep learning has received considerable attention from researchers, but there is still a gap in generating captions in Arabic because public datasets are often not available in that language. In this work, we created an Arabic dataset for captioning images of clothing, which we named "ArabicFashionData"; this model is the first for captioning images of clothing in the Arabic language. Moreover, we classified the attributes of the images of clothing and used them as inputs to the decoder of our image captioning model to enhance Arabic caption quality. In addition, we used an attention mechanism. Our approach achieved a BLEU-1 score of 88.52. The experimental findings are encouraging and suggest that, with a larger dataset, the attribute-based image captioning model can achieve excellent results for Arabic image captioning.
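For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how a sentence-level BLEU-1 score can be computed: clipped unigram precision multiplied by a brevity penalty. It is not the authors' evaluation code. The function name `bleu1`, the whitespace-style tokenization, and the example captions are assumptions made for illustration; the abstract does not state the tokenizer or whether scores were averaged at corpus or sentence level. A reported value such as 88.52 is presumably on a 0 to 100 scale, whereas this sketch returns a value between 0 and 1.

```python
import math
from collections import Counter


def bleu1(candidate, references):
    """Sentence-level BLEU-1 (illustrative sketch, not the paper's script).

    candidate:  list of tokens in the generated caption
    references: list of token lists (one or more reference captions)
    """
    if not candidate or not references:
        return 0.0

    cand_counts = Counter(candidate)

    # For each token, the largest count observed in any single reference.
    max_ref_counts = Counter()
    for ref in references:
        for tok, n in Counter(ref).items():
            max_ref_counts[tok] = max(max_ref_counts[tok], n)

    # Clip each candidate count by the reference maximum.
    clipped = sum(min(n, max_ref_counts[tok]) for tok, n in cand_counts.items())
    precision = clipped / len(candidate)

    # Reference length closest to the candidate length (ties go to the shorter).
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda n: (abs(n - c), n))

    # Brevity penalty: penalize candidates shorter than the reference.
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * precision


# Hypothetical Arabic captions ("long red dress" vs. "short red dress").
candidate = ["فستان", "أحمر", "طويل"]
reference = ["فستان", "أحمر", "قصير"]
score = bleu1(candidate, [reference])
print(round(100 * score, 2))
```

Two of the three candidate tokens appear in the reference and the lengths match, so the brevity penalty is 1 and the score is two thirds.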
Pages: 17