Self-Attention-Based Edge Computing Model for Synthesis Image to Text through Next-Generation AI Mechanism

被引:2
|
作者
Alshehri, Hamdan Ali [1 ]
Junath, N. [2 ]
Panwar, Poonam [3 ]
Shukla, Kirti [4 ]
Rahin, Saima Ahmed [5 ]
Martin, R. John [1 ]
机构
[1] Jazan Univ, Fac Comp Sci & Informat Technol, Jizan, Saudi Arabia
[2] Univ Technol & Appl Sci, Informat Technol, Ibri, Oman
[3] Chitkara Univ, Inst Engn & Technol, Chandigarh, Punjab, India
[4] Galgotias Univ, Noida, India
[5] United Int Univ, Dhaka, Bangladesh
关键词
ADVERSARIAL NETWORK; SEGMENTATION;
D O I
10.1155/2022/4973535
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Image synthesis based on natural language description has become a research hotspot in edge computing in artificial intelligence. With the help of generative adversarial edge computing networks, the field has made great strides in high-resolution image synthesis. However, there are still some defects in the authenticity of synthetic single-target images. For example, there will be abnormal situations such as "multiple heads" and "multiple mouths" when synthesizing bird graphics. Aiming at such problems, a text generation single-target model SA-AttnGAN based on a self-attention mechanism is proposed. SA-AttnGAN (Attentional Generative Adversarial Network) refines text features into word features and sentence features to improve the semantic alignment of text and images; in the initialization stage of AttnGAN, the self-attention mechanism is used to improve the stability of the text-generated image model; the multistage GAN network is used to superimpose, finally synthesizing high-resolution images. Experimental data show that SA-AttnGAN outperforms other comparable models in terms of Inception Score and Frechet Inception Distance; synthetic image analysis shows that this model can learn background and colour information and correctly capture bird heads and mouths. The structural information of other components is improved, and the AttnGAN model generates incorrect images such as "multiple heads" and "multiple mouths." Furthermore, SA-AttnGAN is successfully applied to description-based clothing image synthesis with good generalization ability.
引用
收藏
页数:12
相关论文
共 20 条
  • [1] Self-Attention-Based Edge Computing Model for Synthesis Image to Text through Next-Generation AI Mechanism
    Alshehri, Hamdan Ali
    Junath, N.
    Panwar, Poonam
    Shukla, Kirti
    Rahin, Saima Ahmed
    Martin, R. John
    Mathematical Problems in Engineering, 2022, 2022
  • [2] Self-Attention-Based Edge Computing Model for Synthesis Image to Text through Next-Generation AI Mechanism
    Alshehri, Hamdan Ali
    Junath, N.
    Panwar, Poonam
    Shukla, Kirti
    Rahin, Saima Ahmed
    Martin, R. John
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [3] AI/ML-Based Sensing-Assisted Edge Computing in Next-Generation Mobile Networks
    Hossain, Abdullah Ridwan
    Kiani, Abbas
    Saboorian, Tony
    Xiang, Amanda
    Kaippallimali, John
    Ansari, Nirwan
    2023 IEEE CONFERENCE ON STANDARDS FOR COMMUNICATIONS AND NETWORKING, CSCN, 2023, : 77 - 82
  • [4] Self-Attention-Based BiLSTM Model for Short Text Fine-Grained Sentiment Classification
    Xie, Jun
    Chen, Bo
    Gu, Xinglong
    Liang, Fengmei
    Xu, Xinying
    IEEE ACCESS, 2019, 7 : 180558 - 180570
  • [5] Next-Generation Edge Computing Assisted Autonomous Driving Based Artificial Intelligence Algorithms
    Ibn-Khedher, Hatem
    Laroui, Mohammed
    Moungla, Hassine
    Afifi, Hossam
    Abd-Elrahman, Emad
    IEEE ACCESS, 2022, 10 : 53987 - 54001
  • [6] A Text Sentiment Analysis Model Based on Self-Attention Mechanism
    Ji, Likun
    Gong, Ping
    Yao, Zhuyu
    2019 THE 3RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPILATION, COMPUTING AND COMMUNICATIONS (HP3C 2019), 2019, : 33 - 37
  • [7] Text-to-Image Method Based on Attention Model with Increased Gate Mechanism
    Chen, Jize
    Jiang, Xiaoyan
    Gao, Yongbin
    Computer Engineering and Applications, 2023, 59 (12): : 208 - 216
  • [8] Wind turbine fault detection and identification through self-attention-based mechanism embedded with a multivariable query pattern
    Wang, Anqi
    Pei, Yan
    Zhu, Yunyi
    Qian, Zheng
    RENEWABLE ENERGY, 2023, 211 : 918 - 937
  • [9] Text-to-image generation method based on self-supervised attention and image features fusion
    Liao, Yonghui
    Zhang, Haitao
    Jin, Haibo
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (02) : 180 - 191
  • [10] Next-generation reservoir computing water quality prediction model based on the whale optimization algorithm
    Zhou, Junyu
    Pei, Lijun
    Zheng, Zhiwei
    INTERNATIONAL JOURNAL OF DYNAMICS AND CONTROL, 2025, 13 (04)