Self-Attention-Based Edge Computing Model for Synthesis Image to Text through Next-Generation AI Mechanism

被引:2
|
作者
Alshehri, Hamdan Ali [1 ]
Junath, N. [2 ]
Panwar, Poonam [3 ]
Shukla, Kirti [4 ]
Rahin, Saima Ahmed [5 ]
Martin, R. John [1 ]
机构
[1] Jazan Univ, Fac Comp Sci & Informat Technol, Jizan, Saudi Arabia
[2] Univ Technol & Appl Sci, Informat Technol, Ibri, Oman
[3] Chitkara Univ, Inst Engn & Technol, Chandigarh, Punjab, India
[4] Galgotias Univ, Noida, India
[5] United Int Univ, Dhaka, Bangladesh
关键词
ADVERSARIAL NETWORK; SEGMENTATION;
D O I
10.1155/2022/4973535
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Image synthesis based on natural language description has become a research hotspot in edge computing in artificial intelligence. With the help of generative adversarial edge computing networks, the field has made great strides in high-resolution image synthesis. However, there are still some defects in the authenticity of synthetic single-target images. For example, there will be abnormal situations such as "multiple heads" and "multiple mouths" when synthesizing bird graphics. Aiming at such problems, a text generation single-target model SA-AttnGAN based on a self-attention mechanism is proposed. SA-AttnGAN (Attentional Generative Adversarial Network) refines text features into word features and sentence features to improve the semantic alignment of text and images; in the initialization stage of AttnGAN, the self-attention mechanism is used to improve the stability of the text-generated image model; the multistage GAN network is used to superimpose, finally synthesizing high-resolution images. Experimental data show that SA-AttnGAN outperforms other comparable models in terms of Inception Score and Frechet Inception Distance; synthetic image analysis shows that this model can learn background and colour information and correctly capture bird heads and mouths. The structural information of other components is improved, and the AttnGAN model generates incorrect images such as "multiple heads" and "multiple mouths." Furthermore, SA-AttnGAN is successfully applied to description-based clothing image synthesis with good generalization ability.
引用
收藏
页数:12
相关论文
共 20 条
  • [11] Edge-Net: A Self-supervised Medical Image Segmentation Model Based on Edge Attention
    Wang, Miao
    Zheng, Zechen
    Fan, Chao
    Wang, Congqian
    He, Xuelei
    He, Xiaowei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XV, 2025, 15045 : 241 - 254
  • [12] Image classification model based on large kernel attention mechanism and relative position self-attention mechanism
    Liu S.
    Wei J.
    Liu G.
    Zhou B.
    PeerJ Computer Science, 2023, 9
  • [13] Image classification model based on large kernel attention mechanism and relative position self-attention mechanism
    Liu, Siqi
    Wei, Jiangshu
    Liu, Gang
    Zhou, Bei
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [14] GAN with opposition-based blocks and channel self-attention mechanism for image synthesis
    Liu, Gang
    Ke, Aihua
    Wu, Xinyun
    Zhang, Haifeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [15] FAGAN :AN ADVERSARIAL GENERATION METHOD OF SOLAR CELLS DEFECT IMAGE BASED ON MODEL TRANSFER AND ATTENTION MECHANISM
    Sun L.
    Mao J.
    Liu K.
    Taiyangneng Xuebao/Acta Energiae Solaris Sinica, 2023, 44 (09): : 78 - 84
  • [16] Finetuning of GLIDE stable diffusion model for AI-based text-conditional image synthesis of dermoscopic images
    Shavlokhova, Veronika
    Vollmer, Andreas
    Zouboulis, Christos C.
    Vollmer, Michael
    Wollborn, Jakob
    Lang, Gernot
    Kuebler, Alexander
    Hartmann, Stefan
    Stoll, Christian
    Roider, Elisabeth
    Saravi, Babak
    FRONTIERS IN MEDICINE, 2023, 10
  • [17] BVA-Transformer: Image-text multimodal classification and dialogue model architecture based on Blip and visual attention mechanism
    Zhang, Kaiyu
    Wu, Fei
    Zhang, Guowei
    Liu, Jiawei
    Li, Min
    DISPLAYS, 2024, 83
  • [18] Obj-SA-GAN: Object-Driven Text-to-Image Synthesis with Self-Attention Based Full Semantic Information Mining
    Li, Ruijun
    Li, Weihua
    Yang, Yi
    Bai, Quan
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2022, 13629 : 339 - 350
  • [19] Multiscale Chiral Synthesis via Self-Assembly of Achiral Nanoparticles for Next-Generation Chiral Material-Based Applications: The Role of Metal Ions as Chirality Messengers
    Jung, I. L. John
    Lee, Jaewon
    Kwon, Junyoung
    Park, Ki Hyun
    Jung, Wookjin
    Yeom, Jihyeon
    ACS APPLIED NANO MATERIALS, 2023, 6 (21) : 19632 - 19638
  • [20] A novel mutation-proof, next-generation vaccine to fight against upcoming SARS-CoV-2 variants and subvariants, designed through AI enabled approaches and tools, along with the machine learning based immune simulation: A vaccine breakthrough
    Bhattacharya, Manojit
    Alshammari, Abdulrahman
    Alharbi, Metab
    Dhama, Kuldeep
    Lee, Sang -Soo
    Chakraborty, Chiranjib
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 242