Self-Attention-Based Edge Computing Model for Synthesis Image to Text through Next-Generation AI Mechanism

Cited by: 2

Authors:

Alshehri, Hamdan Ali [1]

Junath, N. [2]

Panwar, Poonam [3]

Shukla, Kirti [4]

Rahin, Saima Ahmed [5]

Martin, R. John [1]

Affiliations:

[1] Jazan Univ, Fac Comp Sci & Informat Technol, Jizan, Saudi Arabia

[2] Univ Technol & Appl Sci, Informat Technol, Ibri, Oman

[3] Chitkara Univ, Inst Engn & Technol, Chandigarh, Punjab, India

[4] Galgotias Univ, Noida, India

[5] United Int Univ, Dhaka, Bangladesh

Source:

MATHEMATICAL PROBLEMS IN ENGINEERING | Year 2022 / Volume 2022

Keywords:

ADVERSARIAL NETWORK; SEGMENTATION;

DOI:

10.1155/2022/4973535

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08;

Abstract:

Image synthesis from natural language descriptions has become a research hotspot in edge computing and artificial intelligence. With the help of generative adversarial networks (GANs) in edge computing, the field has made great strides in high-resolution image synthesis. However, synthetic single-target images still show defects in authenticity: for example, synthesized bird images can exhibit anomalies such as "multiple heads" and "multiple mouths." To address these problems, SA-AttnGAN, a single-target text-to-image model based on a self-attention mechanism, is proposed. SA-AttnGAN (AttnGAN: Attentional Generative Adversarial Network) refines text features into word features and sentence features to improve the semantic alignment of text and images; in the initialization stage of AttnGAN, the self-attention mechanism is used to improve the stability of the text-to-image model; and multistage GANs are stacked to synthesize the final high-resolution images. Experimental data show that SA-AttnGAN outperforms other comparable models in terms of Inception Score and Fréchet Inception Distance. Analysis of the synthetic images shows that the model learns background and colour information and correctly captures the structural information of bird heads, mouths, and other components, improving on the incorrect images, such as "multiple heads" and "multiple mouths," generated by the AttnGAN model. Furthermore, SA-AttnGAN is successfully applied to description-based clothing image synthesis, where it shows good generalization ability.
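
The abstract does not give the exact form of the self-attention mechanism, so the following is only a minimal sketch of generic dot-product self-attention over the spatial positions of a feature map, with a residual scale in the style commonly used in self-attention GANs. The function name `self_attention`, the projection matrices `w_q`, `w_k`, `w_v`, and the scalar `gamma` are illustrative assumptions, not names or details taken from the paper.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def self_attention(features, w_q, w_k, w_v, gamma=0.0):
    """Hypothetical self-attention layer (an assumption, not the paper's code).

    features: N x C matrix, one row per spatial position of a feature map.
    w_q, w_k: C x D projection matrices; w_v: C x C projection matrix.
    gamma: residual scale; gamma = 0 leaves the input unchanged.
    Returns (output, attention), where attention is N x N.
    """
    q = matmul(features, w_q)
    k = matmul(features, w_k)
    v = matmul(features, w_v)
    # Each position scores every other position, so distant regions
    # of the image can influence one another.
    scores = matmul(q, transpose(k))
    attention = [softmax(row) for row in scores]
    context = matmul(attention, v)
    output = [
        [x + gamma * c for x, c in zip(f_row, c_row)]
        for f_row, c_row in zip(features, context)
    ]
    return output, attention


# Toy input: 3 spatial positions, 2 channels, identity projections.
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
output, attention = self_attention(features, identity, identity, identity, gamma=0.5)
```

Because every position attends to all others, a layer of this kind gives the generator access to global structure, which is one plausible reading of why the abstract associates self-attention with fewer duplicated parts such as "multiple heads."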

Pages: 12