Transformer-Based adversarial network for semi-supervised face sketch synthesis

被引:1
|
作者
Shi, Zhihua [1 ]
Wan, Weiguo [1 ]
机构
[1] Jiangxi Univ Finance & Econ, Sch Software & Internet Things Engn, Nanchang 330032, Peoples R China
基金
中国国家自然科学基金;
关键词
Face sketch synthesis; Vision Transformer; Detail extractor; Adaptive window attention; Adversarial network; ATTENTION;
D O I
10.1016/j.jvcir.2024.104204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Face sketch synthesis is a technique utilized to convert real face images into artistic sketches, which holds vast potential within criminal investigation and entertainment. The existing methods usually train the generation models on the paired face photo -sketch datasets, which are challenging to acquire. Moreover, their results usually produce blurring, artifacts, and structural distortion, leading to inferior visual effects. To solve the above issues, we propose a semi -supervised Transformer -based adversarial network for face sketch synthesis, which can be trained on unpaired datasets. In the network, the Transformer encoder structure is modified with the adaptive window attention (AWA) to better extract local and global facial features while minimizing computational complexity. A Transformer -based feature fusion module is used to fuse the extracted features. In addition, a detail extractor module is designed by Laplacian operators to effectively preserve the detail information of the face photo images to the face sketch images. In the detail extractor module, we introduce a mask operation to remove the textures that do not exist in the original face photo images. Experimental results on the CUHK, AR, XM2VTS, and CUFSF datasets showcase the excellent subjective and objective performance of the proposed face sketch synthesis method compared to current state-of-the-art methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Semi-supervised Learning for Face Sketch Synthesis in the Wild
    Chen, Chaofeng
    Liu, Wei
    Tan, Xiao
    Wong, Kwan-Yee K.
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 216 - 231
  • [2] Surgivisor: Transformer-based semi-supervised instrument segmentation for endoscopic surgery
    Wu, Zhiying
    Lau, Chun Yin
    Zhou, Qianang
    Wu, Jinlin
    Wang, Yuxi
    Liu, Qifeng
    Lei, Zhen
    Liu, Hongbin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [3] Staged Sketch-to-Image Synthesis via Semi-supervised Generative Adversarial Networks
    Li, Zeyu
    Deng, Cheng
    Yang, Erkun
    Tao, Dacheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2694 - 2705
  • [4] A Transformer-Based Contrastive Semi-Supervised Learning Framework for Automatic Modulation Recognition
    Kong, Weisi
    Jiao, Xun
    Xu, Yuhua
    Zhang, Bolin
    Yang, Qinghai
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2023, 9 (04) : 950 - 962
  • [5] A semi-supervised image segmentation method based on generative adversarial network
    Nie, Wei
    Gou, Peng
    Liu, Yang
    Zhou, Tianyu
    Xu, Nuo
    Wang, Peng
    Du, QiQi
    IEEE Joint International Information Technology and Artificial Intelligence Conference (ITAIC), 2022, 2022-June : 1217 - 1223
  • [6] Coupled adversarial learning for semi-supervised heterogeneous face recognition
    He, Ran
    Li, Yi
    Wu, Xiang
    Song, Lingxiao
    Chai, Zhenhua
    Wei, Xiaolin
    PATTERN RECOGNITION, 2021, 110
  • [7] Spoof Face Detection Via Semi-Supervised Adversarial Training
    Chen, Chengwei
    Jing, Yaping
    Lu, Xuequan
    Yuan, Wang
    Ma, Lizhuang
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [8] Integrating pseudo labeling with contrastive clustering for transformer-based semi-supervised action recognition
    Li, Nannan
    Huang, Kan
    Wu, Qingtian
    Zhao, Yang
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11177 - 11195
  • [9] Generative adversarial network for semi-supervised image captioning
    Liang, Xu
    Li, Chen
    Tian, Lihua
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 249
  • [10] A semi-supervised framework with generative adversarial network for pansharpening
    Yu-Xuan Wang
    Ting-Zhu Huang
    Ran Ran
    Rui Wen
    Liang-Jian Deng
    Signal, Image and Video Processing, 2025, 19 (6)