A Graph-Transformer Network for Scene Text Detection

Authors
Wu, Yongrong [1]
Lin, Jingyu [1]
Chen, Houjin [1]
Chen, Dinghao [1]
Yang, Lvqing [1]
Xiahou, Jianbing [2]
Affiliations
[1] Xiamen Univ, Sch Informat, Xiamen 361000, Peoples R China
[2] Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China
Keywords
Scene Text Detection; Transformer; Graph convolutional network
DOI
10.1007/978-981-99-4761-4_57
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Detecting text in natural images with varying orientations and shapes is challenging. Existing detectors often fail with text instances having extreme aspect ratios. This paper introduces GTNet, a Graph-Transformer network for scene text detection. GTNet uses a Graph-based Shared Feature Learning Module (GSFL) for feature extraction and a Transformer-based Regression Module (TRM) for bounding box prediction. Our architecture offers a flexible receptive field, combining global attention and local features for enhanced text representation. Extensive experiments show our method surpasses existing detectors in accuracy and effectiveness.
Pages: 680-690
Number of pages: 11