A Graph-Transformer Network for Scene Text Detection

Cited by: 0
Authors
Wu, Yongrong [1]
Lin, Jingyu [1]
Chen, Houjin [1]
Chen, Dinghao [1]
Yang, Lvqing [1]
Xiahou, Jianbing [2]
Affiliations
[1] Xiamen Univ, Sch Informat, Xiamen 361000, Peoples R China
[2] Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China
Keywords
Scene Text Detection; Transformer; Graph Convolutional Network
DOI
10.1007/978-981-99-4761-4_57
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Detecting text in natural images with varying orientations and shapes is challenging. Existing detectors often fail with text instances having extreme aspect ratios. This paper introduces GTNet, a Graph-Transformer network for scene text detection. GTNet uses a Graph-based Shared Feature Learning Module (GSFL) for feature extraction and a Transformer-based Regression Module (TRM) for bounding box prediction. Our architecture offers a flexible receptive field, combining global attention and local features for enhanced text representation. Extensive experiments show our method surpasses existing detectors in accuracy and effectiveness.
Pages: 680-690
Number of pages: 11