HubNet: An E2E Model for Wheel Hub Text Detection and Recognition Using Global and Local Features

被引:1
|
作者
Zeng, Yue [1 ]
Meng, Cai [1 ]
机构
[1] Beihang Univ, Image Proc Ctr, Sch Astronaut, Beijing 100191, Peoples R China
关键词
deep learning; wheel hub text; text detection; text recognition;
D O I
10.3390/s24196183
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Automatic detection and recognition of wheel hub text, which can boost the efficiency and accuracy of product information recording, are undermined by the obscurity and orientation variability of text on wheel hubs. To address these issues, this paper constructs a wheel hub text dataset and proposes a wheel hub text detection and recognition model called HubNet. The dataset captured images on real industrial production line scenes, including 446 images, 934 word instances, and 2947 character instances. HubNet is an end-to-end text detection and recognition model, not only comprising conventional detection and recognition heads but also incorporating a feature cross-fusion module, which improves the accuracy of recognizing wheel hub texts by utilizing both global and local features. Experimental results show that on the wheel hub text dataset, the HubNet achieves an accuracy of 86.5%, a recall of 79.4%, and an F1-score of 0.828, and the feature cross-fusion module increases the accuracy by 2% to 4%. The wheel hub dataset and the HubNet offer a significant reference for automatic detection and recognition of wheel hub text.
引用
收藏
页数:13
相关论文
共 19 条
  • [1] INTERNAL LANGUAGE MODEL PERSONALIZATION OF E2E AUTOMATIC SPEECH RECOGNITION USING RANDOM ENCODER FEATURES
    Stooke, Adam
    Sim, Khe Chai
    Chua, Mason
    Munkhdalai, Tsendsuren
    Strohman, Trevor
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 213 - 220
  • [2] TWD: A New Deep E2E Model for Text Watermark/Caption and Scene Text Detection in Video
    Banerjee, Ayan
    Shivakumara, Palaiahnakote
    Acharya, Parikshit
    Pal, Umapada
    Canet, Josep Llados
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1492 - 1498
  • [3] Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
    Peyser, Cal
    Mavandadi, Sepand
    Sainath, Tara N.
    Apfel, James
    Pang, Ruoming
    Kumar, Shankar
    INTERSPEECH 2020, 2020, : 4921 - 4925
  • [4] Streaming Intended Query Detection using E2E Modeling for Continued Conversation
    Chang, Shuo-yiin
    Prakash, Guru
    Wu, Zelin
    Liang, Qiao
    Sainath, Tara N.
    Li, Bo
    Stambler, Adam
    Upadhyay, Shyam
    Faruqui, Manaal
    Strohman, Trevor
    INTERSPEECH 2022, 2022, : 1826 - 1830
  • [5] Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting
    Kashiwagi, Yosuke
    Futami, Hayato
    Tsunoo, Emiru
    Arora, Siddhant
    Watanabe, Shinji
    INTERSPEECH 2024, 2024, : 2900 - 2904
  • [6] Detection of Anomalous e2e Encrypted Function Invocation in FaaS using Zero-Knowledge Proofs
    Andreotti, Davide
    Verticale, Giacomo
    2024 IEEE 10TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT 2024, 2024, : 175 - 179
  • [7] VEHICLE MAKE AND MODEL RECOGNITION USING LOCAL FEATURES AND LOGO DETECTION
    Tafazzoli, Faezeh
    Frigui, Hichem
    2016 INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC), 2016, : 353 - 358
  • [8] Exploring ecosystem effects of underwater noise in the nordic seas, using the NoBa-Atlantis E2E model
    Skartsaeterhagen, Maria
    Hansen, Cecilie
    Fulton, Elizabeth A.
    ECOLOGICAL MODELLING, 2024, 492
  • [9] Obfuscated Malware Detection Using Deep Generative Model based on Global/Local Features
    Kim, Jin-Young
    Cho, Sung-Bae
    COMPUTERS & SECURITY, 2022, 112
  • [10] Obfuscated Malware Detection Using Deep Generative Model based on Global/Local Features
    Kim, Jin-Young
    Cho, Sung-Bae
    Computers and Security, 2022, 112