Cross-Domain Object Detection Algorithm for Complex End-to-End Scene Understanding

被引:0
|
作者
Chen, Aoran [1 ]
Huang, Hai [1 ]
Zhu, Yueyan [1 ]
Xue, Junsheng [1 ]
机构
[1] School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing,100876, China
关键词
Computer vision - Convolutional neural networks - Image reconstruction - Multilayer neural networks - Object detection - Object recognition;
D O I
10.13190/j.jbupt.2023-285
中图分类号
学科分类号
摘要
Conventional deep learning training approaches often assume a similarity between the deployment scenario and the visual domain features present in the training data. However, this assumption might not hold true in complex end-to-end scenarios, making it difficult to meet the demands of intelligent detection services in open environments. In response, an object detection algorithm based on artificial intelligence closed-loop ensemble theory with cross-domain capabilities has been introduced. Within the detection framework, construct a backbone network and bottleneck layer network with multiscale convolutional layers. A visual domain discriminator featuring long-range dependency attention works as a secondary detection head to refine the results. Moreover, a background focusing module, based on spatial reconstruction attention units, is able to enhance learning focused on pseudo-background representations, thereby improving the accuracy of cross-domain object detection. Experimental results show that, compared to two-stage algorithms, the proposed algorithm yields an average precision increase 6.9%, and surpasses single-stage algorithms by 9.0% in complex end-to-end scenarios. © 2024 Beijing University of Posts and Telecommunications. All rights reserved.
引用
收藏
页码:57 / 62
相关论文
共 50 条
  • [21] Scene Recognition via Object-to-Scene Class Conversion: End-to-End Training
    Seong, Hongje
    Hyun, Junhyuk
    Chang, Hyunbae
    Lee, Suhyeon
    Woo, Suhan
    Kim, Euntai
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [22] Selective transfer subspace learning for small-footprint end-to-end cross-domain keyword spotting
    Ma, Fei
    Wang, Chengliang
    Li, Xusheng
    Zeng, Zhuo
    SPEECH COMMUNICATION, 2024, 156
  • [23] Modeling and Experimental Evaluation of End-to-End Delay Jitter for Cross-domain Interconnection in SD-TSN
    Zhang, Xiaodong
    Shou, Guochu
    Xue, Junli
    2024 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2024,
  • [24] Domain generalization improves end-to-end object detection for real-time surgical tool detection
    Wolfgang Reiter
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 939 - 944
  • [25] End-to-End Object Detection with Fully Convolutional Network
    Wang, Jianfeng
    Song, Lin
    Li, Zeming
    Sun, Hongbin
    Sun, Jian
    Zheng, Nanning
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15844 - 15853
  • [26] Domain generalization improves end-to-end object detection for real-time surgical tool detection
    Reiter, Wolfgang
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (05) : 939 - 944
  • [27] SRDD: a lightweight end-to-end object detection with transformer
    Zhu, Yuan
    Xia, Qingyuan
    Jin, Wen
    CONNECTION SCIENCE, 2022, 34 (01) : 2448 - 2465
  • [28] Progressive End-to-End Object Detection in Crowded Scenes
    Zheng, Anlin
    Zhang, Yuang
    Zhang, Xiangyu
    Qi, Xiaojuan
    Sun, Jian
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 847 - 856
  • [29] Toward End-to-End Object Detection and Tracking on the Edge
    Tabkhi, Hamed
    SEC 2017: 2017 THE SECOND ACM/IEEE SYMPOSIUM ON EDGE COMPUTING (SEC'17), 2017,
  • [30] Dense Distinct Query for End-to-End Object Detection
    Zhang, Shilong
    Wang, Xinjiang
    Wang, Jiaqi
    Pang, Jiangmiao
    Lyu, Chengqi
    Zhang, Wenwei
    Luo, Ping
    Chen, Kai
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7329 - 7338