Cross-Domain Object Detection Algorithm for Complex End-to-End Scene Understanding

被引：0

作者：

Chen, Aoran ^{[1
]}

Huang, Hai ^{[1
]}

Zhu, Yueyan ^{[1
]}

Xue, Junsheng ^{[1
]}

机构：

[1] School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing,100876, China

来源：

Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications | 2024年 / 47卷 / 04期

关键词：

Computer vision - Convolutional neural networks - Image reconstruction - Multilayer neural networks - Object detection - Object recognition;

D O I：

10.13190/j.jbupt.2023-285

中图分类号：

学科分类号：

摘要：

Conventional deep learning training approaches often assume a similarity between the deployment scenario and the visual domain features present in the training data. However, this assumption might not hold true in complex end-to-end scenarios, making it difficult to meet the demands of intelligent detection services in open environments. In response, an object detection algorithm based on artificial intelligence closed-loop ensemble theory with cross-domain capabilities has been introduced. Within the detection framework, construct a backbone network and bottleneck layer network with multiscale convolutional layers. A visual domain discriminator featuring long-range dependency attention works as a secondary detection head to refine the results. Moreover, a background focusing module, based on spatial reconstruction attention units, is able to enhance learning focused on pseudo-background representations, thereby improving the accuracy of cross-domain object detection. Experimental results show that, compared to two-stage algorithms, the proposed algorithm yields an average precision increase 6.9%, and surpasses single-stage algorithms by 9.0% in complex end-to-end scenarios. © 2024 Beijing University of Posts and Telecommunications. All rights reserved.

引用

页码：57 / 62

共 50 条

[21] Scene Recognition via Object-to-Scene Class Conversion: End-to-End Training
Seong, Hongje
Hyun, Junhyuk
Chang, Hyunbae
Lee, Suhyeon
Woo, Suhan
Kim, Euntai
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[22] Selective transfer subspace learning for small-footprint end-to-end cross-domain keyword spotting
Ma, Fei
Wang, Chengliang
Li, Xusheng
Zeng, Zhuo
SPEECH COMMUNICATION, 2024, 156
[23] Modeling and Experimental Evaluation of End-to-End Delay Jitter for Cross-domain Interconnection in SD-TSN
Zhang, Xiaodong
Shou, Guochu
Xue, Junli
2024 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION, OFC, 2024,
[24] Domain generalization improves end-to-end object detection for real-time surgical tool detection
Wolfgang Reiter
International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 939 - 944
[25] End-to-End Object Detection with Fully Convolutional Network
Wang, Jianfeng
Song, Lin
Li, Zeming
Sun, Hongbin
Sun, Jian
Zheng, Nanning
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15844 - 15853
[26] Domain generalization improves end-to-end object detection for real-time surgical tool detection
Reiter, Wolfgang
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2023, 18 (05) : 939 - 944
[27] SRDD: a lightweight end-to-end object detection with transformer
Zhu, Yuan
Xia, Qingyuan
Jin, Wen
CONNECTION SCIENCE, 2022, 34 (01) : 2448 - 2465
[28] Progressive End-to-End Object Detection in Crowded Scenes
Zheng, Anlin
Zhang, Yuang
Zhang, Xiangyu
Qi, Xiaojuan
Sun, Jian
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 847 - 856
[29] Toward End-to-End Object Detection and Tracking on the Edge
Tabkhi, Hamed
SEC 2017: 2017 THE SECOND ACM/IEEE SYMPOSIUM ON EDGE COMPUTING (SEC'17), 2017,
[30] Dense Distinct Query for End-to-End Object Detection
Zhang, Shilong
Wang, Xinjiang
Wang, Jiaqi
Pang, Jiangmiao
Lyu, Chengqi
Zhang, Wenwei
Luo, Ping
Chen, Kai
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7329 - 7338

← 1 2 3 4 5 →