Attention-based video object segmentation algorithm

Cited by: 0

Authors:

Cao, Ying [1]

Sun, Lijuan [2,3]

Han, Chong [2,3]

Guo, Jian [2,3]

Affiliations:

[1] Henan Univ, Kaifeng, Henan, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China

[3] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing, Peoples R China

Source:

IET IMAGE PROCESSING | 2021 / Vol. 15 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

10.1049/ipr2.12135

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To improve segmentation performance on videos with large object motion or deformation, a novel two-branch scheme is proposed. In one branch, an attention mechanism is first used to highlight object-related features. Then, to account for the temporal coherence of videos, Conv3D is integrated to capture short-term temporal features, and the designed attention residual convolutional long short-term memory is adopted to capture the long-short-term temporal information of objects under the interference of redundant video frames. Meanwhile, considering the negative effect of background motion, the other branch introduces an optical flow-based prediction model that predicts object regions in subsequent video frames from the annotated initial frame. Finally, based on the fused results of the two branches, global thresholds and a noisy-area cleaning method are employed to obtain the segmented objects. Experiments on DAVIS2016 and CDnet2014 show that the proposed scheme achieves competitive performance.
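
The final step described in the abstract (fusing the two branch outputs, applying a global threshold, and cleaning noisy areas) can be illustrated with a minimal sketch. The abstract does not state the fusion rule, the threshold values, or the cleaning criterion, so everything here is an assumption made for illustration: the weighted-average fusion, the 0.5 threshold, the 4-connected minimum-area filter, and all function and parameter names (`fuse_maps`, `global_threshold`, `remove_small_regions`, `segment`, `weight_a`, `min_area`) are hypothetical and are not taken from the paper.

```python
from collections import deque


def fuse_maps(map_a, map_b, weight_a=0.5):
    """Weighted average of two same-sized probability maps (assumed fusion rule)."""
    return [
        [weight_a * a + (1.0 - weight_a) * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(map_a, map_b)
    ]


def global_threshold(prob_map, threshold=0.5):
    """Binarize a probability map with a single threshold for the whole frame."""
    return [[1 if p >= threshold else 0 for p in row] for row in prob_map]


def remove_small_regions(mask, min_area=2):
    """Keep only 4-connected foreground regions with at least min_area pixels."""
    height = len(mask)
    width = len(mask[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    cleaned = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if mask[y][x] == 0 or seen[y][x]:
                continue
            # Breadth-first search collects one connected region.
            region = []
            queue = deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                region.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Regions below the area limit are treated as noise and dropped.
            if len(region) >= min_area:
                for ry, rx in region:
                    cleaned[ry][rx] = 1
    return cleaned


def segment(map_a, map_b, threshold=0.5, min_area=2):
    """Fuse two branch outputs, threshold globally, then clean small regions."""
    fused = fuse_maps(map_a, map_b)
    return remove_small_regions(global_threshold(fused, threshold), min_area)


# Toy 3x4 outputs standing in for the attention branch and the optical-flow branch.
appearance = [[0.9, 0.8, 0.1, 0.0], [0.9, 0.7, 0.0, 0.0], [0.1, 0.0, 0.0, 0.9]]
flow = [[0.8, 0.9, 0.2, 0.1], [0.7, 0.9, 0.1, 0.0], [0.0, 0.1, 0.0, 0.7]]
result = segment(appearance, flow)
for row in result:
    print(row)
```

In this toy input the isolated high-probability pixel in the bottom-right corner survives thresholding but is discarded by the area filter, while the 2x2 block in the top-left corner is kept as the segmented object.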


Pages: 1668-1678

Page count: 11
