Attention-based video object segmentation algorithm

Citations: 0

Authors:

Cao, Ying [1]

Sun, Lijuan [2,3]

Han, Chong [2,3]

Guo, Jian [2,3]

Affiliations:

[1] Henan Univ, Kaifeng, Henan, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China

[3] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing, Peoples R China

Source:

IET IMAGE PROCESSING | 2021 / Vol. 15 / Issue 08

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

10.1049/ipr2.12135

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

To improve segmentation performance on videos with large object motion or deformation, a novel two-branch scheme is proposed. In one branch, an attention mechanism is first used to highlight object-related features. Then, to account for the temporal coherence of videos, Conv3D is integrated to capture short-term temporal features, and a purpose-designed attention residual convolutional long short-term memory is adopted to capture the long-short-term temporal information of objects despite interference from redundant video frames. Meanwhile, to counter the negative effect of background motion, the other branch introduces an optical flow-based prediction model that predicts object regions in subsequent video frames from the annotated initial frame. Finally, based on the fused results of the two branches, global thresholds and a noisy-area cleaning method are employed to obtain the segmented objects. Experiments on DAVIS2016 and CDnet2014 show that the proposed scheme achieves competitive performance.
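The final step named in the abstract (fusing the two branch outputs, applying a global threshold, and cleaning noisy areas) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the fusion rule, the threshold values, or the cleaning criterion, so the plain averaging, the 0.5 threshold, the minimum-area rule, 4-connectivity, and the function name `fuse_and_clean` are all assumptions made for illustration.

```python
from collections import deque


def fuse_and_clean(attn_prob, flow_prob, threshold=0.5, min_area=4):
    """Fuse two per-pixel probability maps, binarize, and drop small regions.

    attn_prob, flow_prob: equally sized 2-D lists of floats in [0, 1],
    standing in for the outputs of the two branches.
    The averaging rule, threshold, and min_area are illustrative assumptions.
    """
    h, w = len(attn_prob), len(attn_prob[0])

    # Assumed fusion rule: plain average, then one global threshold.
    mask = [
        [1 if (attn_prob[y][x] + flow_prob[y][x]) / 2 >= threshold else 0
         for x in range(w)]
        for y in range(h)
    ]

    seen = [[False] * w for _ in range(h)]
    for y0 in range(h):
        for x0 in range(w):
            if mask[y0][x0] == 0 or seen[y0][x0]:
                continue
            # Collect one 4-connected foreground region with breadth-first search.
            region, queue = [], deque([(y0, x0)])
            seen[y0][x0] = True
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and mask[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Assumed cleaning rule: regions below min_area count as noise.
            if len(region) < min_area:
                for y, x in region:
                    mask[y][x] = 0
    return mask
```

In this sketch a region survives only if both the fused score and its area pass their respective thresholds, so an isolated high-confidence pixel is discarded while a compact block of foreground pixels is kept.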


Pages: 1668-1678

Page count: 11
