Attention-based video object segmentation algorithm

被引：0

作者：

Cao, Ying ^{[1
]}

Sun, Lijuan ^{[2
,3
]}

Han, Chong ^{[2
,3
]}

Guo, Jian ^{[2
,3
]}

机构：

[1] Henan Univ, Kaifeng, Henan, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China

[3] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing, Peoples R China

来源：

IET IMAGE PROCESSING | 2021年 / 15卷 / 08期

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1049/ipr2.12135

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To improve the segmentation performance on videos with large object motion or deformation, a novel scheme is proposed which has two branches. In one branch, the attention mechanism is first utilized to highlight objects-related features. Then, to well consider the temporal coherence of videos, Conv3D is integrated to capture short-term temporal features, and the designed attention residual convolutional long-short-term memory is adopted to capture the long-short-term temporal information of objects under the interference of redundant video frames. Meanwhile, considering the negative effect of background motion, in another branch, the optical flow-based prediction model is introduced to predict objects regions in subsequent video frames with the annotated initial frame. At last, based on the fused results of two branches, the global thresholds and noising area clean method are employed to obtain segmented objects. The experiments on DAVIS2016 and CDnet2014 exhibit the competitive performance of the proposed scheme.

引用

页码：1668 / 1678

页数：11

共 50 条

[31] A video object segmentation algorithm based on temporal edge memory compensation
Zhu, Shi-Ping
Ma, Li
Hou, Yang-Shuan
Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2010, 21 (08): : 1241 - 1246
[32] A STEREO VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON MOTION DETECTION AND DISPARITY
Wang, Lingyun
Li, Zhaohui
Li, Dongmei
PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012), 2012, : 356 - 359
[33] Attention-Based Multimodal Fusion for Video Description
Hori, Chiori
Hori, Takaaki
Lee, Teng-Yok
Zhang, Ziming
Harsham, Bret
Hershey, John R.
Marks, Tim K.
Sumi, Kazuhiko
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4203 - 4212
[34] A VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON THE FEATURE LEARNING AND SHAPE TRACKING
Lee, Sang Hak
Koo, Hyung Il
Cho, Nam Ik
2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 4673 - 4676
[35] Automatic video object segmentation based on clone algorithm and fuzzy mathematics
Zhang Guangyu
Gong Guangzhen
Zhu Weile
CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03): : 482 - 486
[36] Attention-based Video Virtual Try-On
Tsai, Wen-Jiin
Tien, Yi-Cheng
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 209 - 216
[37] Attention-based interpolation network for video deblurring
Zhang, Xiaoqin
Jiang, Runhua
Wang, Tao
Huang, Pengcheng
Zhao, Li
NEUROCOMPUTING, 2021, 453 : 865 - 875
[38] Describing Video With Attention-Based Bidirectional LSTM
Bin, Yi
Yang, Yang
Shen, Fumin
Xie, Ning
Shen, Heng Tao
Li, Xuelong
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (07) : 2631 - 2641
[39] The effect of using video title in attention-based video summarization
Li, Changwei
Yeh, Zhi-Ting
Gunuganti, Jeshmitha
Chang, Jia-Bin
Norouzi, Mehdi
2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
[40] Residual attention-based LSTM for video captioning
Xiangpeng Li
Zhilong Zhou
Lijiang Chen
Lianli Gao
World Wide Web, 2019, 22 : 621 - 636

← 1 2 3 4 5 →