Video Segmentation with Absorbing Markov Chains and Skeleton Mapping

被引：0

作者：

Liang Y. ^{[1
]}

Zhang Y.-Q. ^{[2
]}

Zheng J.-T. ^{[1
]}

Zhang Y. ^{[2
]}

机构：

[1] College of Mathematics and Informatics, South China Agricultural University, Guangzhou

[2] Faculty of Information Technology, Beijing University of Technology, Beijing

来源：

Ruan Jian Xue Bao/Journal of Software | 2024年 / 35卷 / 03期

关键词：

absorbing Markov chain; long-term/short-term spatial-temporal cue; skeleton mapping network; video segmentation;

D O I：

10.13328/j.cnki.jos.006821

中图分类号：

学科分类号：

摘要：

As challenges such as serious occlusions and deformations coexist, video segmentation with accurate robustness has become one of the hot topics in computer vision. This study proposes a video segmentation method with absorbing Markov chains and skeleton mapping, which progressively produces accurate object contours through the process of pre-segmentation—optimization—improvement. In the phase of pre-segmentation, based on the twin network and the region proposal network, the study obtains regions of interest for objects, constructs the absorbing Markov chains of superpixels in these regions, and calculates the labels of foreground/background of the superpixels. The absorbing Markov chains can perceive and propagate the object features flexibly and effectively and preliminarily presegment the target object from the complex scene. In the phase of optimization, the study designs the short-term and long-term spatial-temporal cue models to obtain the short-term variation and the long-term feature of the object, so as to optimize superpixel labels and reduce errors caused by similar objects and noise. In the phase of improvement, to reduce the artifacts and discontinuities of optimization results, this study proposes an automatic generation algorithm for foreground/background skeleton based on superpixel labels and positions and constructs a skeleton mapping network based on encoding and decoding, so as to learn the pixel-level object contour and finally obtain accurate video segmentation results. Many experiments on standard datasets show that the proposed method is superior to the existing mainstream video segmentation methods and can produce segmentation results with higher region similarity and contour accuracy. © 2024 Chinese Academy of Sciences. All rights reserved.

引用

页码：1552 / 1568

页数：16

共 55 条

[1] Tsai YH, Yang MH, Black MJ., Video segmentation via object flow, Proc. of the 2016 IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3899-3908, (2016)
[2] Ding MY, Wang Z, Zhou BL, Shi JP, Lu ZW, Luo P., Every frame counts: Joint learning of video segmentation and optical flow, Proc. of the 2020 AAAI Conf. on Artificial Intelligence, pp. 10713-10720, (2020)
[3] Li XX, Loy CC., Video object segmentation with joint re-identification and attention-aware mask propagation, Proc. of the 15th European Conf. on Computer Vision, pp. 93-110, (2018)
[4] Zheng Y, Chen YD, Hao CY., Video object segmentation algorithm based on consistent features, Journal of Image and Graphics, 25, 8, pp. 1558-1566, (2020)
[5] Hao CY, Chen YD, Yang ZX, Wu EH., Higher-order potentials for video object segmentation in bilateral space, Neurocomputing, 401, pp. 28-35, (2020)
[6] Liu ZY, Wang L, Hua G, Zhang QL, Niu ZX, Wu Y, Zheng NN., Joint video object discovery and segmentation by coupled dynamic Markov networks, IEEE Trans. on Image Processing, 27, 12, pp. 5840-5853, (2018)
[7] Chen X, Li ZX, Yuan Y, Yu G, Shen JX, Qi DL., State-aware tracker for real-time video object segmentation, Proc. of the 2020 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 9381-9390, (2020)
[8] Cheng JC, Tsai YH, Hung WC, Wang SJ, Yang MH., Fast and accurate online video object segmentation via tracking parts, Proc. of the 2018 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 7415-7424, (2018)
[9] Wang Q, Zhang L, Bertinetto L, Hu WM, Torr PHS., Fast online object tracking and segmentation: A unifying approach, Proc. of the 2019 IEEE/CVF Conf. on Computer Vision and Pattern Recognition, pp. 1328-1338, (2019)
[10] Liu MH, Wang CS, Hu Q, Wang CX, Cui XH., Part-based object tracking based on multi collaborative model, Ruan Jian Xue Bao/Journal of Software, 31, 2, pp. 511-530, (2020)

← 1 2 3 4 5 6 →