Soft video parsing by label distribution learning

被引:0
|
作者
Miaogen Ling
Xin Geng
机构
[1] Southeast University,Department of Computer Science and Engineering
来源
关键词
video parsing; label distribution learning; subactions; graduality;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we tackle the problem of segmenting out a sequence of actions from videos. The videos contain background and actions which are usually composed of ordered sub-actions. We refer the sub-actions and the background as semantic units. Considering the possible overlap between two adjacent semantic units, we propose a bidirectional sliding window method to generate the label distributions for various segments in the video. The label distribution covers a certain number of semantic unit labels, representing the degree to which each label describes the video segment. The mapping from a video segment to its label distribution is then learned by a Label Distribution Learning (LDL) algorithm. Based on the LDL model, a soft video parsing method with segmental regular grammars is proposed to construct a tree structure for the video. Each leaf of the tree stands for a video clip of background or sub-action. The proposed method shows promising results on the THUMOS’14, MSR-II and UCF101 datasets and its computational complexity is much less than the compared state-of-the-art video parsing method.
引用
收藏
页码:302 / 317
页数:15
相关论文
共 50 条
  • [31] Label Distribution Learning with Label Correlations on Local Samples
    Jia, Xiuyi
    Li, Zechao
    Zheng, Xiang
    Li, Weiwei
    Huang, Sheng-Jun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (04) : 1619 - 1631
  • [32] Label Distribution Learning by Maintaining Label Ranking Relation
    Jia, Xiuyi
    Shen, Xiaoxia
    Li, Weiwei
    Lu, Yunan
    Zhu, Jihua
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1695 - 1707
  • [33] Label Distribution Learning with Label-Specific Features
    Ren, Tingting
    Jia, Xiuyi
    Li, Weiwei
    Chen, Lei
    Li, Zechao
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3318 - 3324
  • [34] Partial Multi-Label Learning with Label Distribution
    Xu, Ning
    Liu, Yun-Peng
    Geng, Xin
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 6510 - 6517
  • [35] Label Distribution Learning by Exploiting Fuzzy Label Correlation
    Wang, Jing
    Kou, Zhiqiang
    Jia, Yuheng
    Lv, Jianhui
    Geng, Xin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [36] Cross-Modal learning for Audio-Visual Video Parsing
    Lamba, Jatin
    Abhishek
    Akula, Jayaprakash
    Dabral, Rishabh
    Jyothi, Preethi
    Ramakrishnan, Ganesh
    INTERSPEECH 2021, 2021, : 1937 - 1941
  • [37] Action Parsing-Driven Video Summarization Based on Reinforcement Learning
    Lei, Jie
    Luan, Qiao
    Song, Xinhui
    Liu, Xiao
    Tao, Dapeng
    Song, Mingli
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (07) : 2126 - 2137
  • [38] Label distribution for multimodal machine learning
    REN Yi
    XU Ning
    LING Miaogen
    GENG Xin
    Frontiers of Computer Science, 2022, 16 (01)
  • [39] Label distribution for multimodal machine learning
    Ren, Yi
    Xu, Ning
    Ling, Miaogen
    Geng, Xin
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (01)
  • [40] A label distribution manifold learning algorithm
    Tan, Chao
    Chen, Sheng
    Geng, Xin
    Ji, Genlin
    PATTERN RECOGNITION, 2023, 135