Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News

被引:27
|
作者
Xie, Lei [1 ]
Zheng, Lilei [1 ]
Liu, Zihan [2 ]
Zhang, Yanning [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] City Univ Hong Kong, Sch Creat Media, Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Laplacian Eigenmaps (LE); spoken document retrieval; story segmentation; topic segmentation; IMAGE SEGMENTATION; TEXT SEGMENTATION; SPEECH; ALGORITHM; PROSODY; CUES;
D O I
10.1109/TASL.2011.2160853
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose Laplacian Eigenmaps (LE)-based approaches to automatic story segmentation on speech recognition transcripts of broadcast news. We reinforce story boundaries by applying LE analysis to sentence connective strength matrix and reveal the intrinsic geometric structure of stories. Specifically, we construct a Euclidean space in which each sentence is mapped to a vector. As a result, the original inter-sentence connective strength is reflected by the Euclidean distances between the corresponding vectors and cohesive relations between sentences become geometrically evident. Taking advantage of LE, we present three story segmentation approaches: LE-TextTiling, spectral clustering and LE-DP. In LE-DP, we formalize story segmentation as a straightforward criterion minimization problem and give a fast dynamic programming solution to it. Extensive story segmentation experiments on three corpora demonstrate that the proposed LE-based approaches achieve superior performances and significantly outperform several state-of-the-art methods. For instance, LE-TextTiling obtains a relative F1-measure increase of 17.8% on CCTV Mandarin BN corpus as compared to conventional TextTiling; LE-DP achieves a high F1-measure of 0.7460, which significantly outperforms a recent CRF-prosody approach with an F1-measure of 0.6783 on TDT2 Mandarin BN corpus.
引用
收藏
页码:276 / 289
页数:14
相关论文
共 50 条
  • [1] Subword Lexical Chaining for Automatic Story Segmentation in Chinese Broadcast News
    Xie, Lei
    Yang, Yulian
    Zeng, Jia
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 248 - +
  • [2] Story Segmentation in TV News Broadcast
    Kannao, Raghvendra
    Guha, Prithwijit
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2948 - 2953
  • [3] Multi-scale TextTiling for automatic story segmentation in Chinese broadcast news
    Xie, Lei
    Zeng, Jia
    Feng, Wei
    INFORMATION RETRIEVAL TECHNOLOGY, 2008, 4993 : 345 - +
  • [4] A Subword Normalized Cut Approach to Automatic Story Segmentation of Chinese Broadcast News
    Zhang, Jin
    Xie, Lei
    Feng, Wei
    Zhang, Yanning
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2009, 5839 : 136 - +
  • [5] Broadcast news navigation using story segmentation
    Merlino, A
    Morey, D
    Maybury, M
    ACM MULTIMEDIA 97, PROCEEDINGS, 1997, : 381 - 391
  • [6] Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news
    Xie, Lei
    MULTIMEDIA SYSTEMS, 2008, 14 (04) : 237 - 253
  • [7] Discovering salient prosodic cues and their interactions for automatic story segmentation in Mandarin broadcast news
    Lei Xie
    Multimedia Systems, 2008, 14 : 237 - 253
  • [8] Unsupervised story segmentation and indexing of broadcast news video
    Pranabjyoti Haloi
    M.K. Bhuyan
    Dibyajyoti Chatterjee
    Pooja Rani Borah
    Multimedia Tools and Applications, 2023, 82 : 8645 - 8664
  • [9] Story segmentation and detection of commercials in broadcast news video
    Hauptmann, AG
    Witbrock, MJ
    IEEE INTERNATIONAL FORUM ON RESEARCH AND TECHNOLOGY ADVANCES IN DIGITAL LIBRARIES -ADL'98-, PROCEEDINGS, 1998, : 168 - 179
  • [10] Unsupervised story segmentation and indexing of broadcast news video
    Haloi, Pranabjyoti
    Bhuyan, M. K.
    Chatterjee, Dibyajyoti
    Borah, Pooja Rani
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 8645 - 8664