Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News

Cited by: 27
Authors
Xie, Lei [1]
Zheng, Lilei [1]
Liu, Zihan [2]
Zhang, Yanning [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] City Univ Hong Kong, Sch Creat Media, Hong Kong, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Laplacian Eigenmaps (LE); spoken document retrieval; story segmentation; topic segmentation; IMAGE SEGMENTATION; TEXT SEGMENTATION; SPEECH; ALGORITHM; PROSODY; CUES;
DOI
10.1109/TASL.2011.2160853
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose Laplacian Eigenmaps (LE)-based approaches to automatic story segmentation on speech recognition transcripts of broadcast news. We reinforce story boundaries by applying LE analysis to the sentence connective strength matrix, which reveals the intrinsic geometric structure of stories. Specifically, we construct a Euclidean space in which each sentence is mapped to a vector. As a result, the original inter-sentence connective strength is reflected by the Euclidean distances between the corresponding vectors, and cohesive relations between sentences become geometrically evident. Taking advantage of LE, we present three story segmentation approaches: LE-TextTiling, spectral clustering, and LE-DP. In LE-DP, we formalize story segmentation as a straightforward criterion minimization problem and give a fast dynamic programming solution to it. Extensive story segmentation experiments on three corpora demonstrate that the proposed LE-based approaches achieve superior performance and significantly outperform several state-of-the-art methods. For instance, LE-TextTiling obtains a relative F1-measure increase of 17.8% on the CCTV Mandarin BN corpus compared to conventional TextTiling; LE-DP achieves an F1-measure of 0.7460 on the TDT2 Mandarin BN corpus, significantly outperforming a recent CRF-prosody approach with an F1-measure of 0.6783.
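The abstract describes LE-DP as a criterion minimization over sentence vectors solved by dynamic programming, but it does not state the criterion itself. The following is only a minimal sketch of that general idea, under assumptions not taken from the paper: the sentence vectors are already given, the number of stories `k` is known, and the criterion is the within-segment sum of squared Euclidean distances to the segment mean. The names `segment_cost` and `dp_segment` are hypothetical, and the naive cost computation here is not the fast solution the authors refer to.

```python
from typing import List, Sequence


def segment_cost(points: Sequence[Sequence[float]], i: int, j: int) -> float:
    """Assumed criterion: sum of squared distances of points[i:j] to their mean."""
    seg = points[i:j]
    dim = len(seg[0])
    mean = [sum(p[d] for p in seg) / len(seg) for d in range(dim)]
    return sum((p[d] - mean[d]) ** 2 for p in seg for d in range(dim))


def dp_segment(points: Sequence[Sequence[float]], k: int) -> List[int]:
    """Split points into k contiguous segments with minimal total cost.

    Returns the start indices of segments 2..k, i.e. the boundaries.
    """
    n = len(points)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of points")
    inf = float("inf")
    # best[m][j]: minimal cost of splitting points[:j] into m segments.
    best = [[inf] * (n + 1) for _ in range(k + 1)]
    # back[m][j]: start index of the last segment in that optimal split.
    back = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                cost = best[m - 1][i] + segment_cost(points, i, j)
                if cost < best[m][j]:
                    best[m][j] = cost
                    back[m][j] = i
    # Trace back the segment start indices from the last segment to the first.
    starts = []
    j = n
    for m in range(k, 0, -1):
        j = back[m][j]
        starts.append(j)
    starts.reverse()      # starts[0] is always 0
    return starts[1:]     # keep only the internal boundaries


# Toy "sentence vectors" forming three well-separated groups (made-up data).
vectors = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
           (5.0, 5.0), (5.1, 5.0),
           (9.0, 0.0), (9.1, 0.1)]
print(dp_segment(vectors, 3))  # [3, 5]
```

In this toy run the boundaries fall before the fourth and sixth vectors, where the Euclidean distance between neighbouring vectors jumps. That is the kind of geometric separation the abstract attributes to the LE mapping.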
Pages: 276-289
Page count: 14