Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News

Cited by: 27
Authors
Xie, Lei [1]
Zheng, Lilei [1]
Liu, Zihan [2]
Zhang, Yanning [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[2] City Univ Hong Kong, Sch Creat Media, Hong Kong, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Laplacian Eigenmaps (LE); spoken document retrieval; story segmentation; topic segmentation; IMAGE SEGMENTATION; TEXT SEGMENTATION; SPEECH; ALGORITHM; PROSODY; CUES;
DOI
10.1109/TASL.2011.2160853
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose Laplacian Eigenmaps (LE)-based approaches to automatic story segmentation on speech recognition transcripts of broadcast news (BN). We reinforce story boundaries by applying LE analysis to the sentence connective strength matrix, which reveals the intrinsic geometric structure of stories. Specifically, we construct a Euclidean space in which each sentence is mapped to a vector. As a result, the original inter-sentence connective strength is reflected by the Euclidean distances between the corresponding vectors, and cohesive relations between sentences become geometrically evident. Taking advantage of LE, we present three story segmentation approaches: LE-TextTiling, spectral clustering, and LE-DP. In LE-DP, we formalize story segmentation as a straightforward criterion minimization problem and give a fast dynamic programming solution to it. Extensive story segmentation experiments on three corpora demonstrate that the proposed LE-based approaches significantly outperform several state-of-the-art methods. For instance, LE-TextTiling obtains a relative F1-measure increase of 17.8% over conventional TextTiling on the CCTV Mandarin BN corpus; on the TDT2 Mandarin BN corpus, LE-DP achieves an F1-measure of 0.7460, significantly outperforming a recent CRF-prosody approach with an F1-measure of 0.6783.
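As a rough illustration of the pipeline the abstract describes, the sketch below embeds sentences with Laplacian Eigenmaps and then segments them with dynamic programming. It is not the authors' implementation. The abstract does not state the LE-DP criterion, so the sketch assumes a within-segment sum of squared Euclidean distances to the segment mean, and it assumes the number of stories is known in advance. The function names `laplacian_eigenmaps` and `segment_dp`, and the toy matrix `W`, are hypothetical.

```python
import numpy as np

def laplacian_eigenmaps(W, k):
    """Map each sentence to a k-dim vector from a symmetric connective
    strength matrix W (assumed non-negative with positive row sums)."""
    W = np.asarray(W, dtype=float)
    d_isqrt = 1.0 / np.sqrt(W.sum(axis=1))
    # Symmetric normalized Laplacian: I - D^{-1/2} W D^{-1/2}.
    L_sym = np.eye(len(W)) - d_isqrt[:, None] * W * d_isqrt[None, :]
    _, U = np.linalg.eigh(L_sym)  # eigenvalues in ascending order
    # Rescaling by D^{-1/2} gives solutions of L v = lambda D v;
    # the first (trivial, constant) eigenvector is skipped.
    return d_isqrt[:, None] * U[:, 1:k + 1]

def segment_dp(X, m):
    """Split rows of X into m contiguous segments minimizing the assumed
    criterion; returns the start indices of segments 2..m."""
    n = len(X)
    # Prefix sums make each segment cost an O(1) lookup.
    s1 = np.vstack([np.zeros(X.shape[1]), np.cumsum(X, axis=0)])
    s2 = np.concatenate([[0.0], np.cumsum((X ** 2).sum(axis=1))])

    def cost(i, j):  # squared deviation of rows i..j-1 from their mean
        seg = s1[j] - s1[i]
        return s2[j] - s2[i] - seg @ seg / (j - i)

    best = np.full((m + 1, n + 1), np.inf)
    back = np.zeros((m + 1, n + 1), dtype=int)
    best[0, 0] = 0.0
    for s in range(1, m + 1):
        for j in range(s, n + 1):
            for i in range(s - 1, j):
                c = best[s - 1, i] + cost(i, j)
                if c < best[s, j]:
                    best[s, j], back[s, j] = c, i
    # Trace the boundaries back from the last sentence.
    bounds, j = [], n
    for s in range(m, 0, -1):
        j = int(back[s, j])
        bounds.append(j)
    return sorted(bounds[:-1])  # drop the leading 0

# Toy data (assumption): two stories of three sentences each, with strong
# links inside a story and weak links across stories.
W = np.full((6, 6), 0.05)
W[:3, :3] = 1.0
W[3:, 3:] = 1.0
np.fill_diagonal(W, 0.0)
boundaries = segment_dp(laplacian_eigenmaps(W, 1), 2)
```

In this toy case the embedding places sentences of the same story at nearly the same coordinate, which is the sense in which cohesion becomes "geometrically evident"; the triple loop costs O(m n^2) and is meant only to show the recurrence.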
Pages: 276-289
Page count: 14
Related papers
50 in total
  • [41] Improving broadcast news segmentation processing
    Boykin, S
    Merlino, A
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 744 - 749
  • [42] Maximum entropy segmentation of broadcast news
    Christensen, H
    Kolluru, BK
    Gotoh, Y
    Renals, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1029 - 1032
  • [43] Automatic language identification in broadcast news
    Backfried, G
    Rainoldi, R
    Riedler, J
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1406 - 1410
  • [44] Automatic categorization design for broadcast news
    Luo, HT
    Huang, Q
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2002, 2002, 4676 : 285 - 295
  • [45] Automatic transcription of Broadcast News data
    Pallett, DS
    Lamel, L
    SPEECH COMMUNICATION, 2002, 37 (1-2) : 1 - 2
  • [46] Story segmentation in news video
    Feng, HM
    Zhai, XF
    Fan, JW
    Fang, Y
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 831 - 835
  • [47] News video story segmentation
    Fang, Yong
    Zhai, Xiaofei
    Fan, Jingwang
    12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS, 2006, : 397 - 400
  • [48] Broadcast news segmentation by audio type analysis
    Nwe, TL
    Li, HZ
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1065 - 1068
  • [49] Exploring the Structure of Broadcast News for Topic Segmentation
    Amaral, Rui
    Trancoso, Isabel
    HUMAN LANGUAGE TECHNOLOGY: CHALLENGES OF THE INFORMATION SOCIETY, 2009, 5603 : 1 - 12
  • [50] Story co-segmentation of Chinese broadcast news using weakly-supervised semantic similarity
    Feng, Wei
    Nie, Xuecheng
    Zhang, Yujun
    Liu, Zhi-Qiang
    Dang, Jianwu
    NEUROCOMPUTING, 2019, 355 : 121 - 133