Segmenting broadcast news streams using lexical chains

被引:0
|
作者
Stokes, N [1 ]
Carthy, J [1 ]
Smeaton, AF [1 ]
机构
[1] Univ Coll Dublin, Dept Comp Sci, Dublin 2, Ireland
来源
STAIRS 2002, PROCEEDINGS | 2002年 / 78卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a course-grained NLP approach to text segmentation based on the analysis of lexical cohesion within text. Most work in this area has focused on the discovery of textual units that discuss subtopic structure within documents. In contrast our segmentation task requires the discovery of topical units of text i.e. distinct news stories from broadcast news programmes. Our system SeLeCT first builds a set of lexical chains, in order to model the discourse structure of the text. A boundary detector is then used to search for breaking points in this structure indicated by patterns of cohesive strength and weakness within the text. We evaluate this technique on a test set of concatenated CNN news story transcripts and compare it with an established statistical approach to segmentation called TextTiling.
引用
收藏
页码:145 / 154
页数:10
相关论文
共 50 条
  • [11] On the effectiveness of subwords for lexical cohesion based story segmentation of Chinese broadcast news
    Xie, L.
    Yang, Y. -L.
    Liu, Z. -Q.
    INFORMATION SCIENCES, 2011, 181 (13) : 2873 - 2891
  • [12] Broadcast news transcription using HTK
    Woodland, PC
    Gales, MJF
    Pye, D
    Young, SJ
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 719 - 722
  • [13] Using lexical chains for keyword extraction
    Ercan, Gonenc
    Cicekli, Ilyas
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (06) : 1705 - 1714
  • [14] Keyword extraction based on lexical chains for Chinese news web pages
    Hu, Xue-Gang
    Li, Xing-Hua
    Xie, Fei
    Wu, Xin-Dong
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2010, 23 (01): : 45 - 51
  • [15] Broadcast News
    Macnab, Geoffrey
    SIGHT AND SOUND, 2011, 21 (04): : 85 - 85
  • [16] Broadcast news
    Sherwood, RJ
    FORBES, 1999, 164 (08): : 136 - +
  • [17] BROADCAST NEWS
    不详
    NATURE, 1989, 342 (6251) : 722 - 722
  • [18] Broadcast news navigation using story segmentation
    Merlino, A
    Morey, D
    Maybury, M
    ACM MULTIMEDIA 97, PROCEEDINGS, 1997, : 381 - 391
  • [19] Broadcast news
    Bates, ME
    DATABASE, 1997, 20 (06): : 88 - 88
  • [20] Progress in transcription of broadcast news using Byblos
    Nguyen, L
    Matsoukas, S
    Davenport, J
    Kubala, F
    Schwartz, R
    Makhoul, J
    SPEECH COMMUNICATION, 2002, 38 (1-2) : 213 - 230