TRAED: Speech audio editing using imperfect transcripts

被引:0
|
作者
Masoodian, Masood [1 ]
Rogers, Bill [1 ]
Ware, David [1 ]
McKoy, Sam [1 ]
机构
[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand
来源
12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS | 2006年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although digital recording, of speech is widespread, and an increasing range of applications allow recording and inclusion of speech data in documents, editing mid retrievol of speech audio remains generally a challenging task. We have previously developed a speech audio editing and browsing application which utilizes imperfect transcripts of speech os a mechanism for text-based editing and retrieval of speech audio documents. This paper presents a second prototype, called TRAED, which enhances the functionality provided by our earlier prototype, and further facilitates the task of speech audio editing and access.
引用
收藏
页码:454 / 459
页数:6
相关论文
共 50 条
  • [31] Analysis of disfluency in audio and chat transcripts
    Denisleam , Sibel
    Trausan-Matu, Stefan
    2016 20TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2016, : 174 - 179
  • [32] Audio-Speech Watermarking Using a Channel Equalizer
    Shervin Shokri
    Mahamod Ismail
    Nasharuddin Zainal
    Majid Moghaddasi
    Wireless Personal Communications, 2017, 95 : 4457 - 4476
  • [33] How to Talk about Speech and Audio Quality with Speech and Audio People
    Raake, Alexander
    Waeltermann, Marcel
    Wuestenhagen, Ulf
    Feiten, Bernhard
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2012, 60 (03): : 147 - 155
  • [34] DESCRIPTION OF AN AUDIO EDITING SYSTEM USING A COMPUTER MAGNETIC HARD DISK
    WEISSER, A
    KOMLY, A
    SEIDEL, N
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (05): : 378 - &
  • [35] EDITING DIGITAL AUDIO SIGNALS IN A DIGITAL AUDIO VIDEO SYSTEM
    YOUNGQUIST, RJ
    SMPTE JOURNAL, 1982, 91 (12): : 1158 - 1160
  • [36] Affect editing in speech
    Shikler, TS
    Robinson, P
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 411 - 418
  • [37] DIGITAL AUDIO EDITING .2.
    WATKINSON, J
    ELECTRONICS & WIRELESS WORLD, 1986, 93 (1602): : 52 - 54
  • [38] Embedded coding using a mixed speech and audio coding paradigm
    Ramprashad S.A.
    International Journal of Speech Technology, 1999, 2 (4) : 359 - 372
  • [39] Robust audio and speech watermarking using Gaussian and Laplacian modeling
    Akhaee, Mohammad Ali
    Kalantari, Nima Khademi
    Marvasti, Farokh
    SIGNAL PROCESSING, 2010, 90 (08) : 2487 - 2497
  • [40] IMPROVING ACOUSTIC MODELING USING AUDIO-VISUAL SPEECH
    Abdelaziz, Ahmed Hussen
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1081 - 1086