TRAED: Speech audio editing using imperfect transcripts

被引：0

作者：

Masoodian, Masood ^{[1
]}

Rogers, Bill ^{[1
]}

Ware, David ^{[1
]}

McKoy, Sam ^{[1
]}

机构：

[1] Univ Waikato, Dept Comp Sci, Hamilton, New Zealand

来源：

12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS | 2006年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although digital recording, of speech is widespread, and an increasing range of applications allow recording and inclusion of speech data in documents, editing mid retrievol of speech audio remains generally a challenging task. We have previously developed a speech audio editing and browsing application which utilizes imperfect transcripts of speech os a mechanism for text-based editing and retrieval of speech audio documents. This paper presents a second prototype, called TRAED, which enhances the functionality provided by our earlier prototype, and further facilitates the task of speech audio editing and access.

引用

页码：454 / 459

页数：6

共 50 条

[31] Analysis of disfluency in audio and chat transcripts
Denisleam , Sibel
Trausan-Matu, Stefan
2016 20TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2016, : 174 - 179
[32] Audio-Speech Watermarking Using a Channel Equalizer
Shervin Shokri
Mahamod Ismail
Nasharuddin Zainal
Majid Moghaddasi
Wireless Personal Communications, 2017, 95 : 4457 - 4476
[33] How to Talk about Speech and Audio Quality with Speech and Audio People
Raake, Alexander
Waeltermann, Marcel
Wuestenhagen, Ulf
Feiten, Bernhard
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2012, 60 (03): : 147 - 155
[34] DESCRIPTION OF AN AUDIO EDITING SYSTEM USING A COMPUTER MAGNETIC HARD DISK
WEISSER, A
KOMLY, A
SEIDEL, N
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (05): : 378 - &
[35] EDITING DIGITAL AUDIO SIGNALS IN A DIGITAL AUDIO VIDEO SYSTEM
YOUNGQUIST, RJ
SMPTE JOURNAL, 1982, 91 (12): : 1158 - 1160
[36] Affect editing in speech
Shikler, TS
Robinson, P
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 411 - 418
[37] DIGITAL AUDIO EDITING .2.
WATKINSON, J
ELECTRONICS & WIRELESS WORLD, 1986, 93 (1602): : 52 - 54
[38] Embedded coding using a mixed speech and audio coding paradigm
Ramprashad S.A.
International Journal of Speech Technology, 1999, 2 (4) : 359 - 372
[39] Robust audio and speech watermarking using Gaussian and Laplacian modeling
Akhaee, Mohammad Ali
Kalantari, Nima Khademi
Marvasti, Farokh
SIGNAL PROCESSING, 2010, 90 (08) : 2487 - 2497
[40] IMPROVING ACOUSTIC MODELING USING AUDIO-VISUAL SPEECH
Abdelaziz, Ahmed Hussen
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 1081 - 1086

← 1 2 3 4 5 →