LOW-RESOURCE CONTEXTUAL TOPIC IDENTIFICATION ON SPEECH

Cited by: 0

Authors:

Liu, Chunxi [1]
Wiesner, Matthew [1]
Watanabe, Shinji [1]
Harman, Craig [1]
Trmal, Jan [1,2]
Dehak, Najim [1]
Khudanpur, Sanjeev [1,2]

Affiliations:

[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA

[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA

Source:

2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018) | 2018

Keywords:

Topic identification; universal acoustic modeling; recurrent neural networks; attention

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In topic identification (topic ID) on real-world unstructured audio, an audio instance of variable topic shifts is first broken into sequential segments, and each segment is independently classified. We first present a general purpose method for topic ID on spoken segments in low-resource languages, using a cascade of universal acoustic modeling, translation lexicons to English, and English-language topic classification. Next, instead of classifying each segment independently, we demonstrate that exploring the contextual dependencies across sequential segments can provide large improvements. In particular, we propose an attention-based contextual model which is able to leverage the contexts in a selective manner. We test both our contextual and non-contextual models on four LORELEI languages, and on all but one our attention-based contextual model significantly outperforms the context-independent models.
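The idea of using neighbouring segments selectively can be illustrated with a small sketch. This is not the authors' architecture (the keywords indicate recurrent neural networks, which are not reproduced here); it is a generic dot-product attention over nearby segment vectors, assuming each segment has already been turned into a fixed-length feature vector. The function names, the `window` parameter, and the toy vectors are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def attend_to_context(segments, index, window=2):
    """Return (weights, context_vector) for segments[index].

    Hypothetical helper: scores each neighbouring segment within `window`
    positions by its dot product with the current segment, then averages
    the neighbours using the softmax of those scores.
    """
    query = segments[index]
    lo = max(0, index - window)
    hi = min(len(segments), index + window + 1)
    neighbours = [segments[j] for j in range(lo, hi) if j != index]
    if not neighbours:
        # No context available: fall back to an all-zero context vector.
        return [], [0.0] * len(query)
    weights = softmax([dot(query, n) for n in neighbours])
    context = [
        sum(w * n[d] for w, n in zip(weights, neighbours))
        for d in range(len(query))
    ]
    return weights, context


def contextual_features(segments, window=2):
    """Concatenate each segment vector with its attended context vector."""
    return [seg + attend_to_context(segments, i, window)[1]
            for i, seg in enumerate(segments)]


# Toy example: three segments with 2-dimensional features (assumed values).
toy_segments = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
toy_weights, toy_context = attend_to_context(toy_segments, 0)
```

In this toy example the first segment puts more weight on the second segment, which resembles it, than on the third. A downstream classifier would then see the concatenated vectors from `contextual_features` rather than each segment in isolation.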

Pages: 656-663

Number of pages: 8
