Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR

被引：0

作者：

Maekaku, Takashi ^{[1
]}

Fujita, Yuya ^{[1
]}

Peng, Yifan ^{[2
]}

Watanabe, Shinji ^{[2
]}

机构：

[1] Yahoo Japan Corporation, Tokyo, Japan

[2] Carnegie Mellon University, PA, United States

来源：

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | 2022年 / 2022-September卷

关键词：

751.5; Speech;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

引用

页码：1071 / 1075

共 50 条

[21] Transformer-based Long-context End-to-end Speech Recognition
Hori, Takaaki
Moritz, Niko
Hori, Chiori
Le Roux, Jonathan
INTERSPEECH 2020, 2020, : 5011 - 5015
[22] On-device Streaming Transformer-based End-to-End Speech Recognition
Oh, Yoo Rhee
Park, Kiyoung
INTERSPEECH 2021, 2021, : 967 - 968
[23] An Investigation of Positional Encoding in Transformer-based End-to-end Speech Recognition
Yue, Fengpeng
Ko, Tom
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
[24] UNSUPERVISED SPEAKER ADAPTATION USING ATTENTION-BASED SPEAKER MEMORY FOR END-TO-END ASR
Sari, Leda
Moritz, Niko
Hori, Takaaki
Le Roux, Jonathan
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7384 - 7388
[25] Non-autoregressive Deliberation-Attention based End-to-End ASR
Gao, Changfeng
Cheng, Gaofeng
Zhou, Jun
Zhang, Pengyuan
Yan, Yonghong
2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
[26] ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture
Cheng, Gaofeng
Miao, Haoran
Yang, Runyan
Deng, Keqi
Yan, Yonghong
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1360 - 1373
[27] End-to-End Transformer-Based Open-Vocabulary Keyword Spotting with Location-Guided Local Attention
Wei, Bo
Yang, Meirong
Zhang, Tao
Tang, Xiao
Huang, Xing
Kim, Kyuhong
Lee, Jaeyun
Cho, Kiho
Park, Sung-Un
INTERSPEECH 2021, 2021, : 361 - 365
[28] End-to-End ASR with Adaptive Span Self-Attention
Chang, Xuankai
Subramanian, Aswin Shanmugam
Guo, Pengcheng
Watanabe, Shinji
Fujita, Yuya
Omachi, Motoi
INTERSPEECH 2020, 2020, : 3595 - 3599
[29] A study of transformer-based end-to-end speech recognition system for Kazakh language
Mamyrbayev Orken
Oralbekova Dina
Alimhan Keylan
Turdalykyzy Tolganay
Othman Mohamed
Scientific Reports, 12
[30] TMSS: An End-to-End Transformer-Based Multimodal Network for Segmentation and Survival Prediction
Saeed, Numan
Sobirov, Ikboljon
Al Majzoub, Roba
Yaqub, Mohammad
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VII, 2022, 13437 : 319 - 329

← 1 2 3 4 5 →