Streaming Intended Query Detection using E2E Modeling for Continued Conversation

Cited by: 1
Authors
Chang, Shuo-yiin [1]
Prakash, Guru [1]
Wu, Zelin [1]
Liang, Qiao [1]
Sainath, Tara N. [1]
Li, Bo [1]
Stambler, Adam [1]
Upadhyay, Shyam [1]
Faruqui, Manaal [1]
Strohman, Trevor [1]
Affiliations
[1] Google Inc, Mountain View, CA 94043 USA
Source
Interspeech 2022
Keywords
end-to-end models; continued conversation;
DOI
10.21437/Interspeech.2022-569
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In voice-enabled applications, a predetermined hotword is usually used to activate a device so that it attends to the query. However, prefacing every query with a hotword imposes a cognitive burden in continued conversations. To avoid repeating the hotword, we propose a streaming end-to-end (E2E) intended query detector that identifies utterances directed towards the device and filters out utterances that are not. The proposed approach incorporates the intended query detector into the E2E model, which already folds the different components of the speech recognition pipeline into one neural network. Modeling speech decoding and intended query detection jointly also allows us to declare an intended query early, based on partial recognition results, which is important for reducing latency and keeping the system responsive. We demonstrate that the proposed E2E approach yields a 22% relative improvement in equal error rate (EER) for detection accuracy and a 600 ms latency improvement compared with an independent intended query detector. In our experiment, the proposed model detects whether the user is talking to the device with an 8.7% EER at a median latency of 1.4 seconds after the user starts speaking.
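The abstract reports detection accuracy as an equal error rate (EER), the operating point at which the false-accept rate equals the false-reject rate. The following is a minimal sketch of how such a figure could be computed from per-utterance detector scores. It is not the authors' evaluation code: the function name `equal_error_rate`, the score convention (higher means "directed at the device"), and the toy data are all assumptions made for illustration.

```python
def equal_error_rate(scores, labels):
    """Approximate the EER by sweeping thresholds over the observed scores.

    Assumed convention (not from the paper): label 1 means the utterance is
    directed at the device, label 0 means it is not, and an utterance is
    accepted when its score is >= the threshold.
    Returns (eer, threshold).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    best = None  # (gap between FAR and FRR, EER estimate, threshold)
    for t in sorted(set(scores)):
        far = sum(s >= t for s in neg) / len(neg)  # false-accept rate
        frr = sum(s < t for s in pos) / len(pos)   # false-reject rate
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # With finite data the two rates rarely cross exactly, so the
            # EER is estimated as their mean at the closest threshold.
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy data, invented for illustration only.
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
labels = [1, 1, 0, 1, 1, 0, 0, 0]
eer, threshold = equal_error_rate(scores, labels)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

On this toy data one of four non-directed utterances is accepted and one of four directed utterances is rejected at the chosen threshold, so both error rates are 25%.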
Pages: 1826 - 1830
Number of pages: 5