Streaming Intended Query Detection using E2E Modeling for Continued Conversation

被引:1
|
作者
Chang, Shuo-yiin [1 ]
Prakash, Guru [1 ]
Wu, Zelin [1 ]
Liang, Qiao [1 ]
Sainath, Tara N. [1 ]
Li, Bo [1 ]
Stambler, Adam [1 ]
Upadhyay, Shyam [1 ]
Faruqui, Manaal [1 ]
Strohman, Trevor [1 ]
机构
[1] Google Inc, Mountain View, CA 94043 USA
来源
关键词
end-to-end models; continued conversation;
D O I
10.21437/Interspeech.2022-569
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In voice-enabled applications, a predetermined hotword is usually used to activate a device in order to attend to the query. However, speaking queries followed by a hotword each time introduces a cognitive burden in continued conversations. To avoid repeating a hotword, we propose a streaming end-to-end (E2E) intended query detector that identifies the utterances directed towards the device and filters out other utterances not directed towards device. The proposed approach incorporates the intended query detector into the E2E model that already folds different components of the speech recognition pipeline into one neural network. The E2E modeling on speech decoding and intended query detection also allows us to declare a quick intended query detection based on early partial recognition result, which is important to decrease latency and make the system responsive. We demonstrate that the proposed E2E approach yields a 22% relative improvement on equal error rate (EER) for the detection accuracy and 600 ms latency improvement compared with an independent intended query detector. In our experiment, the proposed model detects whether the user is talking to the device with a 8.7% EER within 1.4 seconds of median latency after user starts speaking.
引用
收藏
页码:1826 / 1830
页数:5
相关论文
共 50 条
  • [1] Retrieval-oriented E2E ASR Modeling for Improved Query-by-example Spoken Term Detection
    Kurokawa, Takumi
    Kai, Atsuhiko
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1037 - 1042
  • [2] A CIF-Based Speech Segmentation Method for Streaming E2E ASR
    Shu, Yuchun
    Luo, Haoneng
    Zhang, Shiliang
    Wang, Longbiao
    Dang, Jianwu
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 344 - 348
  • [3] A proposed new knowledge management framework with an intended validation approach: The E2E model
    Faucher, Jean-Baptiste P. L.
    Everett, Andre M.
    Lawson, Rob
    Proceedings of the Sixth International Conference on Information and Management Sciences, 2007, 6 : 349 - 355
  • [4] On Persistent Implications of E2E Testing
    Frajtak, Karel
    Cerny, Tomas
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2021, 2022, 455 : 326 - 338
  • [5] E2E数据采集网络
    张振华
    宫海波
    李国星
    中国科技信息, 2017, (06) : 67 - 70
  • [6] Detection of Anomalous e2e Encrypted Function Invocation in FaaS using Zero-Knowledge Proofs
    Andreotti, Davide
    Verticale, Giacomo
    2024 IEEE 10TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT 2024, 2024, : 175 - 179
  • [7] HubNet: An E2E Model for Wheel Hub Text Detection and Recognition Using Global and Local Features
    Zeng, Yue
    Meng, Cai
    SENSORS, 2024, 24 (19)
  • [8] End-to-End (e2e) Quality of Service (QoS) For IPv6 Video Streaming
    Hassan, Rosilah
    Jabbar, Rana
    2017 19TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATIONS TECHNOLOGY (ICACT) - OPENING NEW ERA OF SMART SOCIETY, 2017, : 1 - 4
  • [9] NOAO E2E Integrated Data Cache Initiative Using iRODS
    Barg, Irene
    Scott, Derec
    Timmermann, Erik
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XX, 2011, 442 : 497 - 500
  • [10] POSTER: An E2E Trusted Cloud Infrastructure
    Wang, Juan
    Zhao, Bo
    Zhang, Huanguo
    Yan, Fei
    Zhang, Liqiang
    Yu, Fajiang
    Hu, Hongxin
    CCS'14: PROCEEDINGS OF THE 21ST ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2014, : 1517 - 1519