Toward hardware-aware deep-learning-based dialogue systems

被引:0
|
作者
Vlad Pandelea
Edoardo Ragusa
Tom Young
Paolo Gastaldo
Erik Cambria
机构
[1] Nanyang Technological University,School of Computer Science and Engineering
[2] University of Genoa,Department of Electrical, Electronic, Telecommunications Engineering and Naval Architecture, DITEN
来源
关键词
Dialogue systems; Natural language processing; Artificial intelligence;
D O I
暂无
中图分类号
学科分类号
摘要
In the past few years, the use of transformer-based models has experienced increasing popularity as new state-of-the-art performance was achieved in several natural language processing tasks. As these models are often extremely large, however, their use for applications within embedded devices may not be feasible. In this work, we look at one such specific application, retrieval-based dialogue systems, that poses additional difficulties when deployed in environments characterized by limited resources. Research on building dialogue systems able to engage in natural sounding conversation with humans has attracted increasing attention in recent years. This has led to the rise of commercial conversational agents, such as Google Home, Alexa and Siri situated on embedded devices, that enable users to interface with a wide range of underlying functionalities in a natural and seamless manner. In part due to memory and computational power constraints, these agents necessitate frequent communication with a server in order to process the users’ queries. This communication may act as a bottleneck, resulting in delays as well as in the halt of the system should the network connection be lost or unavailable. We propose a new framework for hardware-aware retrieval-based dialogue systems based on the Dual-Encoder architecture, coupled with a clustering method to group candidates pertaining to a same conversation, that reduces storage capacity and computational power requirements.
引用
收藏
页码:10397 / 10408
页数:11
相关论文
共 50 条
  • [31] Deep-Learning-Based Signal Detection for Banded Linear Systems
    Fan, Congmin
    Yuan, Xiaojun
    Zhang, Ying-Jun Angela
    2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [32] Guest Editorial: ACM JETC Special Issue on Hardware-Aware Learning for Medical Applications
    Shi, Yiyu
    Liu, Yongpan
    Chen, Jianxu
    Jiang, Steve
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2022, 18 (02)
  • [33] Deep-Learning-Based Localization Approach with pseudorange for Pseudolite Systems
    Runlong Ouyang
    Guo, Xiye
    Yang, Jun
    Liu, Kai
    Meng, Zhijun
    Li, Xiaoyu
    Chen, Guokai
    Liu, Suyang
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1799 - 1806
  • [34] Deep-Learning-Based Methodology for Fault Diagnosis in Electromechanical Systems
    Arellano-Espitia, Francisco
    Delgado-Prieto, Miguel
    Martinez-Viol, Victor
    Jose Saucedo-Dorantes, Juan
    Alfredo Osornio-Rios, Roque
    SENSORS, 2020, 20 (14) : 1 - 23
  • [35] Deep-Learning-Based Identification of LPV Models for Nonlinear Systems
    Verhoek, Chris
    Beintema, Gerben I.
    Haesaert, Sofie
    Schoukens, Maarten
    Toth, Roland
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 3274 - 3280
  • [36] Deep-learning-based line enhancer for passive sonar systems
    Ju, Donghao
    Chi, Cheng
    Li, Zigao
    Li, Yu
    Zhang, Chunhua
    Huang, Haining
    IET RADAR SONAR AND NAVIGATION, 2022, 16 (03): : 589 - 601
  • [37] Deep-Learning-Based Network Intrusion Detection for SCADA Systems
    Yang, Huan
    Cheng, Liang
    Chuah, Mooi Choo
    2019 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2019,
  • [38] Quantization error-based regularization for hardware-aware neural network training
    Hirose, Kazutoshi
    Uematsu, Ryota
    Ando, Kota
    Ueyoshi, Kodai
    Ikebe, Masayuki
    Asai, Tetsuya
    Motomura, Masato
    Takamaeda-Yamazaki, Shinya
    IEICE NONLINEAR THEORY AND ITS APPLICATIONS, 2018, 9 (04): : 453 - 465
  • [39] HDK: Toward High-Performance Deep-Learning-Based Kirchhoff Analysis
    Wang, Xinying
    Tawose, Olamide Timothy
    Yan, Feng
    Zhao, Dongfang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 997 - 1004
  • [40] Accountable Deep-Learning-Based Vision Systems for Preterm Infant Monitoring
    Migliorelli, Lucia
    Tiribelli, Simona
    Cacciatore, Alessandro
    Giovanola, Benedetta
    Frontoni, Emanuele
    Moccia, Sara
    COMPUTER, 2023, 56 (05) : 84 - 93