Toward hardware-aware deep-learning-based dialogue systems

Cited by: 0
Authors
Vlad Pandelea
Edoardo Ragusa
Tom Young
Paolo Gastaldo
Erik Cambria
Affiliations
[1] Nanyang Technological University, School of Computer Science and Engineering
[2] University of Genoa, Department of Electrical, Electronic, Telecommunications Engineering and Naval Architecture, DITEN
Source
Neural Computing & Applications, 2022, 34 (13)
Keywords
Dialogue systems; Natural language processing; Artificial intelligence
DOI
Not available
Abstract
In the past few years, transformer-based models have grown in popularity as they achieved new state-of-the-art performance in several natural language processing tasks. Because these models are often extremely large, however, using them in applications on embedded devices may not be feasible. In this work, we look at one such application, retrieval-based dialogue systems, which poses additional difficulties when deployed in resource-constrained environments. Research on building dialogue systems able to engage in natural-sounding conversation with humans has attracted increasing attention in recent years. This has led to the rise of commercial conversational agents situated on embedded devices, such as Google Home, Alexa and Siri, that enable users to interface with a wide range of underlying functionalities in a natural and seamless manner. In part due to memory and computational power constraints, these agents require frequent communication with a server in order to process the users’ queries. This communication may act as a bottleneck, causing delays and halting the system altogether should the network connection be lost or unavailable. We propose a new framework for hardware-aware retrieval-based dialogue systems based on the Dual-Encoder architecture, coupled with a clustering method that groups candidates pertaining to the same conversation; the framework reduces storage capacity and computational power requirements.
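The abstract does not say how the clustered candidates are used at retrieval time. The sketch below shows one plausible reading, not the authors' implementation: candidate responses are grouped by the conversation they come from, each group is summarized by a centroid embedding, and a query is scored first against the centroids and then only against the members of the best group. The function names, the toy three-dimensional embeddings, and the use of a plain dot product as the Dual-Encoder matching score are all assumptions made for illustration.

```python
from collections import defaultdict


def dot(u, v):
    """Dot-product score between two embeddings (assumed matching function)."""
    return sum(a * b for a, b in zip(u, v))


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def build_index(candidates):
    """Group candidates by conversation id and store one centroid per group.

    candidates: list of (conversation_id, response_text, embedding) tuples.
    In a real system the embeddings would come from the response encoder;
    here they are hand-written toy values.
    """
    groups = defaultdict(list)
    for conv_id, text, emb in candidates:
        groups[conv_id].append((text, emb))
    return {
        conv_id: (centroid([emb for _, emb in members]), members)
        for conv_id, members in groups.items()
    }


def retrieve(context_emb, index):
    """Two-stage retrieval: best cluster by centroid, then best member in it."""
    # Stage 1: compare the context embedding with one centroid per cluster.
    best_id = max(index, key=lambda cid: dot(context_emb, index[cid][0]))
    # Stage 2: score only the candidates inside the selected cluster.
    members = index[best_id][1]
    best_text, _ = max(members, key=lambda m: dot(context_emb, m[1]))
    return best_id, best_text


# Hypothetical toy data: two conversations with two candidate responses each.
candidates = [
    ("weather", "It looks sunny today.", [0.9, 0.1, 0.0]),
    ("weather", "Take an umbrella.", [0.7, 0.3, 0.0]),
    ("music", "Playing your playlist.", [0.0, 0.2, 0.9]),
    ("music", "Volume turned down.", [0.1, 0.0, 0.8]),
]
index = build_index(candidates)
result = retrieve([0.6, 0.4, 0.0], index)
print(result)
```

With this layout, a query costs one score per cluster plus one score per member of the selected cluster, rather than one score per stored candidate. This is the kind of saving in computation the abstract refers to, though the paper's actual mechanism may differ.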
Pages: 10397 - 10408
Number of pages: 11
Related papers
50 in total
  • [1] Toward hardware-aware deep-learning-based dialogue systems
    Pandelea, Vlad
    Ragusa, Edoardo
    Young, Tom
    Gastaldo, Paolo
    Cambria, Erik
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13) : 10397 - 10408
  • [2] Hardware-Aware Machine Learning: Modeling and Optimization
    Marculescu, Diana
    Stamoulis, Dimitrios
    Cai, Ermao
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018
  • [3] Hardware-aware approach to deep neural network optimization
    Li, Hengyi
    Meng, Lin
    NEUROCOMPUTING, 2023, 559
  • [4] DEEP-LEARNING-BASED ENERGY AWARE IMAGES
    Le Meur, Olivier
    Demarty, Claire-Helene
    Blonde, Laurent
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023 : 590 - 594
  • [5] Hardware-Aware Softmax Approximation for Deep Neural Networks
    Geng, Xue
    Lin, Jie
    Zhao, Bin
    Kong, Anmin
    Aly, Mohamed M. Sabry
    Chandrasekhar, Vijay
    COMPUTER VISION - ACCV 2018, PT IV, 2019, 11364 : 107 - 122
  • [6] Hardware-Aware In Situ Learning Based on Stochastic Magnetic Tunnel Junctions
    Kaiser, Jan
    Borders, William A.
    Camsari, Kerem Y.
    Fukami, Shunsuke
    Ohno, Hideo
    Datta, Supriyo
    PHYSICAL REVIEW APPLIED, 2022, 17 (01)
  • [7] Towards Hardware-Aware Tractable Learning of Probabilistic Models
    Olascoaga, Laura I. Galindez
    Meert, Wannes
    Shah, Nimish
    Verhelst, Marian
    Van den Broeck, Guy
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Neuromorphic Silicon Photonics and Hardware-Aware Deep Learning for High-Speed Inference
    Moralis-Pegios, Miltiadis
    Mourgias-Alexandris, George
    Tsakyridis, Apostolos
    Giamougiannis, George
    Totovic, Angelina
    Dabos, George
    Passalis, Nikolaos
    Kirtas, Manos
    Rutirawut, T.
    Gardes, F. Y.
    Tefas, Anastasios
    Pleros, Nikos
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2022, 40 (10) : 3243 - 3254
  • [9] Noise-Tolerant Hardware-Aware Pruning for Deep Neural Networks
    Lu, Shun
    Chen, Cheng
    Zhang, Kunlong
    Zheng, Yang
    Hu, Zheng
    Hong, Wenjing
    Li, Guiying
    Yao, Xin
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2023, PT II, 2023, 13969 : 127 - 138
  • [10] Hardware-Aware Affordance Detection for Application in Portable Embedded Systems
    Ragusa, Edoardo
    Gianoglio, Christian
    Dosen, Strahinja
    Gastaldo, Paolo
    IEEE ACCESS, 2021, 9 : 123178 - 123193