Virtual home assistant for voice based controlling and scheduling with short speech speaker identification

Cited by: 0
Authors
Varun Tiwari
Mohammad Farukh Hashmi
Avinash Keskar
N. C. Shivaprakash
Affiliations
[1] Visvesvaraya National Institute of Technology, Department of Electronics and Communication Engineering
[2] National Institute of Technology Campus, Department of Electronics and Communication Engineering
[3] Indian Institute of Science, Department of Instrumentation and Applied Physics
Keywords
Cloud services; Gaussian mixture models; Internet of things; Principal component analysis; Speaker identification; Vector quantization
Abstract
With the advancement of interface technologies in smart devices, voice-controlled assistants have quickly gained popularity. These assistants use voice commands to achieve a more human-friendly interaction. Along these lines, we propose a cloud-connected, voice-based home assistant in this paper. It accepts voice commands to control or monitor devices in a home, and it can understand and schedule device operations based on time or sensor data through a simple voice-based approach. To enhance its capability, it is designed to identify speakers. Mel-Frequency Cepstral Coefficients (MFCC), in combination with other speech features, are used as the feature vector. We use Vector Quantization (VQ) and Principal Component Analysis (PCA) for dimensionality reduction of the feature vector, followed by a Gaussian Mixture Model (GMM) for classification. The short-speech speaker identification is validated on a set of Indian speakers in an uncontrolled indoor environment. An accuracy greater than 92% is achieved for speech samples as short as 1 second. A database of more than 50 different commands per speaker is also created to validate the proposed virtual assistant. IBM’s Bluemix and Google’s cloud services are used for speech-to-text conversion.
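The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes synthetic 2-D feature vectors in place of real MFCC features, skips the VQ and PCA stages entirely, and fits one small diagonal-covariance GMM per speaker with a plain EM loop. All names here (fit_gmm, identify, SPEAKER_CENTERS, the speakers "alice" and "bob") are hypothetical. An unknown utterance is assigned to the speaker whose model gives its frames the highest average log-likelihood.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at point x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def component_logs(x, gmm):
    """Per-component log(weight * density) for one frame."""
    weights, means, variances = gmm
    return [math.log(w) + log_gauss(x, m, v)
            for w, m, v in zip(weights, means, variances)]


def fit_gmm(frames, k=2, iters=25, var_floor=1e-3):
    """Fit a k-component diagonal GMM to a list of feature vectors using EM."""
    n, d = len(frames), len(frames[0])
    # Deterministic start: evenly spaced frames as means, unit variances.
    means = [list(frames[(i * n) // k]) for i in range(k)]
    variances = [[1.0] * d for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each frame.
        resp = []
        for x in frames:
            logs = component_logs(x, (weights, means, variances))
            top = max(logs)
            p = [math.exp(v - top) for v in logs]
            total = sum(p)
            resp.append([v / total for v in p])
        # M-step: re-estimate weights, means and (floored) variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-12
            weights[j] = nj / n
            for t in range(d):
                mu = sum(r[j] * x[t] for r, x in zip(resp, frames)) / nj
                var = sum(r[j] * (x[t] - mu) ** 2
                          for r, x in zip(resp, frames)) / nj
                means[j][t] = mu
                variances[j][t] = max(var, var_floor)
    return weights, means, variances


def score(frames, gmm):
    """Average per-frame log-likelihood of an utterance under one GMM."""
    total = 0.0
    for x in frames:
        logs = component_logs(x, gmm)
        top = max(logs)
        total += top + math.log(sum(math.exp(v - top) for v in logs))
    return total / len(frames)


def identify(frames, models):
    """Return the speaker whose GMM scores the utterance highest."""
    return max(models, key=lambda name: score(frames, models[name]))


def synth_frames(rng, centers, per_center, spread=0.5):
    """Synthetic stand-in for real features: Gaussian clouds around centers."""
    return [[rng.gauss(c, spread) for c in center]
            for center in centers for _ in range(per_center)]


# Hypothetical speakers, each described by two clusters in feature space.
SPEAKER_CENTERS = {
    "alice": [(0.0, 0.0), (2.0, 1.0)],
    "bob": [(6.0, 5.0), (8.0, 6.0)],
}
rng = random.Random(0)
models = {name: fit_gmm(synth_frames(rng, centers, 100))
          for name, centers in SPEAKER_CENTERS.items()}

# A short "utterance" of 20 frames drawn from one speaker's clusters.
test_utterance = synth_frames(rng, SPEAKER_CENTERS["bob"], 10)
print(identify(test_utterance, models))
```

In a real system the frames would come from an MFCC front end and would typically be reduced in dimension before modelling; the scoring rule (pick the model with the highest average log-likelihood) stays the same regardless of how the features are produced.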
Pages: 5243-5268
Number of pages: 25
Source
Multimedia Tools and Applications, 2020, 79 (7-8)