Virtual home assistant for voice based controlling and scheduling with short speech speaker identification

被引：0

作者：

Varun Tiwari

Mohammad Farukh Hashmi

Avinash Keskar

N. C. Shivaprakash

机构：

[1] Visvesvaraya National Institute of Technology,Department of Electronics and Communication Engineering

[2] National Institute of Technology Campus,Department of Electronics and Communication Engineering

[3] Indian Institute of Science,Department of Instrumentation and Applied Physics

来源：

Multimedia Tools and Applications | 2020年 / 79卷

关键词：

Cloud services; Gaussian mixture models; Internet of things; Principal component analysis; Speaker identification; Vector quantization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

With the advancement of interface technologies in smart devices, voice-controlled assistants have quickly gained popularity. These assistants are designed to use voice commands to achieve a more human-friendly interaction. On these lines, we propose a cloud-connected voice based home assistant in this paper. It accepts voice commands to control or monitor devices in a home. It can understand and schedule device operations based on time or sensor data through a simple voice based approach. To enhance its capability, it is designed to identify the speakers. Mel-Frequency Cepstrum Coefficients (MFCC) in combination with other speech features are used as feature vector. We use Vector Quantization (VQ) and Principal Component Analysis (PCA) for dimensionality reduction of the feature vector, followed by Gaussian Mixture Model (GMM) for classification. The validation of the short speech speaker identification is carried out on a set of Indian speakers in an uncontrolled indoor environment. An accuracy greater than 92% is achieved for speech samples as small as 1 second. A database of more than 50 different commands per speaker is also created for validation of the proposed virtual assistant. IBM’s Bluemix and Google’s cloud service is used for speech to text conversion.

引用

页码：5243 / 5268

页数：25

共 50 条

[1] Virtual home assistant for voice based controlling and scheduling with short speech speaker identification
Tiwari, Varun
Hashmi, Mohammad Farukh
Keskar, Avinash
Shivaprakash, N. C.
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (7-8) : 5243 - 5268
[2] An Improvement of the Degradation of Speaker Recognition in Continuous Cold Speech for Home Assistant
Ai, Haojun
Wang, Yifeng
Yang, Yuhong
Zhang, Quanxin
CYBERSPACE SAFETY AND SECURITY, PT I, 2020, 11982 : 363 - 373
[3] An IoT based Smart Home with Virtual Assistant
Vamsi, T. M. N.
Suchitra, B.
Kumar, Sai
Varma, K. V. V.
Kumar, K. N. S. Harshit
2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
[4] Alexa-Based Voice Assistant for Smart Home Applications
Jimenez C.
Saavedra E.
Del Campo G.
Santamaria A.
IEEE Potentials, 2021, 40 (04): : 31 - 38
[5] Speaker Authentication System Based on Voice Biometrics and Speech Recognition
Dovydaitis, Laurynas
Rasymas, Tomas
Rudzionis, Vytautas
BUSINESS INFORMATION SYSTEMS WORKSHOPS, BIS 2016, 2017, 263 : 79 - 84
[6] Speaker identification from emotional and noisy speech using learned voice segregation and speech VGG
Hamsa, Shibani
Shahin, Ismail
Iraqi, Youssef
Damiani, Ernesto
Nassif, Ali Bou
Werghi, Naoufel
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
[7] Multimodal Speaker Identification Based on Text and Speech
Moschonas, Panagiotis
Kotropoulos, Constantine
BIOMETRICS AND IDENTITY MANAGEMENT, 2008, 5372 : 100 - 109
[8] CHA: A Caching Framework for Home-based Voice Assistant Systems
Xu, Lanyu
Iyengar, Arun
Shi, Weisong
2020 IEEE/ACM SYMPOSIUM ON EDGE COMPUTING (SEC 2020), 2020, : 293 - 306
[9] Speaker Identification in Noisy Conditions Using Short Sequences of Speech Frames
Biagetti, Giorgio
Crippa, Paolo
Falaschetti, Laura
Orcioni, Simone
Turchetti, Claudio
INTELLIGENT DECISION TECHNOLOGIES 2017, KES-IDT 2017, PT II, 2018, 73 : 43 - 52
[10] Speaker Identification Based on Physical Variation of Speech Signal
Nandan, Durgesh
Singh, Mahesh Kumar
Kumar, Sanjeev
Yadav, Harendra Kumar
TRAITEMENT DU SIGNAL, 2022, 39 (02) : 711 - 716

← 1 2 3 4 5 →