Revolutionizing crowd surveillance through voice-driven face recognition empowering rapid identification: towards development of sustainable smart cities

被引:0
|
作者
Bhat, Manish [1 ]
Paul, Samuel [1 ]
Sahu, Umesh Kumar [1 ]
Yadav, Umesh Kumar [2 ]
机构
[1] Manipal Inst Technol, Manipal Acad Higher Educ, Dept Mechatron, Manipal 576104, Karnataka, India
[2] Malaviya Natl Inst Technol, Dept Elect Engn, Jaipur 302017, Rajasthan, India
来源
ENGINEERING RESEARCH EXPRESS | 2024年 / 6卷 / 02期
关键词
crowd surveillance; conformer architecture; facial recognition; speech recognition; sustainable smart cities; Viola-Jones algorithm; voice-activated system;
D O I
10.1088/2631-8695/ad4ae9
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recent global efforts to create sustainable smart cities have significantly transformed society and improved the lives of people. Nowadays, crowd surveillance (CS) has become essential in sustainable smart cities and society to protect public safety and security. In this regard, the face-based human detection system has received considerable attention because it is recognized as an emerging method in crowd surveillance applications. Thus, in this work, a new method for real-time identification of people for a crowd surveillance system (CSS) that uses facial and speech recognition technology has been introduced. In traditional CS systems, human operators are frequently used by crowd surveillance systems to watch and evaluate video feeds. Human error and operator weariness may result in lost opportunities or slow replies, which reduce the system's efficacy. Certain procedures, including the initial identification and monitoring of people in video feeds, can be automated using a voice-activated system. To address the issues with the present CSS, a new framework Voice-Activated Face Recognition (VAFR) is proposed in this work. The proposed framework combines the speech and face recognition models for crowd surveillance. Experimental and simulation studies have been performed to analyze the performance of the proposed VAFR framework. The proposed framework uses the Viola-Jones algorithm for face identification and the Conformer architecture for speech analysis, reaching a noteworthy 99.8% accuracy rate in live video feeds. In addition, the ethical and safety aspect of the proposed VAFR system is presented.
引用
收藏
页数:17
相关论文
共 3 条
  • [1] Towards the sustainable development of smart cities through mass video surveillance: A response to the COVID-19 pandemic
    Shorfuzzaman, Mohammad
    Hossain, M. Shamim
    Alhamid, Mohammed F.
    SUSTAINABLE CITIES AND SOCIETY, 2021, 64
  • [2] Towards sustainable smart cities: Maturity assessment and development pattern recognition in China
    Liu, Jingjing
    Chen, Nengcheng
    Chen, Zeqiang
    Xu, Lei
    Du, Wenying
    Zhang, Yan
    Wang, Chao
    JOURNAL OF CLEANER PRODUCTION, 2022, 370
  • [3] Towards A Sustainable Development Cities Through Smart Shopping Trolly: A Response to the Covid-19 Pandemic
    Bita, Aileen Anak
    Al-Humairi, Safaa Najah Saud
    Azlan, Adzliza Salmi Binti Mohamad
    11TH IEEE SYMPOSIUM ON COMPUTER APPLICATIONS & INDUSTRIAL ELECTRONICS (ISCAIE 2021), 2021, : 141 - 145