Multi-Frequency RF Sensor Fusion for Word-Level Fluent ASL Recognition

被引:9
|
作者
Gurbuz, Sevgi Z. [1 ]
Rahman, M. Mahbubur [1 ]
Kurtoglu, Emre [1 ]
Malaia, Evie [2 ]
Gurbuz, Ali Cafer [3 ]
Griffin, Darrin J. [4 ]
Crawford, Chris [5 ]
机构
[1] Univ Alabama, Dept Elect & Comp Engn, Tuscaloosa, AL 35487 USA
[2] Univ Alabama, Dept Commun Disorders, Tuscaloosa, AL 35487 USA
[3] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[4] Univ Alabama, Dept Commun Studies, Tuscaloosa, AL 35487 USA
[5] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
基金
美国国家科学基金会;
关键词
Sensors; Radio frequency; Radar; Bandwidth; Auditory system; Time-frequency analysis; Sensor fusion; American sign language; gesture recognition; radar micro-Doppler; RF sensing; deep learning; autoencoders; DOPPLER RADAR; CLASSIFICATION;
D O I
10.1109/JSEN.2021.3078339
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Deaf spaces are unique indoor environments designed to optimize visual communication and Deaf cultural expression. However, much of the technological research geared towards the deaf involve use of video or wearables for American sign language (ASL) translation, with little consideration for Deaf perspective on privacy and usability of the technology. In contrast to video, RF sensors offer the avenue for ambient ASL recognition while also preserving privacy for Deaf signers. Methods: This paper investigates the RF transmit waveform parameters required for effective measurement of ASL signs and their effect on word-level classification accuracy attained with transfer learning and convolutional autoencoders (CAE). A multi-frequency fusion network is proposed to exploit data from all sensors in an RF sensor network and improve the recognition accuracy of fluent ASL signing. Results: For fluent signers, CAEs yield a 20-sign classification accuracy of %76 at 77 GHz and %73 at 24 GHz, while at X-band (10 Ghz) accuracy drops to 67%. For hearing imitation signers, signs are more separable, resulting in a 96% accuracy with CAEs. Further, fluent ASL recognition accuracy is significantly increased with use of the multi-frequency fusion network, which boosts the 20-sign fluent ASL recognition accuracy to 95%, surpassing conventional feature level fusion by 12%. Implications: Signing involves finer spatiotemporal dynamics than typical hand gestures, and thus requires interrogation with a transmit waveform that has a rapid succession of pulses and high bandwidth. Millimeter wave RF frequencies also yield greater accuracy due to the increased Doppler spread of the radar backscatter. Comparative analysis of articulation dynamics also shows that imitation signing is not representative of fluent signing, and not effective in pre-training networks for fluent ASL classification. Deep neural networks employing multi-frequency fusion capture both shared, as well as sensor-specific features and thus offer significant performance gains in comparison to using a single sensor or feature-level fusion.
引用
收藏
页码:11373 / 11381
页数:9
相关论文
共 50 条
  • [41] A ghost imaging method based on multi-frequency fusion
    Hualong Ye
    Yi Kang
    Jian Wang
    Leihong Zhang
    Haojie Sun
    Dawei Zhang
    The European Physical Journal D, 2022, 76
  • [42] Multi-frequency Miniaturized RF Components Using Hybrid Substrates
    Mondal, Saikat
    Karrapuswami, Saranraj
    Kumar, Deepak
    Chahal, Premjeet
    IEEE 71ST ELECTRONIC COMPONENTS AND TECHNOLOGY CONFERENCE (ECTC 2021), 2021, : 191 - 196
  • [43] DDC design for multi-frequency receiver based on RF sampling
    Han, Chunyang
    Sun, Wei
    Han, Guixin
    MATERIAL SCIENCE, CIVIL ENGINEERING AND ARCHITECTURE SCIENCE, MECHANICAL ENGINEERING AND MANUFACTURING TECHNOLOGY II, 2014, 651-653 : 413 - 416
  • [44] Speech Emotion Recognition based on AttentionWeight Correction Using Word-level Confidence Measure
    Santoso, Jennifer
    Yamada, Takeshi
    Makino, Shoji
    Ishizuka, Kenkichi
    Hiramura, Takekatsu
    INTERSPEECH 2021, 2021, : 1947 - 1951
  • [45] MULTI-FREQUENCY TUNING TECHNIQUE FOR DISTRIBUTED SENSOR LOCALIZATION
    Yao, Bobin
    Wang, Wenjie
    Wu, Feilong
    Yin, Qinye
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3915 - 3918
  • [46] Word-Level Multi-Fix Rectifiability of Finite Field Arithmetic Circuits
    Rao, Vikas
    Ilioaea, Irina
    Ondricek, Haden
    Kalla, Priyank
    Enescu, Florian
    PROCEEDINGS OF THE 2021 TWENTY SECOND INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2021), 2021, : 41 - 47
  • [47] Multi-frequency sensor for remote measurement of breath and heartbeat
    Jelen, M.
    Biebl, E. M.
    ADVANCES IN RADIO SCIENCE, 2006, 4 (79-83) : 79 - 83
  • [48] Multi-Frequency Interrogation of Nanostructured Gas Sensor Arrays
    Calavia, Raul
    Maria Vazquez, Rosa
    Llobet, Eduard
    Vergara, Alexander
    2010 IEEE SENSORS, 2010, : 1083 - 1087
  • [49] Magnetic Sensor Calibration Based on Multi-frequency Signal
    Wang Yanzhang
    Cheng Defu
    Lu Hao
    Wan Yunxia
    2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL I, 2009, : 31 - 34
  • [50] Word-Level Script Identification from Handwritten Multi-script Documents
    Singh, Pawan Kumar
    Mondal, Arafat
    Bhowmik, Showmik
    Sarkar, Ram
    Nasipuri, Mita
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 551 - 558