Multi-Frequency RF Sensor Fusion for Word-Level Fluent ASL Recognition

Cited by: 9
Authors
Gurbuz, Sevgi Z. [1]
Rahman, M. Mahbubur [1]
Kurtoglu, Emre [1]
Malaia, Evie [2]
Gurbuz, Ali Cafer [3]
Griffin, Darrin J. [4]
Crawford, Chris [5]
Affiliations
[1] Univ Alabama, Dept Elect & Comp Engn, Tuscaloosa, AL 35487 USA
[2] Univ Alabama, Dept Commun Disorders, Tuscaloosa, AL 35487 USA
[3] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[4] Univ Alabama, Dept Commun Studies, Tuscaloosa, AL 35487 USA
[5] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
Funding
National Science Foundation (USA);
Keywords
Sensors; Radio frequency; Radar; Bandwidth; Auditory system; Time-frequency analysis; Sensor fusion; American sign language; gesture recognition; radar micro-Doppler; RF sensing; deep learning; autoencoders; DOPPLER RADAR; CLASSIFICATION;
DOI
10.1109/JSEN.2021.3078339
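Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract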
Deaf spaces are unique indoor environments designed to optimize visual communication and Deaf cultural expression. However, much of the technological research geared towards the deaf involves the use of video or wearables for American Sign Language (ASL) translation, with little consideration for Deaf perspectives on privacy and usability of the technology. In contrast to video, RF sensors offer an avenue for ambient ASL recognition while also preserving privacy for Deaf signers. Methods: This paper investigates the RF transmit waveform parameters required for effective measurement of ASL signs and their effect on word-level classification accuracy attained with transfer learning and convolutional autoencoders (CAE). A multi-frequency fusion network is proposed to exploit data from all sensors in an RF sensor network and improve the recognition accuracy of fluent ASL signing. Results: For fluent signers, CAEs yield a 20-sign classification accuracy of 76% at 77 GHz and 73% at 24 GHz, while at X-band (10 GHz) accuracy drops to 67%. For hearing imitation signers, signs are more separable, resulting in a 96% accuracy with CAEs. Further, fluent ASL recognition accuracy is significantly increased with use of the multi-frequency fusion network, which boosts the 20-sign fluent ASL recognition accuracy to 95%, surpassing conventional feature-level fusion by 12%. Implications: Signing involves finer spatiotemporal dynamics than typical hand gestures, and thus requires interrogation with a transmit waveform that has a rapid succession of pulses and high bandwidth. Millimeter-wave RF frequencies also yield greater accuracy due to the increased Doppler spread of the radar backscatter. Comparative analysis of articulation dynamics also shows that imitation signing is not representative of fluent signing and is not effective in pre-training networks for fluent ASL classification. Deep neural networks employing multi-frequency fusion capture both shared and sensor-specific features, and thus offer significant performance gains in comparison to using a single sensor or feature-level fusion.
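The abstract attributes the higher accuracy at millimeter-wave frequencies to increased Doppler spread. As a rough illustration only, the sketch below applies the standard monostatic radar Doppler relation f_d = 2 v f_c / c to the three carrier frequencies named in the abstract. The 1 m/s radial hand velocity and all identifiers are assumptions chosen for illustration; they are not taken from the paper.

```python
# Illustrative sketch: Doppler shift of a single moving scatterer
# at the three carrier frequencies named in the abstract.

C = 299_792_458.0  # speed of light in m/s


def doppler_shift_hz(radial_velocity_mps: float, carrier_hz: float) -> float:
    """Monostatic radar Doppler shift: f_d = 2 * v * f_c / c."""
    return 2.0 * radial_velocity_mps * carrier_hz / C


# Carrier frequencies mentioned in the abstract.
CARRIERS_HZ = {
    "X-band (10 GHz)": 10e9,
    "24 GHz": 24e9,
    "77 GHz": 77e9,
}

# Hypothetical radial hand velocity (assumption, not a value from the paper).
ASSUMED_VELOCITY_MPS = 1.0

shifts = {
    name: doppler_shift_hz(ASSUMED_VELOCITY_MPS, f_c)
    for name, f_c in CARRIERS_HZ.items()
}

for name, f_d in shifts.items():
    print(f"{name}: {f_d:.1f} Hz")
```

Because the shift scales linearly with carrier frequency, the same motion occupies a Doppler band 7.7 times wider at 77 GHz than at 10 GHz, which is consistent with the accuracy trend the abstract reports.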
Pages: 11373-11381
Page count: 9