Task Estimation Using Latent Semantic Analysis of Visual Scenes and Spoken Words

Authors
Kimura, Masashi [1]
Sawada, Shinta [1]
Iribe, Yurie [2]
Katsurada, Kouichi [1]
Nitta, Tsuneo [1]
Affiliations
[1] Toyohashi Univ Technol, Toyohashi, Aichi, Japan
[2] Toyohashi Univ Technol, Informat & Media Ctr, Toyohashi, Aichi, Japan
Keywords
multimodal processing; latent semantic analysis; task estimation
DOI
10.1002/ecj.11560
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
In this paper, we propose a task estimation method based on multiple subspaces extracted from multimodal information: image objects in visual scenes and spoken words in dialogue that appear in the same task. The subspaces are obtained by latent semantic analysis (LSA). In the proposed method, a task vector composed of the frequencies of spoken words and of image-object appearances is extracted first, and then the similarities between the input task vector and the reference subspaces of the different tasks are compared. Experiments are conducted on the identification of game tasks. The results show that the proposed method with multimodal information outperforms methods that use only a single modality, either image or spoken dialogue. The proposed method also remains accurate when less spoken dialogue is available.
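The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that each task's reference subspace is spanned by the leading left singular vectors of a count matrix, and that similarity is the length of the input vector's projection onto that subspace relative to its own length. The feature layout, the task names "chess" and "poker", the counts, and the rank k are all hypothetical.

```python
import numpy as np

# Hypothetical feature order: two spoken-word counts, then two image-object counts.
FEATURES = ["word:move", "word:card", "object:board", "object:deck"]


def task_subspace(samples, k):
    """Return an orthonormal basis (n_features x k) for one task.

    samples: (n_features, n_samples) matrix of word and object counts.
    The basis is the top-k left singular vectors (assumed LSA step).
    """
    u, _, _ = np.linalg.svd(np.asarray(samples, dtype=float), full_matrices=False)
    return u[:, :k]


def subspace_similarity(x, basis):
    """Ratio of the projected length of x onto the subspace to the length of x."""
    x = np.asarray(x, dtype=float)
    norm = np.linalg.norm(x)
    if norm == 0.0:
        return 0.0
    return float(np.linalg.norm(basis.T @ x) / norm)


def estimate_task(x, subspaces):
    """Pick the task whose reference subspace is most similar to x."""
    return max(subspaces, key=lambda name: subspace_similarity(x, subspaces[name]))


# Toy reference data (made up): columns are training samples of each task.
chess_samples = [[3, 2], [0, 0], [2, 3], [0, 0]]
poker_samples = [[0, 0], [2, 3], [0, 0], [3, 2]]

subspaces = {
    "chess": task_subspace(chess_samples, k=1),
    "poker": task_subspace(poker_samples, k=1),
}

# Input task vector: word and object counts observed in a new scene.
query = [2, 0, 1, 0]
print(estimate_task(query, subspaces))
```

Concatenating both modalities into one vector is one simple way to combine them; the paper may weight or combine the modalities differently.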
Pages: 33-42
Page count: 10