Task Estimation Using Latent Semantic Analysis of Visual Scenes and Spoken Words

被引:0
|
作者
Kimura, Masashi [1 ]
Sawada, Shinta [1 ]
Iribe, Yurie [2 ]
Katsurada, Kouichi [1 ]
Nitta, Tsuneo [1 ]
机构
[1] Toyohashi Univ Technol, Toyohashi, Aichi, Japan
[2] Toyohashi Univ Technol, Informat & Media Ctr, Toyohashi, Aichi, Japan
关键词
multimodal processing; latent semantic analysis; task estimation;
D O I
10.1002/ecj.11560
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a task estimation method based on multiple subspaces extracted from multimodal information of image objects in visual scenes and spoken words in dialogue appearing in the same task. The multiple subspaces are obtained by using latent semantic analysis (LSA). In the proposed method, a task vector composed of spoken words and the frequencies of image-object appearances are extracted first, and then similarities among the input task vector and reference subspaces of different tasks are compared. Experiments are conducted on the identification of game tasks. The experimental results show that the proposed method with multimodal information outperforms the method in which only the single modality of image or spoken dialogue is applied. The proposed method achieves accurate performance even if less spoken dialogue is applied.
引用
收藏
页码:33 / 42
页数:10
相关论文
共 50 条
  • [21] Learning latent semantic model with visual consistency for image analysis
    Jian Cheng
    Peng Li
    Ting Rui
    Hanqing Lu
    Multimedia Tools and Applications, 2015, 74 : 1341 - 1356
  • [22] Discriminatively trained spoken document similarity models and their application to probabilistic latent semantic analysis
    Thambiratnam, K.
    Seide, F.
    Yu, P.
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 42 - +
  • [23] Semantic Analysis and Organization of Spoken Documents Based on Parameters Derived From Latent Topics
    Kong, Sheng-Yi
    Lee, Lin-Shan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1875 - 1889
  • [24] Stemming for Arabic Words Similarity Measures based on Latent Semantic Analysis Model
    Froud, Hanane
    Lachkar, Abdelmonaime
    Alaoui Ouatik, Said
    2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 780 - 784
  • [25] Text summarization using Latent Semantic Analysis
    Ozsoy, Makbule Gulcin
    Alpaslan, Ferda Nur
    Cicekli, Ilyas
    JOURNAL OF INFORMATION SCIENCE, 2011, 37 (04) : 405 - 417
  • [26] LATENT SEMANTIC INDEXING USING MULTIRESOLUTION ANALYSIS
    Jaber, Tareq
    Amira, Abbes
    Milligan, Peter
    PECCS 2011: PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON PERVASIVE AND EMBEDDED COMPUTING AND COMMUNICATION SYSTEMS, 2011, : 327 - 332
  • [27] Enhanced Latent Semantic Analysis by Considering Mistyped Words in Automated Essay Scoring
    Sendra, Martin
    Sutrisno, Rudy
    Harianata, Josep
    Suhartono, Derwin
    Asmani, Almodad Biduk
    2016 INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTING (ICIC), 2016, : 304 - 308
  • [28] Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis
    Xia, Dingyin
    Wu, Fei
    Zhuang, Yueting
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2008, 9TH PACIFIC RIM CONFERENCE ON MULTIMEDIA, 2008, 5353 : 842 - 845
  • [29] Tweets Clustering using Latent Semantic Analysis
    Rasidi, Norsuhaili Mahamed
    Abu Bakar, Sakhinah
    Razak, Fatimah Abdul
    4TH INTERNATIONAL CONFERENCE ON MATHEMATICAL SCIENCES (ICMS4): MATHEMATICAL SCIENCES: CHAMPIONING THE WAY IN A PROBLEM BASED AND DATA DRIVEN SOCIETY, 2017, 1830
  • [30] Semantic Analysis of Spoken Input Using Markov Logic Networks
    Despotovic, Vladimir
    Walter, Oliver
    Haeb-Umbach, Reinhold
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1859 - 1863