SUPERVISED MULTI-MODAL TOPIC MODEL FOR IMAGE ANNOTATION

被引:0
|
作者
Tran, Thu Hoai [1 ]
Choi, Seungjin [1 ]
机构
[1] POSTECH, Div IT Convergence Engn, Pohang, South Korea
关键词
Image annotation; latent Dirichlet allocation; topic models;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multi-modal topic models are probabilistic generative models where hidden topics are learned from data of different types. In this paper we present supervised multi-modal latent Dirichlet allocation (smmLDA), where we incorporate class label (global description) into the joint modeling of visual words and caption words (local description), for image annotation task. We derive variational inference algorithm to approximately compute posterior distribution over latent variables. Experiments on a subset of LabelMe dataset demonstrate the useful behavior of our model, compared to existing topic models.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A Multi-Modal Topic Model for Image Annotation Using Text Analysis
    Tian, Jing
    Huang, Yu
    Guo, Zhi
    Qi, Xiang
    Chen, Ziyan
    Huang, Tinglei
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (07) : 886 - 890
  • [2] Topic Regression Multi-Modal Latent Dirichlet Allocation for Image Annotation
    Putthividhya, Duangmanee
    Attias, Hagai T.
    Nagarajan, Srikantan S.
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3408 - 3415
  • [3] Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
    Yin, Chun-yan
    Chen, Yong-Heng
    Zuo, Wan-li
    PATTERN RECOGNITION AND IMAGE ANALYSIS, 2020, 30 (01) : 76 - 86
  • [4] Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
    Chun-yan Yin
    Yong-Heng Chen
    Wan-li Zuo
    Pattern Recognition and Image Analysis, 2020, 30 : 76 - 86
  • [5] Erratum to: Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
    Chun-yan Yin
    Yong-Heng Chen
    Wan-li Zuo
    Pattern Recognition and Image Analysis, 2020, 30 : 566 - 566
  • [6] A probabilistic semantic model for image annotation and multi-modal image retrieval
    Zhang, RF
    Zhang, ZF
    Li, MJ
    Ma, WY
    Zhang, HJ
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 846 - 851
  • [7] A probabilistic semantic model for image annotation and multi-modal image retrieval
    Zhang, Ruofei
    Zhang, Zhongfei
    Li, Mingjing
    Ma, Wei-Ying
    Zhang, Hong-Jiang
    MULTIMEDIA SYSTEMS, 2006, 12 (01) : 27 - 33
  • [8] A probabilistic semantic model for image annotation and multi-modal image retrieval
    Ruofei Zhang
    Zhongfei (Mark) Zhang
    Mingjing Li
    Wei-Ying Ma
    Hong-Jiang Zhang
    Multimedia Systems, 2006, 12 : 27 - 33
  • [9] SUPERVISED TOPIC MODEL FOR AUTOMATIC IMAGE ANNOTATION
    Putthividhya, D.
    Attias, H. T.
    Nagarajan, S. S.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1894 - 1897
  • [10] Nonparametric Bayesian Upstream Supervised Multi-Modal Topic Models
    Liao, Renjie
    Zhu, Jun
    Qin, Zengchang
    WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 493 - 502