SUPERVISED MULTI-MODAL TOPIC MODEL FOR IMAGE ANNOTATION

Cited by: 0
Authors
Tran, Thu Hoai [1]
Choi, Seungjin [1]
Affiliation
[1] POSTECH, Div IT Convergence Engn, Pohang, South Korea
Keywords
Image annotation; latent Dirichlet allocation; topic models
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Multi-modal topic models are probabilistic generative models in which hidden topics are learned from data of different types. In this paper we present supervised multi-modal latent Dirichlet allocation (smmLDA), which incorporates the class label (a global description) into the joint modeling of visual words and caption words (local descriptions) for the image annotation task. We derive a variational inference algorithm to approximate the posterior distribution over the latent variables. Experiments on a subset of the LabelMe dataset demonstrate the useful behavior of our model compared to existing topic models.
Pages: 5