SUPERVISED MULTI-MODAL TOPIC MODEL FOR IMAGE ANNOTATION

被引：0

作者：

Tran, Thu Hoai ^{[1
]}

Choi, Seungjin ^{[1
]}

机构：

[1] POSTECH, Div IT Convergence Engn, Pohang, South Korea

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

Image annotation; latent Dirichlet allocation; topic models;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Multi-modal topic models are probabilistic generative models where hidden topics are learned from data of different types. In this paper we present supervised multi-modal latent Dirichlet allocation (smmLDA), where we incorporate class label (global description) into the joint modeling of visual words and caption words (local description), for image annotation task. We derive variational inference algorithm to approximately compute posterior distribution over latent variables. Experiments on a subset of LabelMe dataset demonstrate the useful behavior of our model, compared to existing topic models.

引用

页数：5

共 50 条

[1] A Multi-Modal Topic Model for Image Annotation Using Text Analysis
Tian, Jing
Huang, Yu
Guo, Zhi
Qi, Xiang
Chen, Ziyan
Huang, Tinglei
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (07) : 886 - 890
[2] Topic Regression Multi-Modal Latent Dirichlet Allocation for Image Annotation
Putthividhya, Duangmanee
Attias, Hagai T.
Nagarajan, Srikantan S.
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3408 - 3415
[3] Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
Yin, Chun-yan
Chen, Yong-Heng
Zuo, Wan-li
PATTERN RECOGNITION AND IMAGE ANALYSIS, 2020, 30 (01) : 76 - 86
[4] Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
Chun-yan Yin
Yong-Heng Chen
Wan-li Zuo
Pattern Recognition and Image Analysis, 2020, 30 : 76 - 86
[5] Erratum to: Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
Chun-yan Yin
Yong-Heng Chen
Wan-li Zuo
Pattern Recognition and Image Analysis, 2020, 30 : 566 - 566
[6] A probabilistic semantic model for image annotation and multi-modal image retrieval
Zhang, RF
Zhang, ZF
Li, MJ
Ma, WY
Zhang, HJ
TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 846 - 851
[7] A probabilistic semantic model for image annotation and multi-modal image retrieval
Zhang, Ruofei
Zhang, Zhongfei
Li, Mingjing
Ma, Wei-Ying
Zhang, Hong-Jiang
MULTIMEDIA SYSTEMS, 2006, 12 (01) : 27 - 33
[8] A probabilistic semantic model for image annotation and multi-modal image retrieval
Ruofei Zhang
Zhongfei (Mark) Zhang
Mingjing Li
Wei-Ying Ma
Hong-Jiang Zhang
Multimedia Systems, 2006, 12 : 27 - 33
[9] SUPERVISED TOPIC MODEL FOR AUTOMATIC IMAGE ANNOTATION
Putthividhya, D.
Attias, H. T.
Nagarajan, S. S.
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1894 - 1897
[10] Nonparametric Bayesian Upstream Supervised Multi-Modal Topic Models
Liao, Renjie
Zhu, Jun
Qin, Zengchang
WSDM'14: PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2014, : 493 - 502

← 1 2 3 4 5 →