SUPERVISED MULTI-MODAL TOPIC MODEL FOR IMAGE ANNOTATION

Cited by: 0

Authors:

Tran, Thu Hoai [1]

Choi, Seungjin [1]

Affiliations:

[1] POSTECH, Div IT Convergence Engn, Pohang, South Korea

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

Image annotation; latent Dirichlet allocation; topic models

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Multi-modal topic models are probabilistic generative models in which hidden topics are learned from data of different types. In this paper we present supervised multi-modal latent Dirichlet allocation (smmLDA), which incorporates the class label (a global description) into the joint modeling of visual words and caption words (local descriptions) for the image annotation task. We derive a variational inference algorithm to approximate the posterior distribution over the latent variables. Experiments on a subset of the LabelMe dataset demonstrate the useful behavior of our model compared to existing topic models.

Pages: 5
