Learning multi-task local metrics for image annotation

被引：0

作者：

Xing Xu

Atsushi Shimada

Hajime Nagahara

Rin-ichiro Taniguchi

机构：

[1] Kyushu University,Department of Advanced Information and Technology

来源：

Multimedia Tools and Applications | 2016年 / 75卷

关键词：

Image annotation; Label prediction; Metric learning; Local metric; Multi-task learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The goal of image annotation is to automatically assign a set of textual labels to an image to describe the visual contents thereof. Recently, with the rapid increase in the number of web images, nearest neighbor (NN) based methods have become more attractive and have shown exciting results for image annotation. One of the key challenges of these methods is to define an appropriate similarity measure between images for neighbor selection. Several distance metric learning (DML) algorithms derived from traditional image classification problems have been applied to annotation tasks. However, a fundamental limitation of applying DML to image annotation is that it learns a single global distance metric over the entire image collection and measures the distance between image pairs in the image-level. For multi-label annotation problems, it may be more reasonable to measure similarity of image pairs in the label-level. In this paper, we develop a novel label prediction scheme utilizing multiple label-specific local metrics for label-level similarity measure, and propose two different local metric learning methods in a multi-task learning (MTL) framework. Extensive experimental results on two challenging annotation datasets demonstrate that 1) utilizing multiple local distance metrics to learn label-level distances is superior to using a single global metric in label prediction, and 2) the proposed methods using the MTL framework to learn multiple local metrics simultaneously can model the commonalities of labels, thereby facilitating label prediction results to achieve state-of-the-art annotation performance.

引用

页码：2203 / 2231

页数：28

共 50 条

[1] Learning multi-task local metrics for image annotation
Xu, Xing
Shimada, Atsushi
Nagahara, Hajime
Taniguchi, Rin-ichiro
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (04) : 2203 - 2231
[2] Enhanced representation and multi-task learning for image annotation
Binder, Alexander
Samek, Wojciech
Mueller, Klaus-Robert
Kawanabe, Motoaki
COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (05) : 466 - 478
[3] Hierarchical learning of multi-task sparse metrics for large-scale image classification
Zheng, Yu
Fan, Jianping
Zhang, Ji
Gao, Xinbo
PATTERN RECOGNITION, 2017, 67 : 97 - 109
[4] Multi-label Annotation for Visual Multi-Task Learning Models
Sharma, Gaurang
Angleraud, Alexandre
Pieters, Roel
2023 SEVENTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2023, 2023, : 31 - 34
[5] Asymmetric Multi-Task Learning with Local Transference
Oliveira, Saullo H. G.
Goncalves, Andre R.
Von Zuben, Fernando J.
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (05)
[6] A Multi-task Learning Approach for Image Captioning
Zhao, Wei
Wang, Benyou
Ye, Jianbo
Yang, Min
Zhao, Zhou
Luo, Ruotian
Qiao, Yu
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1205 - 1211
[7] A Multi-Task Learning CNN for Image Steganalysis
Yu, Xiangyu
Tan, Huabin
Liang, Hui
Li, Chang-Tsun
Liao, Guangjun
2018 10TH IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2018,
[8] Multi-task Deep Learning for Image Understanding
Yu, Bo
Lane, Ian
2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 37 - 42
[9] Chooser - A Multi-Task Annotation Tool
Koeva, Svetla
Rizov, Borislav
Leseva, Svetlozara
SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 728 - 734
[10] Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-Task Learning
Wang, Hua
Joshi, Dhiraj
Luo, Jiebo
Huang, Heng
Park, Minwoo
2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 69 - 72

← 1 2 3 4 5 →