Learning multi-task local metrics for image annotation

被引:0
|
作者
Xing Xu
Atsushi Shimada
Hajime Nagahara
Rin-ichiro Taniguchi
机构
[1] Kyushu University,Department of Advanced Information and Technology
来源
关键词
Image annotation; Label prediction; Metric learning; Local metric; Multi-task learning;
D O I
暂无
中图分类号
学科分类号
摘要
The goal of image annotation is to automatically assign a set of textual labels to an image to describe the visual contents thereof. Recently, with the rapid increase in the number of web images, nearest neighbor (NN) based methods have become more attractive and have shown exciting results for image annotation. One of the key challenges of these methods is to define an appropriate similarity measure between images for neighbor selection. Several distance metric learning (DML) algorithms derived from traditional image classification problems have been applied to annotation tasks. However, a fundamental limitation of applying DML to image annotation is that it learns a single global distance metric over the entire image collection and measures the distance between image pairs in the image-level. For multi-label annotation problems, it may be more reasonable to measure similarity of image pairs in the label-level. In this paper, we develop a novel label prediction scheme utilizing multiple label-specific local metrics for label-level similarity measure, and propose two different local metric learning methods in a multi-task learning (MTL) framework. Extensive experimental results on two challenging annotation datasets demonstrate that 1) utilizing multiple local distance metrics to learn label-level distances is superior to using a single global metric in label prediction, and 2) the proposed methods using the MTL framework to learn multiple local metrics simultaneously can model the commonalities of labels, thereby facilitating label prediction results to achieve state-of-the-art annotation performance.
引用
收藏
页码:2203 / 2231
页数:28
相关论文
共 50 条
  • [21] PERSONALITY DRIVEN MULTI-TASK LEARNING FOR IMAGE AESTHETIC ASSESSMENT
    Li, Leida
    Zhu, Hancheng
    Zhao, Sicheng
    Ding, Guiguang
    Jiang, Hongyan
    Tan, Allen
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 430 - 435
  • [22] Image Captioning with Deep Bidirectional LSTMs and Multi-Task Learning
    Wang, Cheng
    Yang, Haojin
    Meinel, Christoph
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (02)
  • [23] Red Lesion Segmentation of Fundus Image with Multi-task Learning
    Guo S.
    Li T.
    Li N.
    Kang H.
    Zhang Y.-J.
    Wang K.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (11): : 3646 - 3658
  • [24] Hand Image Understanding via Deep Multi-Task Learning
    Zhang, Xiong
    Huang, Hongsheng
    Tan, Jianchao
    Xu, Hongmin
    Yang, Cheng
    Peng, Guozhu
    Wang, Lei
    Liu, Ji
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11261 - 11272
  • [25] Deep multi-task learning for image/video distortions identification
    Ameur, Zoubida
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    Neural Computing and Applications, 2022, 34 (24) : 21607 - 21623
  • [26] MULTI-TASK DEEP LEARNING FOR SATELLITE IMAGE PANSHARPENING AND SEGMENTATION
    Khalel, Andrew
    Tasar, Onur
    Charpiat, Guillaume
    Tarabalka, Yuliya
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 4869 - 4872
  • [27] A Modulation Module for Multi-task Learning with Applications in Image Retrieval
    Zhao, Xiangyun
    Li, Haoxiang
    Shen, Xiaohui
    Liang, Xiaodan
    Wu, Ying
    COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 415 - 432
  • [28] Deep multi-task learning for image/video distortions identification
    Zoubida Ameur
    Sid Ahmed Fezza
    Wassim Hamidouche
    Neural Computing and Applications, 2022, 34 : 21607 - 21623
  • [29] MEDIC: a multi-task learning dataset for disaster image classification
    Firoj Alam
    Tanvirul Alam
    Md. Arid Hasan
    Abul Hasnat
    Muhammad Imran
    Ferda Ofli
    Neural Computing and Applications, 2023, 35 : 2609 - 2632
  • [30] Dependent Multi-Task Learning with Causal Intervention for Image Captioning
    Chen, Wenqing
    Tian, Jidong
    Fan, Caoyun
    He, Hao
    Jin, Yaohui
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2263 - 2270