Automatic Image Annotation by Sequentially Learning From Multi-Level Semantic Neighborhoods

被引:3
|
作者
Li, Houjie [1 ]
Li, Wei [2 ,3 ]
Zhang, Hongda [2 ]
He, Xin [1 ]
Zheng, Mingxiao [2 ]
Song, Haiyu [2 ,3 ]
机构
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Dalian 116600, Peoples R China
[2] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian 116600, Peoples R China
[3] Jilin Univ, Coll Traff, Changchun 130022, Peoples R China
基金
美国国家科学基金会;
关键词
Image annotation; Semantics; Visualization; Convolutional neural networks; Training; Task analysis; Deep learning; Automatic image annotation; semantic gap; nearest neighbor; weak-labeling;
D O I
10.1109/ACCESS.2021.3117349
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic image annotation is a key technology in image understanding and pattern recognition, and is becoming increasingly important in order to annotate large-scale images. In the past decade, the nearest neighbor model-based AIA (Automatic image annotation) methods have been proved to be the most successful in all classical models. This model has four major challenges including semantic gap, label-imbalance, wider range labels, and weak-labeling. In this paper, we propose a novel annotation model based on three-pass KNN (k-Nearest Neighbor) to address the aforementioned challenges. The key idea is to identify appropriate neighbors at each pass KNN. In the first pass KNN, we identify the several most relevant categories based on label feature rather than visual feature as traditional models. In the second pass KNN, we determine the relevant images based on multi-modal (visual and textual label) embedding features. As the test image has not been annotated with any label, we propose a pre-annotation strategory before image annotation to improve the semantic level. In the third pass KNN, we capture relevant labels from semantically and visually similar images and propagate them to the given unlabeled image. In contrast with traditional nearest neighbor based methods, our method can inherently alleviate the problems of semantic gap, label-imbalance, and wider range labels. In addition, to alleviate the issue of weak-labeling, we propose label refinement for training images. Extensive experiments on three classical benchmark datasets and MS-COCO demonstrate that the proposed method significantly outperforms the state-of-the-art in terms of per-label and per-image metrics.
引用
收藏
页码:135742 / 135754
页数:13
相关论文
共 50 条
  • [1] An ontology-based multi-level semantic representation model for learning objects annotation
    Rezgui, Kalthoum
    Mhiri, Hedia
    Ghedira, Khaled
    2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 1391 - 1398
  • [2] Learning Semantic Concepts from Noisy Media Collection for Automatic Image Annotation
    TIAN Feng
    SHEN Xukun
    Chinese Journal of Electronics, 2015, 24 (04) : 790 - 794
  • [3] Learning Semantic Concepts from Noisy Media Collection for Automatic Image Annotation
    Tian Feng
    Shen Xukun
    CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (04) : 790 - 794
  • [4] Image Semantic Description and Automatic Semantic Annotation
    Liang Meiyu
    Du Junping
    Jia Yingmin
    Sun Zengqi
    INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 1192 - 1195
  • [5] Multi-level medical image semantic modeling approach based on statistical learning
    Lin, Chun-Yi
    Yin, Jun-Xun
    Gao, Xue
    Chen, Jian-Yu
    Sun, Shao-Hui
    Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering, 2007, 24 (02): : 138 - 143
  • [6] CASCADE OF MULTI-LEVEL MULTI-INSTANCE CLASSIFIERS FOR IMAGE ANNOTATION
    Cam-Tu Nguyen
    Ha Vu Le
    Tokuyama, Takeshi
    KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 14 - 23
  • [7] Automatic image annotation for semantic image retrieval
    Shao, Wenbin
    Naghdy, Golshah
    Phung, Son Lam
    ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 369 - 378
  • [8] A semantic approach for automatic image annotation
    Oujaoura, Mustapha
    Minaouf, Brahim
    Fakir, Mohammed
    2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
  • [9] Effective image annotation for search using multi-level semantics
    Cheng, PJ
    Chien, LF
    DIGITAL LIBRARIES: TECHNOLOGY AND MANAGEMENT OF INDIGENOUS KNOWLEDGE FOR GLOBAL ACCESS, 2003, 2911 : 230 - 242
  • [10] Multi-level Semantic Binary Descriptor for Image Retrieval
    Wu Z.-B.
    Yu J.-Q.
    He Y.-F.
    Guan T.
    Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (09): : 1641 - 1655