Automatic Image Annotation by Sequentially Learning From Multi-Level Semantic Neighborhoods

被引：3

作者：

Li, Houjie ^{[1
]}

Li, Wei ^{[2
,3
]}

Zhang, Hongda ^{[2
]}

He, Xin ^{[1
]}

Zheng, Mingxiao ^{[2
]}

Song, Haiyu ^{[2
,3
]}

机构：

[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Dalian 116600, Peoples R China

[2] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian 116600, Peoples R China

[3] Jilin Univ, Coll Traff, Changchun 130022, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

基金：

美国国家科学基金会;

关键词：

Image annotation; Semantics; Visualization; Convolutional neural networks; Training; Task analysis; Deep learning; Automatic image annotation; semantic gap; nearest neighbor; weak-labeling;

D O I：

10.1109/ACCESS.2021.3117349

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Automatic image annotation is a key technology in image understanding and pattern recognition, and is becoming increasingly important in order to annotate large-scale images. In the past decade, the nearest neighbor model-based AIA (Automatic image annotation) methods have been proved to be the most successful in all classical models. This model has four major challenges including semantic gap, label-imbalance, wider range labels, and weak-labeling. In this paper, we propose a novel annotation model based on three-pass KNN (k-Nearest Neighbor) to address the aforementioned challenges. The key idea is to identify appropriate neighbors at each pass KNN. In the first pass KNN, we identify the several most relevant categories based on label feature rather than visual feature as traditional models. In the second pass KNN, we determine the relevant images based on multi-modal (visual and textual label) embedding features. As the test image has not been annotated with any label, we propose a pre-annotation strategory before image annotation to improve the semantic level. In the third pass KNN, we capture relevant labels from semantically and visually similar images and propagate them to the given unlabeled image. In contrast with traditional nearest neighbor based methods, our method can inherently alleviate the problems of semantic gap, label-imbalance, and wider range labels. In addition, to alleviate the issue of weak-labeling, we propose label refinement for training images. Extensive experiments on three classical benchmark datasets and MS-COCO demonstrate that the proposed method significantly outperforms the state-of-the-art in terms of per-label and per-image metrics.

引用

页码：135742 / 135754

页数：13

共 50 条

[1] An ontology-based multi-level semantic representation model for learning objects annotation
Rezgui, Kalthoum
Mhiri, Hedia
Ghedira, Khaled
2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2017, : 1391 - 1398
[2] Learning Semantic Concepts from Noisy Media Collection for Automatic Image Annotation
TIAN Feng
SHEN Xukun
Chinese Journal of Electronics, 2015, 24 (04) : 790 - 794
[3] Learning Semantic Concepts from Noisy Media Collection for Automatic Image Annotation
Tian Feng
Shen Xukun
CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (04) : 790 - 794
[4] Image Semantic Description and Automatic Semantic Annotation
Liang Meiyu
Du Junping
Jia Yingmin
Sun Zengqi
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 1192 - 1195
[5] Multi-level medical image semantic modeling approach based on statistical learning
Lin, Chun-Yi
Yin, Jun-Xun
Gao, Xue
Chen, Jian-Yu
Sun, Shao-Hui
Shenzhen Daxue Xuebao (Ligong Ban)/Journal of Shenzhen University Science and Engineering, 2007, 24 (02): : 138 - 143
[6] CASCADE OF MULTI-LEVEL MULTI-INSTANCE CLASSIFIERS FOR IMAGE ANNOTATION
Cam-Tu Nguyen
Ha Vu Le
Tokuyama, Takeshi
KDIR 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND INFORMATION RETRIEVAL, 2011, : 14 - 23
[7] Automatic image annotation for semantic image retrieval
Shao, Wenbin
Naghdy, Golshah
Phung, Son Lam
ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 369 - 378
[8] A semantic approach for automatic image annotation
Oujaoura, Mustapha
Minaouf, Brahim
Fakir, Mohammed
2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
[9] Effective image annotation for search using multi-level semantics
Cheng, PJ
Chien, LF
DIGITAL LIBRARIES: TECHNOLOGY AND MANAGEMENT OF INDIGENOUS KNOWLEDGE FOR GLOBAL ACCESS, 2003, 2911 : 230 - 242
[10] Multi-level Semantic Binary Descriptor for Image Retrieval
Wu Z.-B.
Yu J.-Q.
He Y.-F.
Guan T.
Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (09): : 1641 - 1655

← 1 2 3 4 5 →