Mid-Level Concept Learning with Visual Contextual Ontologies and Probabilistic Inference for Image Annotation

被引:0
|
作者
Liu, Yuee [1 ]
Zhang, Jinglan [1 ]
Tjondronegoro, Dian [1 ]
Geva, Shlomo [1 ]
Li, Zhengrong [1 ]
机构
[1] Queensland Univ Technol, Fac Sci & Technol, Brisbane, Qld 4001, Australia
关键词
Image Annotation; Salient Objects; Visual Context; Ontology; Probabilistic Inference; multi-level concept;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To date, automatic recognition of semantic information such as salient objects and mid-level concepts from images is a challenging task. Since real-world objects tend to exist in a context within their environment, the computer vision researchers have increasingly incorporated contextual information for improving object recognition. In this paper, we present a method to build a visual contextual ontology from salient objects descriptions for image annotation. The ontologies include not only partOf/kindOf relations, but also spatial and co-occurrence relations. A two-step image annotation algorithm is also proposed based on ontology relations and probabilistic inference. Different from most of the existing work, we exploit how to combine representation of ontology, contextual knowledge and probabilistic inference. The experiments in the LabelMe dataset show that image annotation results are improved using contextual knowledge.
引用
收藏
页码:229 / 239
页数:11
相关论文
共 50 条
  • [41] The Magic Number 2 ± 1: CapacityLimited Inference in Mid-Level Perceptual Properties
    Tyler, Christopher
    PERCEPTION, 2019, 48 : 92 - 92
  • [42] PatchGame: Learning to Signal Mid-level Patches in Referential Games
    Gupta, Kamal
    Somepalli, Gowthami
    Gupta, Anubhav
    Jayasundara, Vinoj
    Zwicker, Matthias
    Shrivastava, Abhinav
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [43] Learning to Combine Mid-level Cues for Object Proposal Generation
    Lee, Tom
    Fidler, Sanja
    Dickinson, Sven
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1680 - 1688
  • [44] VisualHashtags: Visual Summarization of Social Media Events Using Mid-Level Visual Elements
    Goel, Sonal
    Ahuja, Sarthak
    Subramanyam, A. V.
    Kumaraguru, Ponnurangam
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1434 - 1442
  • [45] Improving Streaming Video Segmentation with Early and Mid-Level Visual Processing
    Tripathi, Subarna
    Hwang, Youngbae
    Belongie, Serge
    Truong Nguyen
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 477 - 484
  • [46] Ensemble representation of animacy could be based on mid-level visual features
    Tiurina, Natalia A.
    Markov, Yuri A.
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2025, 87 (02) : 415 - 430
  • [47] Crowd Behavior Analysis Using Local Mid-Level Visual Descriptors
    Fradi, Hajer
    Luvison, Bertrand
    Quoc Cuong Pham
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (03) : 589 - 602
  • [48] Learning Mid-level Filters for Person Re-identification
    Zhao, Rui
    Ouyang, Wanli
    Wang, Xiaogang
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 144 - 151
  • [49] SAR IMAGE CLASSIFICATION BASED ON THE MULTI-LAYER NETWORK AND TRANSFER LEARNING OF MID-LEVEL REPRESENTATIONS
    Kang, Chenyao
    He, Chu
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 1146 - 1149
  • [50] MID-LEVEL FEATURE BASED LOCAL DESCRIPTOR SELECTION FOR IMAGE SEARCH
    Bucak, Serhat
    Saxena, Ankur
    Nagar, Abhishek
    Fernandes, Felix
    Bhat, Kong-Posh
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,