Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework

被引:0
|
作者
Li, Li-Jia [1 ]
Socher, Richard [1 ]
Li Fei-Fei [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
来源
CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4 | 2009年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given an image, we propose a hierarchical generative model that classifies the overall scene, recognizes and segments each object component, as well as annotates the image with a list of tags. To our knowledge, this is the first model that performs all three tasks in one coherent framework. For instance, a scene of a 'polo game' consists of several visual objects such as 'human', 'horse', 'grass', etc. In addition, it can be further annotated with a list of more abstract (e.g. 'dusk') or visually less salient (e.g. 'saddle') tags. Our generative model jointly explains images through a visual model and a textual model. Visually relevant objects are represented by regions and patches, while visually irrelevant textual annotations are influenced directly by the overall scene class. Vile propose a fully automatic learning framework that is able to learn robust scene models from noisy web data such as images and user tags from Flickr.com. We demonstrate the effectiveness of our framework by automatically classifying, annotating and segmenting images from eight classes depicting sport scenes. In all three tasks, our model significantly outperforms state-of-the-art algorithms.
引用
收藏
页码:2036 / 2043
页数:8
相关论文
共 50 条
  • [41] A HYBRID HIERARCHICAL FRAMEWORK FOR AUTOMATIC IMAGE ANNOTATION
    Cai, Yuan-Yuan
    Mu, Zhi-Chun
    Ren, Yan-Fei
    Xu, Guo-Qing
    PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014, : 30 - 36
  • [42] A Framework for Evaluating Automatic Image Annotation Algorithms
    Athanasakos, Konstantinos
    Stathopoulos, Vassilios
    Jose, Joemon M.
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2010, 5993 : 217 - 228
  • [43] Automatic Image Annotation Using Multiple Grid Segmentation
    Arellano, Gerardo
    Enrique Sucar, Luis
    Morales, Eduardo F.
    ADVANCES IN ARTIFICIAL INTELLIGENCE, MICAI 2010, PT I, 2010, 6437 : 278 - 289
  • [44] Automatic Semantic Segmentation and Annotation of MOOC Lecture Videos
    Das, Ananda
    Das, Partha Pratim
    DIGITAL LIBRARIES AT THE CROSSROADS OF DIGITAL INFORMATION FOR THE FUTURE, ICADL 2019, 2019, 11853 : 181 - 188
  • [45] The method of Web image annotation classification automatic
    Zheng Xin
    Cai Aiping
    ENGINEERING SOLUTIONS FOR MANUFACTURING PROCESSES IV, PTS 1 AND 2, 2014, 889-890 : 1323 - 1326
  • [46] SEMANTIC ANNOTATION TO SUPPORT AUTOMATIC TAXONOMY CLASSIFICATION
    Kim, S.
    Bracewell, R. H.
    Ahmed, S.
    Wallace, K. M.
    9TH INTERNATIONAL DESIGN CONFERENCE - DESIGN 2006, VOLS 1 AND 2, 2006, (36): : 1171 - +
  • [47] Weakly-supervised region annotation for understanding scene images
    Hao Wang
    Tong Lu
    Yiming Wang
    Palaiahnakote Shivakumara
    Chew Lim Tan
    Multimedia Tools and Applications, 2016, 75 : 3027 - 3051
  • [48] Weakly-supervised region annotation for understanding scene images
    Wang, Hao
    Lu, Tong
    Wang, Yiming
    Shivakumara, Palaiahnakote
    Tan, Chew Lim
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (06) : 3027 - 3051
  • [49] Flexible and Scalable Annotation Tool to Develop Scene Understanding Datasets
    Elahi, Md Fazle
    Tian, Renran
    Luo, Xiao
    WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, HILDA 2022, 2022,
  • [50] OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
    Liu, Ye
    Qiao, Lingfeng
    Yin, Di
    Jiang, Zhuoxuan
    Jiang, Xinghua
    Jiang, Deqiang
    Ren, Bo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6269 - 6277