Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework

被引:0
|
作者
Li, Li-Jia [1 ]
Socher, Richard [1 ]
Li Fei-Fei [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
来源
CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4 | 2009年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given an image, we propose a hierarchical generative model that classifies the overall scene, recognizes and segments each object component, as well as annotates the image with a list of tags. To our knowledge, this is the first model that performs all three tasks in one coherent framework. For instance, a scene of a 'polo game' consists of several visual objects such as 'human', 'horse', 'grass', etc. In addition, it can be further annotated with a list of more abstract (e.g. 'dusk') or visually less salient (e.g. 'saddle') tags. Our generative model jointly explains images through a visual model and a textual model. Visually relevant objects are represented by regions and patches, while visually irrelevant textual annotations are influenced directly by the overall scene class. Vile propose a fully automatic learning framework that is able to learn robust scene models from noisy web data such as images and user tags from Flickr.com. We demonstrate the effectiveness of our framework by automatically classifying, annotating and segmenting images from eight classes depicting sport scenes. In all three tasks, our model significantly outperforms state-of-the-art algorithms.
引用
收藏
页码:2036 / 2043
页数:8
相关论文
共 50 条
  • [21] Automatic segmentation and annotation of audio archive documents
    Bohac, Marek
    Blavka, Karel
    2011 10TH INTERNATIONAL WORKSHOP ON ELECTRONICS, CONTROL, MEASUREMENT AND SIGNALS (ECMS), 2011, : 61 - 66
  • [22] Hierarchical classification for automatic image annotation
    Dept of Computer Science, UNC-Charlotte, Charlotte, NC 28223, United States
    Proc. Annu. Int. ACM SIGIR Conf. Res. Dev. Inf. Retr., 2007, (111-118):
  • [23] An automatic segmentation and classification framework for anti-nuclear antibody images
    Cheng, Chung-Chuan
    Hsieh, Tsu-Yi
    Taur, Jin-Shiuh
    Chen, Yung-Fu
    BIOMEDICAL ENGINEERING ONLINE, 2013, 12
  • [24] Automatic Annotation for Semantic Segmentation in Indoor Scenes
    Reza, Md Alimoor
    Naik, Akshay U.
    Chen, Kai
    Crandall, David J.
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4970 - 4976
  • [25] AUTOMATIC IMAGE REGION ANNOTATION THROUGH SEGMENTATION BASED VISUAL SEMANTIC ANALYSIS AND DISCRIMINATIVE CLASSIFICATION
    Zhang, Jing
    Gao, Yangwei
    Feng, Shengwei
    Yuan, Yubo
    Lee, Chin-Hui
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1956 - 1960
  • [26] Piecewise planar segmentation for automatic scene modeling
    Bartoli, A
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2001, : 283 - 289
  • [27] Indoor Scene Classification through Dual-Stream Deep Learning: A Framework for Improved Scene Understanding in Robotics
    Khan, Sultan Daud
    Othman, Kamal M.
    COMPUTERS, 2024, 13 (05)
  • [28] Automatic interchange in scene colors by image segmentation
    Kotera, H
    Horiuchi, T
    12TH COLOR IMAGING CONFERENCE: COLOR SCIENCE AND ENGINEERING SYSTEMS, TECHNOLOGIES, APPLICATIONS, 2004, : 93 - 99
  • [29] A Comparative Study of Video Annotation Tools for Scene Understanding Yet (not) another Annotation Tool
    Kletz, Sabrina
    Leibetseder, Andreas
    Schoeffmann, Klaus
    PROCEEDINGS OF THE 10TH ACM MULTIMEDIA SYSTEMS CONFERENCE (ACM MMSYS'19), 2019, : 133 - 144
  • [30] Automatic segmentation and annotation in radiology [Automatisierte Segmentierung und Annotation in der Radiologie]
    Dankerl P.
    Cavallaro A.
    Uder M.
    Hammon M.
    Der Radiologe, 2014, 54 (3): : 265 - 270