Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework

Cited by: 0

Authors:

Li, Li-Jia ^{[1]}

Socher, Richard ^{[1]}

Li Fei-Fei ^{[1]}

Affiliations:

[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA

Source:

CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4 | 2009

Funding:

National Science Foundation (USA);

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Given an image, we propose a hierarchical generative model that classifies the overall scene, recognizes and segments each object component, as well as annotates the image with a list of tags. To our knowledge, this is the first model that performs all three tasks in one coherent framework. For instance, a scene of a 'polo game' consists of several visual objects such as 'human', 'horse', 'grass', etc. In addition, it can be further annotated with a list of more abstract (e.g. 'dusk') or visually less salient (e.g. 'saddle') tags. Our generative model jointly explains images through a visual model and a textual model. Visually relevant objects are represented by regions and patches, while visually irrelevant textual annotations are influenced directly by the overall scene class. We propose a fully automatic learning framework that is able to learn robust scene models from noisy web data such as images and user tags from Flickr.com. We demonstrate the effectiveness of our framework by automatically classifying, annotating and segmenting images from eight classes depicting sport scenes. In all three tasks, our model significantly outperforms state-of-the-art algorithms.

Citation

Pages: 2036 - 2043

Page count: 8

50 items in total

[21] Automatic segmentation and annotation of audio archive documents
Bohac, Marek
Blavka, Karel
2011 10TH INTERNATIONAL WORKSHOP ON ELECTRONICS, CONTROL, MEASUREMENT AND SIGNALS (ECMS), 2011, : 61 - 66
[22] Hierarchical classification for automatic image annotation
Dept of Computer Science, UNC-Charlotte, Charlotte, NC 28223, United States
Proc. Annu. Int. ACM SIGIR Conf. Res. Dev. Inf. Retr., 2007, : 111 - 118
[23] An automatic segmentation and classification framework for anti-nuclear antibody images
Cheng, Chung-Chuan
Hsieh, Tsu-Yi
Taur, Jin-Shiuh
Chen, Yung-Fu
BIOMEDICAL ENGINEERING ONLINE, 2013, 12
[24] Automatic Annotation for Semantic Segmentation in Indoor Scenes
Reza, Md Alimoor
Naik, Akshay U.
Chen, Kai
Crandall, David J.
2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 4970 - 4976
[25] AUTOMATIC IMAGE REGION ANNOTATION THROUGH SEGMENTATION BASED VISUAL SEMANTIC ANALYSIS AND DISCRIMINATIVE CLASSIFICATION
Zhang, Jing
Gao, Yangwei
Feng, Shengwei
Yuan, Yubo
Lee, Chin-Hui
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 1956 - 1960
[26] Piecewise planar segmentation for automatic scene modeling
Bartoli, A
2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2001, : 283 - 289
[27] Indoor Scene Classification through Dual-Stream Deep Learning: A Framework for Improved Scene Understanding in Robotics
Khan, Sultan Daud
Othman, Kamal M.
COMPUTERS, 2024, 13 (05)
[28] Automatic interchange in scene colors by image segmentation
Kotera, H
Horiuchi, T
12TH COLOR IMAGING CONFERENCE: COLOR SCIENCE AND ENGINEERING SYSTEMS, TECHNOLOGIES, APPLICATIONS, 2004, : 93 - 99
[29] A Comparative Study of Video Annotation Tools for Scene Understanding Yet (not) another Annotation Tool
Kletz, Sabrina
Leibetseder, Andreas
Schoeffmann, Klaus
PROCEEDINGS OF THE 10TH ACM MULTIMEDIA SYSTEMS CONFERENCE (ACM MMSYS'19), 2019, : 133 - 144
[30] Automatic segmentation and annotation in radiology [Automatisierte Segmentierung und Annotation in der Radiologie]
Dankerl P.
Cavallaro A.
Uder M.
Hammon M.
Der Radiologe, 2014, 54 (3): 265 - 270
