Adopting Abstract Images for Semantic Scene Understanding

被引:39
|
作者
Zitnick, C. Lawrence [1 ]
Vedantam, Ramakrishna [2 ]
Parikh, Devi [2 ]
机构
[1] Microsoft Res, Interact Visual Media Grp, One Microsoft Way, Redmond, WA USA
[2] Virginia Tech, Dept Elect & Comp Engn, Blacksburg, VA USA
基金
美国国家科学基金会;
关键词
Semantic scene understanding; linguistic meaning; saliency; memorability; abstract images; OBJECTS; MEMORY; MODEL; PREDICT;
D O I
10.1109/TPAMI.2014.2366143
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relating visual information to its linguistic semantic meaning remains an open and challenging area of research. The semantic meaning of images depends on the presence of objects, their attributes and their relations to other objects. But precisely characterizing this dependence requires extracting complex visual information from an image, which is in general a difficult and yet unsolved problem. In this paper, we propose studying semantic information in abstract images created from collections of clip art. Abstract images provide several advantages over real images. They allow for the direct study of how to infer high-level semantic information, since they remove the reliance on noisy low-level object, attribute and relation detectors, or the tedious hand-labeling of real images. Importantly, abstract images also allow the ability to generate sets of semantically similar scenes. Finding analogous sets of real images that are semantically similar would be nearly impossible. We create 1,002 sets of 10 semantically similar abstract images with corresponding written descriptions. We thoroughly analyze this dataset to discover semantically important features, the relations of words to visual features and methods for measuring semantic similarity. Finally, we study the relation between the saliency and memorability of objects and their semantic importance.
引用
收藏
页码:627 / 638
页数:12
相关论文
共 50 条
  • [1] Representation Learning for Semantic Scene Understanding
    Farshad, Azade
    HHAI 2023: AUGMENTING HUMAN INTELLECT, 2023, 368 : 445 - 458
  • [2] Rainy Night Scene Understanding With Near Scene Semantic Adaptation
    Di, Shuai
    Feng, Qi
    Li, Chun-Guang
    Zhang, Mei
    Zhang, Honggang
    Elezovikj, Semir
    Tan, Chiu C.
    Ling, Haibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) : 1594 - 1602
  • [3] Latent Semantic Analysis for Classifying Scene Images
    Lee, Chu-Hui
    Chiang, Kun-Cheng
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 1467 - 1470
  • [4] Indoor Scene Understanding with Geometric and Semantic Contexts
    Choi, Wongun
    Chao, Yu-Wei
    Pantofaru, Caroline
    Savarese, Silvio
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 112 (02) : 204 - 220
  • [5] Semantic Foggy Scene Understanding with Synthetic Data
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (09) : 973 - 992
  • [6] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223
  • [7] Exploiting context for semantic scene content understanding
    Luo, Jiebo
    ICIS '06: INTERNATIONAL CONGRESS OF IMAGING SCIENCE, FINAL PROGRAM AND PROCEEDINGS: LINKING THE EXPLOSION OF IMAGING APPLICATIONS WITH THE SCIENCE AND TECHNOLOGY OF IMAGING, 2006, : 479 - 479
  • [8] Open Vocabulary Semantic Scene Sketch Understanding
    Bourouis, Ahmed
    Fan, Judith E.
    Gryaditskaya, Yulia
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 4176 - 4186
  • [9] Indoor Scene Understanding with Geometric and Semantic Contexts
    Wongun Choi
    Yu-Wei Chao
    Caroline Pantofaru
    Silvio Savarese
    International Journal of Computer Vision, 2015, 112 : 204 - 220
  • [10] Semantic Foggy Scene Understanding with Synthetic Data
    Christos Sakaridis
    Dengxin Dai
    Luc Van Gool
    International Journal of Computer Vision, 2018, 126 : 973 - 992