Recognition Using Visual Phrases

被引:0
|
作者
Sadeghi, Mohammad Amin [1 ]
Farhadi, Ali [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce visual phrases, complex visual composites like "a person riding a horse". Visual phrases often display significantly reduced visual complexity compared to their component objects, because the appearance of those objects can change profoundly when they participate in relations. We introduce a dataset suitable for phrasal recognition that uses familiar PASCAL object categories, and demonstrate significant experimental gains resulting from exploiting visual phrases. We show that a visual phrase detector significantly outperforms a baseline which detects component objects and reasons about relations, even though visual phrase training sets tend to be smaller than those for objects. We argue that any multi-class detection system must decode detector outputs to produce final results; this is usually done with non-maximum suppression. We describe a novel decoding procedure that can account accurately for local context without solving difficult inference problems. We show this decoding procedure outperforms the state of the art. Finally, we show that decoding a combination of phrasal and object detectors produces real improvements in detector results.
引用
收藏
页码:1745 / 1752
页数:8
相关论文
共 50 条
  • [21] Discovery of collocation patterns: from visual words to visual phrases
    Yuan, Junsong
    Wu, Ying
    Yang, Ming
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 1930 - +
  • [22] Image Retrieval Using Multiple Orders of Geometry-Preserving Visual Phrases
    Wang, Fangyuan
    Zhang, Shuwu
    Li, Heping
    Zhang, Naiguang
    PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2012, : 59 - 63
  • [23] Mining Visual Phrases for Long-Term Visual SLAM
    Kanji, Tanaka
    Yuuto, Chokushi
    Masatoshi, Ando
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 136 - 142
  • [24] Sign Language Recognition of Selected Filipino Phrases Using LSTM Neural Network
    Montefalcon, Myron Darrel
    Padilla, Jay Rhald
    Rodriguez, Ramon
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 4, 2023, 465 : 633 - 641
  • [25] Image Retrieval with Scale Invariant Visual Phrases
    Feng, Deying
    Yang, Jie
    Yang, Cheng
    Liu, Congxin
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (05) : 1063 - 1067
  • [26] Bayes pooling of visual phrases for object retrieval
    Jiang, Wenhui
    Zhao, Zhicheng
    Su, Fei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 9095 - 9119
  • [27] Landmark recognition using visual cues
    Elkins, L
    ROBOTIC AND SEMI-ROBOTIC GROUND VEHICLE TECHNOLOGY, 1998, 3366 : 155 - 164
  • [28] Spatio-Temporal Phrases for Activity Recognition
    Zhang, Yimeng
    Liu, Xiaoming
    Chang, Ming-Ching
    Ge, Weina
    Chen, Tsuhan
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7574 : 707 - 721
  • [29] Bayes pooling of visual phrases for object retrieval
    Wenhui Jiang
    Zhicheng Zhao
    Fei Su
    Multimedia Tools and Applications, 2016, 75 : 9095 - 9119
  • [30] FALSE RECOGNITION OF ADJECTIVE-NOUN PHRASES
    ANISFELD, M
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1970, 86 (01): : 120 - &