Recognition Using Visual Phrases

被引:0
|
作者
Sadeghi, Mohammad Amin [1 ]
Farhadi, Ali [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce visual phrases, complex visual composites like "a person riding a horse". Visual phrases often display significantly reduced visual complexity compared to their component objects, because the appearance of those objects can change profoundly when they participate in relations. We introduce a dataset suitable for phrasal recognition that uses familiar PASCAL object categories, and demonstrate significant experimental gains resulting from exploiting visual phrases. We show that a visual phrase detector significantly outperforms a baseline which detects component objects and reasons about relations, even though visual phrase training sets tend to be smaller than those for objects. We argue that any multi-class detection system must decode detector outputs to produce final results; this is usually done with non-maximum suppression. We describe a novel decoding procedure that can account accurately for local context without solving difficult inference problems. We show this decoding procedure outperforms the state of the art. Finally, we show that decoding a combination of phrasal and object detectors produces real improvements in detector results.
引用
收藏
页码:1745 / 1752
页数:8
相关论文
共 50 条
  • [41] An image classification method based on PLSA and visual phrases
    Zhang, Yong
    Yang, Hao
    2016 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA & SMART CITY (ICITBS), 2017, : 59 - 62
  • [42] Image Retrieval with Geometry-Preserving Visual Phrases
    Zhang, Yimeng
    Jia, Zhaoyin
    Chen, Tsuhan
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 809 - 816
  • [43] Recognition of definitions associated to predicative phrases in definitional contexts
    Aguilar, Cesar
    Sierra, Gerardo
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (43): : 151 - 158
  • [44] Dialogue Act Recognition Using Visual Information
    Martinek, Jiri
    Kral, Pavel
    Lenc, Ladislav
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 793 - 807
  • [45] Using the visual component in automatic speech recognition
    Brooke, NM
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1656 - 1659
  • [46] Visual Recognition using Mappings that Replicate Margins
    Wolf, Lior
    Manor, Nathan
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 810 - 816
  • [47] Qualitative landmark recognition using visual cues
    Poovendran, R
    Speigle, S
    Srinivasan, S
    Raghavan, S
    Chellappa, R
    NAVIGATION AND CONTROL TECHNOLOGIES FOR UNMANNED SYSTEMS II, 1997, 3087 : 74 - 83
  • [48] Qualitative landmark recognition using visual cues
    Srinivasan, S
    Kanal, L
    PATTERN RECOGNITION LETTERS, 1997, 18 (11-13) : 1405 - 1414
  • [49] Activity recognition using visual tracking and RFID
    Krahnstoever, N
    Rittscher, J
    Tu, P
    Chean, K
    Tomlinson, T
    WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 494 - 500
  • [50] Visual Recognition Using Local Quantized Patterns
    ul Hussain, Sibt
    Triggs, Bill
    COMPUTER VISION - ECCV 2012, PT II, 2012, 7573 : 716 - 729