Recognition Using Visual Phrases

被引:0
|
作者
Sadeghi, Mohammad Amin [1 ]
Farhadi, Ali [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce visual phrases, complex visual composites like "a person riding a horse". Visual phrases often display significantly reduced visual complexity compared to their component objects, because the appearance of those objects can change profoundly when they participate in relations. We introduce a dataset suitable for phrasal recognition that uses familiar PASCAL object categories, and demonstrate significant experimental gains resulting from exploiting visual phrases. We show that a visual phrase detector significantly outperforms a baseline which detects component objects and reasons about relations, even though visual phrase training sets tend to be smaller than those for objects. We argue that any multi-class detection system must decode detector outputs to produce final results; this is usually done with non-maximum suppression. We describe a novel decoding procedure that can account accurately for local context without solving difficult inference problems. We show this decoding procedure outperforms the state of the art. Finally, we show that decoding a combination of phrasal and object detectors produces real improvements in detector results.
引用
收藏
页码:1745 / 1752
页数:8
相关论文
共 50 条
  • [1] Affine Invariant Visual Phrases for Object Instance Recognition
    Patraucean, Viorica
    Ovsjanikov, Maks
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 14 - 17
  • [2] 3D Visual Phrases for Landmark Recognition
    Hao, Qiang
    Cai, Rui
    Li, Zhiwei
    Zhang, Lei
    Pang, Yanwei
    Wu, Feng
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3594 - 3601
  • [3] Image Retrieval using Visual Phrases
    Anwar, Benish
    Baber, Junaid
    Ahmed, Atiq
    Bakhtyar, Maheen
    Daudpota, Sher Muhammad
    Sanjrani, Anwar Ali
    Ullah, Ihsan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (03) : 476 - 480
  • [4] RECOGNITION OF SUB-PHRASES USING PARSERS
    CERCENADO, RJ
    GIL, JM
    CASACUBERTA, F
    REVISTA DE INFORMATICA Y AUTOMATICA, 1988, 21 (03): : 15 - 21
  • [5] MULTIPLE INSTANCE LEARNING USING VISUAL PHRASES FOR OBJECT CLASSIFICATION
    Song, Yan
    Tian, Qi
    Wang, Mengyue
    Liu, Heng
    Dai, Lirong
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 649 - 654
  • [6] Example-Based Speech Recognition using Formulaic Phrases
    Watkins, Christopher J.
    Cox, Stephen J.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 3027 - 3030
  • [7] Mining Visual Phrases for Visual Robot Localization
    Tanaka, Kanji
    Chokushi, Yuuto
    Ando, Masatoshi
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (01) : 57 - 65
  • [8] SPEECH RECOGNITION FOR JAPANESE PHRASES
    KAMIYA, S
    KIYAMA, J
    HAKARIDANI, M
    TANAKA, A
    SHARP TECHNICAL JOURNAL, 1991, (49): : 23 - 26
  • [9] RECOGNITION OF TRANSFORMED MUSICAL PHRASES
    COWAN, TM
    SCHOEN, L
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1987, 25 (05) : 339 - 339
  • [10] Extraction of Robust Visual Phrases Using Graph Mining for Image Retrieval
    Yeh, Jun-Bin
    Wu, Chung-Hsien
    2010 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, 2010, : 3681 - 3684