Discovery of collocation patterns: from visual words to visual phrases

被引:82
|
作者
Yuan, Junsong [1 ]
Wu, Ying [1 ]
Yang, Ming [1 ]
机构
[1] Northwestern Univ, Dept EECS, 2145 Sheridan Rd, Evanston, IL 60208 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2007.383222
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
引用
收藏
页码:1930 / +
页数:2
相关论文
共 50 条
  • [1] Generating Descriptive Visual Words and Visual Phrases for Large-Scale Image Applications
    Zhang, Shiliang
    Tian, Qi
    Hua, Gang
    Huang, Qingming
    Gao, Wen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (09) : 2664 - 2677
  • [2] SEMANTICALLY SIMILAR VISUAL WORDS DISCOVERY TO FACILITATE VISUAL INVARIANCE
    Chimlek, Sutasinee
    Kesorn, Kraisak
    Piamsa-nga, Punpiti
    Poslad, Stefan
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1242 - 1247
  • [3] Mining Visual Phrases for Visual Robot Localization
    Tanaka, Kanji
    Chokushi, Yuuto
    Ando, Masatoshi
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2016, 20 (01) : 57 - 65
  • [4] Recognition Using Visual Phrases
    Sadeghi, Mohammad Amin
    Farhadi, Ali
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1745 - 1752
  • [5] Visual Decoding of Phrases from Occipital Neuromagnetic Signals
    Dash, Debadatta
    Ferrari, Paul
    Borna, Amir
    Iivanainen, Joonas
    Schwindt, Peter D. D.
    Wang, Jun
    2021 10TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2021, : 230 - 233
  • [6] A MNEMONIC BASED ON ARRANGING WORDS ON VISUAL-PATTERNS
    BELLEZZA, FS
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1986, 78 (03) : 217 - 224
  • [7] EEG ASYMMETRY AND PERFORMANCE - VISUAL DETECTION OF WORDS AND PATTERNS
    REBERT, CS
    ELECTROENCEPHALOGRAPHY AND CLINICAL NEUROPHYSIOLOGY, 1976, 40 (03): : 327 - 327
  • [8] Mining Visual Phrases for Long-Term Visual SLAM
    Kanji, Tanaka
    Yuuto, Chokushi
    Masatoshi, Ando
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 136 - 142
  • [9] BoVW-CAM: Visual Explanation from Bag of Visual Words
    Barros da Silva, Arnaldo Vitor
    Alves Pereira, Luis Filipe
    INTELLIGENT SYSTEMS, PT II, 2022, 13654 : 45 - 55
  • [10] A robust CBIR framework in between bags of visual words and phrases models for specific image datasets
    Ouni, Achref
    Urruty, Thierry
    Visani, Muriel
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (20) : 26173 - 26189