Discovery of collocation patterns: from visual words to visual phrases

被引:82
|
作者
Yuan, Junsong [1 ]
Wu, Ying [1 ]
Yang, Ming [1 ]
机构
[1] Northwestern Univ, Dept EECS, 2145 Sheridan Rd, Evanston, IL 60208 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2007.383222
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
引用
收藏
页码:1930 / +
页数:2
相关论文
共 50 条
  • [21] Mining Visual Collocation Patterns via Self-Supervised Subspace Learning
    Yuan, Junsong
    Wu, Ying
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (02): : 334 - 346
  • [22] VisKE: Visual Knowledge Extraction and Question Answering by Visual Verification of Relation Phrases
    Sadeghi, Fereshteh
    Divvala, Santosh K.
    Farhad, Ali
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 1456 - 1464
  • [23] Probing Fundamental Visual Comprehend Capabilities on Vision Language Models via Visual Phrases from Structural Data
    Xie, Peijin
    Liu, Bingquan
    COGNITIVE COMPUTATION, 2024, 16 (06) : 3484 - 3504
  • [24] Image Retrieval with Scale Invariant Visual Phrases
    Feng, Deying
    Yang, Jie
    Yang, Cheng
    Liu, Congxin
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (05) : 1063 - 1067
  • [25] Bayes pooling of visual phrases for object retrieval
    Jiang, Wenhui
    Zhao, Zhicheng
    Su, Fei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 9095 - 9119
  • [26] MEASURING CONCEPTUAL RELATION OF VISUAL WORDS FOR VISUAL CATEGORIZATION
    Li, Teng
    Kweon, In-So
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 2057 - 2060
  • [27] THE ROLE OF SPATIAL-FREQUENCY AND VISUAL DETAIL IN THE RECOGNITION OF PATTERNS AND WORDS
    THEIOS, J
    AMRHEIN, PC
    CURRENT ISSUES IN COGNITIVE PROCESSES: THE TULANE FLOWEREE SYMPOSIUM ON COGNITION, 1989, : 389 - 409
  • [28] Visual words for automated visual inspection of bulk materials
    Richter, Matthias
    Laengle, Thomas
    Beyerer, Juergen
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 210 - 213
  • [29] Visual laterality patterns for the perception of emotional words in alcoholic and aging individuals
    Hutner, N
    OscarBerman, M
    JOURNAL OF STUDIES ON ALCOHOL, 1996, 57 (02): : 144 - 154
  • [30] Bayes pooling of visual phrases for object retrieval
    Wenhui Jiang
    Zhicheng Zhao
    Fei Su
    Multimedia Tools and Applications, 2016, 75 : 9095 - 9119