Unsupervised Learning of Dictionaries of Hierarchical Compositional Models

Cited by: 7
Authors
Dai, Jifeng [1, 4]
Hong, Yi [2 ]
Hu, Wenze [3 ]
Zhu, Song-Chun [4 ]
Wu, Ying Nian [4 ]
Affiliations
[1] Tsinghua Univ, Beijing, Peoples R China
[2] WalmartLab, Bentonville, AR USA
[3] Google Inc, Mountain View, CA USA
[4] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
Source
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014
Keywords
RECOGNITION;
DOI
10.1109/CVPR.2014.321
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes an unsupervised method for learning dictionaries of hierarchical compositional models for representing natural images. Each model is a template that consists of a small group of part templates that are allowed to shift their locations and orientations relative to each other, and each part template is in turn a composition of Gabor wavelets that are also allowed to shift their locations and orientations relative to each other. Given a set of unannotated training images, a dictionary of such hierarchical templates is learned so that each training image can be represented by a small number of templates that are spatially translated, rotated and scaled versions of the templates in the learned dictionary. The learning algorithm iterates between two steps: (1) image encoding by a template matching pursuit process that involves a bottom-up template matching sub-process and a top-down template localization sub-process, and (2) dictionary re-learning by a shared matching pursuit process. Experimental results show that the proposed approach is capable of learning meaningful templates, and that the learned templates are useful for tasks such as domain adaptation and image cosegmentation.
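Note: the encoding step in the abstract builds on matching pursuit. The sketch below shows only the generic greedy matching pursuit idea on a flat vector, assuming a dictionary of unit-norm atoms stored as rows. It is not the authors' template matching pursuit: it has no Gabor wavelets, no part hierarchy, and no shifts in location or orientation, and the function name `matching_pursuit` and its parameters are hypothetical.

```python
import numpy as np


def matching_pursuit(signal, dictionary, n_atoms):
    """Generic greedy matching pursuit (illustrative only).

    signal:     1-D array of length dim.
    dictionary: 2-D array of shape (n_total_atoms, dim); rows are assumed
                to be unit-norm atoms.
    n_atoms:    number of greedy selection steps to run.

    Returns a list of (atom_index, coefficient) pairs and the final residual.
    """
    residual = np.asarray(signal, dtype=float).copy()
    selected = []
    for _ in range(n_atoms):
        # Response of every atom to the current residual.
        responses = dictionary @ residual
        # Pick the atom with the largest absolute response.
        k = int(np.argmax(np.abs(responses)))
        coef = float(responses[k])
        # Explain away the selected atom's contribution.
        residual = residual - coef * dictionary[k]
        selected.append((k, coef))
    return selected, residual


if __name__ == "__main__":
    # Toy example with an orthonormal dictionary (identity matrix).
    atoms = np.eye(4)
    x = np.array([0.0, 3.0, 0.0, -2.0])
    picks, res = matching_pursuit(x, atoms, n_atoms=2)
    print(picks, np.linalg.norm(res))
```

With unit-norm atoms, each step can only shrink or preserve the residual norm, which is why a few well-matched atoms can represent a signal compactly. The paper's method applies this kind of selection to deformable hierarchical templates rather than fixed vectors.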
Pages: 2505 - 2512
Number of pages: 8