Interpretation of complex scenes using dynamic tree-structure Bayesian networks

被引:7
|
作者
Todorovic, Sinisa [1 ]
Nechyba, Michael C.
机构
[1] Univ Illinois, Comp Vis & Robot Lab, Beckman Inst Adv Sci & Technol, Urbana, IL 61801 USA
[2] Pittsburgh Pattern Recognit Inc, Pittsburgh, PA 15222 USA
关键词
generative models; Bayesian networks; dynamic trees; variational inference; image segmentation; object recognition;
D O I
10.1016/j.cviu.2005.09.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of object detection and recognition in complex scenes, where objects are partially occluded. The approach presented herein is based on the hypothesis that a careful analysis of visible object details at various scales is critical for recognition in such settings. In general, however, computational complexity becomes prohibitive when trying to analyze multiple sub-parts of multiple objects in an image. To alleviate this problem, we propose a generative-model framework-namely, dynamic tree-structure belief networks (DTSBNs). This framework formulates object detection and recognition as inference of DTSBN structure and image-class conditional distributions, given an image. The causal (Markovian) dependencies in DTSBNs allow for design of computationally efficient inference, as well as for interpretation of the estimated structure as follows: each root represents a whole distinct object, while children nodes down the sub-tree represent parts of that object at various scales. Therefore, within the DTSBN framework, the treatment and recognition of object parts requires no additional training, but merely a particular interpretation of the tree/subtree structure. This property leads to a strategy for recognition of objects as a whole through recognition of their visible parts. Our experimental results demonstrate that this approach remarkably outperforms strategies without explicit analysis of object parts. (c) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:71 / 84
页数:14
相关论文
共 50 条
  • [41] Scalable Content-Based Music Retrieval Using Chord Progression Histogram and Tree-Structure LSH
    Yu, Yi
    Zimmermann, Roger
    Wang, Ye
    Oria, Vincent
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) : 1969 - 1981
  • [42] Dynamic Bayesian networks and discriminative classifiers for multi-stage semantic interpretation
    Lefevre, Fabrice
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 13 - 16
  • [43] Piecewise forecasting of nonlinear time series with model tree dynamic Bayesian networks
    Quesada, David
    Bielza, Concha
    Fontan, Pedro
    Larranaga, Pedro
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 9108 - 9137
  • [44] Estimating the structure of Bayesian networks using mixtures
    McMichael, DW
    FUSION'98: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MULTISOURCE-MULTISENSOR INFORMATION FUSION, VOLS 1 AND 2, 1998, : 109 - 115
  • [45] Hand gestures recognition using dynamic Bayesian networks
    Shiravandi, Somayeh
    Rahmati, Mohammad
    Mahmoudi, Fariborz
    2013 3RD JOINT CONFERENCE OF AI & ROBOTICS AND 5TH ROBOCUP IRAN OPEN INTERNATIONAL SYMPOSIUM (RIOS), 2013, : 19 - 24
  • [46] Articulatory feature recognition using dynamic Bayesian networks
    Frankel, Joe
    Wester, Mirjam
    King, Simon
    COMPUTER SPEECH AND LANGUAGE, 2007, 21 (04): : 620 - 640
  • [47] Gait Type Analysis Using Dynamic Bayesian Networks
    Kozlow, Patrick
    Abid, Noor
    Yanushkevich, Svetlana
    SENSORS, 2018, 18 (10)
  • [48] Recognition of degraded characters using dynamic Bayesian networks
    Likforman-Sulem, Laurence
    Sigelle, Marc
    PATTERN RECOGNITION, 2008, 41 (10) : 3092 - 3103
  • [49] Learning Dynamic Bayesian Networks using evolutionary MCMC
    Wang, Hao
    Yu, Kui
    Yao, Hongliang
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 45 - 50
  • [50] Discovering excitatory relationships using dynamic Bayesian networks
    Patnaik, Debprakash
    Laxman, Srivatsan
    Ramakrishnan, Naren
    KNOWLEDGE AND INFORMATION SYSTEMS, 2011, 29 (02) : 273 - 303