Classification and automatic annotation extension of images using a Bayesian network

被引:0
|
作者
Barrat, Sabine [1 ]
Tabbone, Salvatore [1 ]
机构
[1] Univ Nancy, LORIA UMR7503, F-54506 Vandoeuvre Les Nancy, France
关键词
Probabilistic graphical models; Bayesian networks; variable selection; image classification; image annotation; SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid growth of Internet and multimedia information has shown a need in the development of multimedia information retrieval techniques, especially in image retrieval. We can distinguish two main trends. The first one, called "text-based image retrieval", consists in applying text-retrieval techniques from fully annotated images. The text describes high-level concepts but this technique presents some drawbacks: it requires a tedious work of annotation. Moreover, annotations could be ambiguous because two users can use different keywords to describe a same image. Consequently some approaches have proposed to useWordnet in order to reduce these potential ambiguities. The second approach, called "content-based image retrieval" is a younger field. These methods rely on visual features (color, texture or shape) computed automatically, and retrieve images using a similarity measure. However, the obtained performances are not really acceptable, except in the case of well-focused corpus. In order to improve the recognition, a solution consists in combining visual and semantic information. In many vision problems, instead of having fully annotated training data, it is easier to obtain just a subset of data with annotations, because it is less restrictive for the user. This paper deals with modeling, classifying, and annotating weakly annotated images. More precisely, we propose a scheme for image classification optimization, using a joint visual-text clustering approach and automatically extending image annotations. The proposed approach is derived from the probabilistic graphical model theory and dedicated for both tasks of weakly-annotated image classification and annotation. We consider an image as weakly annotated if the number of keywords defined for it is less than the maximum defined in the ground truth. Thanks to their ability to manage missing values, a probabilistic graphical model has been proposed to represent weakly annotated images. We propose a probabilistic graphical model based on a Gaussian-Mixtures and Multinomial mixture. The visual features are estimated by the Gaussian mixtures and the keywords by a Multinomial distribution. Therefore, the proposed model does not require that all images be annotated: when an image is weakly annotated, the missing keywords are considered as missing values. Besides, our model can automatically extend existing annotations to weakly-annotated images, without user intervention. The uncertainty around the association between a set of keywords and an image is tackled by a joint probability distribution (defined from Gaussian-Mixtures and Multinomial mixture) over the dictionary of keywords and the visual features extracted from our collection of images. Moreover, in order to solve the dimensionality problem due to the large dimensions of visual features, we have adapted a variable selection method. Results of visual-textual classification, reported on a database of images collected from the Web, partially and manually annotated, show an improvement of about 32.3% in terms of recognition rate against only visual information classification. Besides the automatic annotation extension with our model for images with missing keywords outperforms the visual-textual classification of about 6.8%. Finally the proposed method is experimentally competitive with the state-of-art classifiers.
引用
收藏
页码:339 / 352
页数:14
相关论文
共 50 条
  • [1] Classification and Automatic Annotation Extension of Images Using Bayesian Network
    Barrat, Sabine
    Tabbone, Salvatore
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2008, 5342 : 937 - 946
  • [2] Automatic Images Annotation Extension Using a Probabilistic Graphical Model
    Bouzaieni, Abdessalem
    Tabbone, Salvatore
    Barrat, Sabine
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 579 - 590
  • [3] Automatic Annotation Extension and Classification of Documents Using a Probabilistic Graphical Model
    Bouzaieni, Abdessalem
    Barrat, Sabine
    Tabbone, Salvatore
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 316 - 320
  • [4] Automatic Content Description and Annotation of Sport Images using Classification Techniques
    Hatem, Yomna
    Rady, Sherine
    Ismail, Rasha
    Bahnasy, Khaled
    INTERNATIONAL CONFERENCE ON INFORMATICS AND SYSTEMS (INFOS 2016), 2016, : 88 - 94
  • [5] Automatic video annotation using Bayesian inference
    Wang, Fangshi
    Xu, De
    Lu, Wei
    Wu, Weixin
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1468 - +
  • [6] Automatic Annotation Algorithm of Medical Radiological Images using Convolutional Neural Network
    Li, Xiaofeng
    Wang, Yanwei
    Cai, Yingjie
    PATTERN RECOGNITION LETTERS, 2021, 152 : 158 - 165
  • [7] Automatic Annotation Algorithm of Medical Radiological Images using Convolutional Neural Network
    Li, Xiaofeng
    Wang, Yanwei
    Cai, Yingjie
    Wang, Yanwei (xianxinyue@163.com), 1600, Elsevier B.V. (152): : 158 - 165
  • [8] Automatic classification of heartbeats using neural network classifier based on a Bayesian framework
    Karraz, G.
    Magenes, G.
    2006 28TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-15, 2006, : 3921 - +
  • [9] Bayesian Framework for Automatic Image Annotation Using Visual Keywords
    Agrawal, Rajeev
    Wu, Changhua
    Grosky, William
    Fotouhi, Farshad
    UBIQUITOUS COMPUTING AND MULTIMEDIA APPLICATIONS, 2010, 75 : 142 - +
  • [10] Automatic image annotation using adaptive color classification
    Saber, E
    Tekalp, AM
    Eschbach, R
    Knox, K
    GRAPHICAL MODELS AND IMAGE PROCESSING, 1996, 58 (02): : 115 - 126