Probabilistic data fusion on a large document collection

被引:1
|
作者
Lillis, David [1 ]
Toolan, Fergus [2 ]
Collier, Rem [1 ]
Dunnion, John [1 ]
机构
[1] Univ Coll Dublin, Sch Comp Sci & Informat, Dublin 4, Ireland
[2] Griffith Coll Dublin, Fac Computing Sci, Dublin 8, Ireland
关键词
data fusion; information retrieval; ProbFuse;
D O I
10.1007/s10462-007-9037-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data fusion is the process of combining the output of a number of Information Retrieval (IR) algorithms into a single result set, to achieve greater retrieval performance. ProbFuse is a data fusion algorithm that uses the history of the underlying IR algorithms to estimate the probability that subsequent result sets include relevant documents in particular positions. It has been shown to out-perform CombMNZ, the standard data fusion algorithm against which to compare performance, in a number of previous experiments. This paper builds upon this previous work and applies probFuse to the much larger Web Track document collection from the 2004 Text REtreival Conference. The performance of probFuse is compared against that of CombMNZ using a number of evaluation measures and is shown to achieve substantial performance improvements.
引用
收藏
页码:23 / 34
页数:12
相关论文
共 50 条
  • [11] ProbMap-A probabilistic approach for mapping large document collections
    Hofmann, Thomas
    Intelligent Data Analysis, 2000, 4 (02) : 149 - 164
  • [12] Toward a Robust Data Fusion for Document Retrieval
    He, Daqing
    Dan Wu
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 338 - +
  • [13] A probabilistic chemical sensor model for data fusion
    Robins, P
    Raprey, V
    Thomas, P
    2005 7th International Conference on Information Fusion (FUSION), Vols 1 and 2, 2005, : 1116 - 1122
  • [14] Probabilistic graphical models and their application in data fusion
    Bottone, Steven
    Stanek, Clay
    AUTOMATIC TARGET RECOGNITION XVII, 2007, 6566
  • [15] A comparison of probabilistic representations for decentralised data fusion
    Ong, LL
    Ridley, M
    Upcroft, B
    Kumar, S
    Bailey, T
    Sukkarieh, S
    Durrant-Whyte, H
    PROCEEDINGS OF THE 2005 INTELLIGENT SENSORS, SENSOR NETWORKS & INFORMATION PROCESSING CONFERENCE, 2005, : 187 - 192
  • [16] Aerial Reconstructions via Probabilistic Data Fusion
    Cabezas, Randi
    Freifeld, Oren
    Rosman, Guy
    Fisher, John W., III
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 4010 - 4017
  • [17] Data collection with probabilistic guarantees in opportunistic wireless networks
    Ren, Meirui
    Li, Jianzhong
    Guo, Longjiang
    Cai, Zhipeng
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2017, 24 (02) : 125 - 137
  • [18] Scalable Histograms on Large Probabilistic Data
    Tang, Mingwang
    Li, Feifei
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 631 - 640
  • [19] Optimum data collection and fusion schemes in WBSN
    Mehrani, Mohammad
    Attarzadeh, Iman
    Hosseinzadeh, Mehdi
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2020, 33 (03) : 123 - 147
  • [20] Proposal of a probabilistic believes fusion framework application to range data fusion
    Piat, E
    Meizel, D
    IROS '97 - PROCEEDINGS OF THE 1997 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOT AND SYSTEMS: INNOVATIVE ROBOTICS FOR REAL-WORLD APPLICATIONS, VOLS 1-3, 1996, : 1415 - 1422