Extended structural relevance framework: a framework for evaluating structured document retrieval

被引:0
|
作者
M. Sadek Ali
Mariano Consens
Mounia Lalmas
机构
[1] University of Toronto,Bahen Center
[2] Yahoo! Research,undefined
来源
Information Retrieval | 2012年 / 15卷
关键词
Tree retrieval; XML retrieval; Evaluation; Relevance; Redundancy; User navigation; Effectiveness measures;
D O I
暂无
中图分类号
学科分类号
摘要
A structured document retrieval (SDR) system aims to minimize the effort users spend to locate relevant information by retrieving parts of documents. To evaluate the range of SDR tasks, from element to passage to tree retrieval, numerous task-specific measures have been proposed. This has resulted in SDR evaluation measures that cannot easily be compared with respect to each other and across tasks. In previous work, we defined the SDR task of tree retrieval where passage and element are special cases. In this paper, we look in greater detail into tree retrieval to identify the main components of SDR evaluation: relevance, navigation, and redundancy. Our goal is to evaluate SDR within a single probabilistic framework based on these components. This framework, called Extended Structural Relevance (ESR), calculates user expected gain in relevant information depending on whether it is seen via hits (relevant results retrieved), unseen via misses (relevant results not retrieved), or possibly seen via near-misses (relevant results accessed via navigation). We use these expectations as parameters to formulate evaluation measures for tree retrieval. We then demonstrate how existing task-specific measures, if viewed as tree retrieval, can be formulated, computed and compared using our framework. Finally, we experimentally validate ESR across a range of SDR tasks.
引用
收藏
页码:558 / 590
页数:32
相关论文
共 50 条
  • [1] Extended structural relevance framework: a framework for evaluating structured document retrieval
    Ali, M. Sadek
    Consens, Mariano
    Lalmas, Mounia
    INFORMATION RETRIEVAL, 2012, 15 (06): : 558 - 590
  • [2] An end-to-end pseudo relevance feedback framework for neural document retrieval
    Wang, Le
    Luo, Ze
    Li, Canjia
    He, Ben
    Sun, Le
    Yu, Hao
    Sun, Yingfei
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (02)
  • [3] A Document Retrieval Framework for Scientific Publications
    Vazhenin, Denis
    Ishikawa, Satoru
    Satake, Satoru
    Klyuev, Vitaly
    2008 IEEE REGION 8 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNOLOGIES IN ELECTRICAL AND ELECTRONICS ENGINEERING: SIBIRCON 2008, PROCEEDINGS, 2008, : 95 - 99
  • [4] Document Indexing Framework for Retrieval of Degraded Document Images
    Garg, Ritu
    Hassan, Ehtesham
    Chaudhury, Santanu
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1261 - 1265
  • [5] A novel dense retrieval framework for long document retrieval
    Wang, Jiajia
    Zhao, Weizhong
    Tu, Xinhui
    He, Tingting
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
  • [6] A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
    Yen, Li-Phen
    Wu, Zhen-Yu
    Chen, Kuan-Yu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8139 - 8143
  • [7] Machine learning in a multimedia document retrieval framework
    Perrone, MP
    Russell, GF
    Ziq, A
    IBM SYSTEMS JOURNAL, 2002, 41 (03) : 494 - 503
  • [8] Framework for document retrieval using latent semantic indexing
    Phadnis, Neelam
    Gadge, Jayant
    International Journal of Computers and Applications, 2014, 94 (14) : 37 - 41
  • [9] A Unified Relevance Feedback Framework for Web Image Retrieval
    Cheng, En
    Jing, Feng
    Zhang, Lei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2009, 18 (06) : 1350 - 1357
  • [10] Developing and evaluating the extended epistemic vigilance framework
    Bielik, Tom
    Krell, Moritz
    JOURNAL OF RESEARCH IN SCIENCE TEACHING, 2025, 62 (03) : 869 - 895