Using Language Models and Topic Models for XML Retrieval

Author
Huang, Fang [1]
Affiliation
[1] Robert Gordon Univ, Sch Comp, Aberdeen AB9 1FR, Scotland
Source
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents the results of our participation in the INEX 2007 ad hoc track. We implemented two models: a mixture language model and a topic model. For the language model, we focused on how shallow features of text display information in an XML document can be used to enhance retrieval effectiveness. Our language model combined estimates based on the element's full text and on a compact representation of the element. We also used non-content priors, including the location at which the element appears in the original document and the length of the element path, to boost retrieval effectiveness. For the topic model, we looked at a recent statistical model, Latent Dirichlet Allocation [1], and explored how it could be applied to XML retrieval.
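To make the scoring idea in the abstract concrete, the following is a minimal sketch of a mixture language model that interpolates a full-text estimate with a compact-representation estimate and adds non-content priors. It is not the paper's implementation: the abstract gives no formulas, so the smoothing scheme (Jelinek-Mercer with an add-one collection estimate), the weights `lam` and `w_full`, the form of the priors (favouring early positions and short element paths), and all function and parameter names are assumptions chosen for illustration.

```python
import math
from collections import Counter


def smoothed_prob(term, tokens, coll_counts, coll_total, lam=0.5):
    """Jelinek-Mercer smoothed unigram estimate of P(term | tokens)."""
    p_text = tokens.count(term) / len(tokens) if tokens else 0.0
    # Add-one smoothing keeps the collection estimate strictly positive.
    p_coll = (coll_counts[term] + 1) / (coll_total + len(coll_counts) + 1)
    return (1 - lam) * p_text + lam * p_coll


def score_element(query, full_text, compact, coll_counts,
                  path_length, position, w_full=0.6):
    """Log-score of one XML element for a query (hypothetical formulation).

    full_text   -- tokens of the element's full text
    compact     -- tokens of a compact representation (assumed here to be
                   a short token list such as title or emphasised words)
    path_length -- number of steps in the element path
    position    -- 0-based location of the element in the document
    """
    coll_total = sum(coll_counts.values())
    # Assumed non-content priors: earlier elements and shorter paths
    # receive a higher prior. The direction of the boost is a guess.
    log_score = -math.log(1 + position) - math.log(1 + path_length)
    for term in query:
        p_full = smoothed_prob(term, full_text, coll_counts, coll_total)
        p_compact = smoothed_prob(term, compact, coll_counts, coll_total)
        # Linear mixture of the two estimates for this query term.
        log_score += math.log(w_full * p_full + (1 - w_full) * p_compact)
    return log_score


# Toy usage with an invented collection and two invented elements.
collection = Counter(
    "xml retrieval language model topic model element path".split())
query = ["xml", "retrieval"]
relevant = score_element(
    query, "xml element retrieval with language model".split(),
    ["xml", "retrieval"], collection, path_length=2, position=0)
off_topic = score_element(
    query, "topic model path".split(),
    ["topic"], collection, path_length=2, position=0)
print(relevant > off_topic)
```

In this sketch the priors enter as additive terms in log space, so they shift the ranking without changing how query terms are matched; a real system would tune the weights and the prior shapes on held-out topics.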
Pages: 94 - 102
Page count: 9