Using Language Models and Topic Models for XML Retrieval

Cited by: 0
Authors
Huang, Fang [1]
Affiliation
[1] Robert Gordon Univ, Sch Comp, Aberdeen AB9 1FR, Scotland
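Source
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract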
This paper presents the results of our participation in the INEX 2007 ad hoc track. We implemented two different models: a mixture language model and a topic model. For the language model, we focused on how shallow features of text display information in an XML document can be used to enhance retrieval effectiveness. Our language model combined estimates based on the element full text and on a compact representation of the element. We also used non-content priors, including the location at which the element appears in the original document and the length of the element path, to boost retrieval effectiveness. For the topic model, we looked at a recent statistical model called Latent Dirichlet Allocation [1] and explored how it could be applied to XML retrieval.
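The mixture idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `score_element`, the linear interpolation with a collection model, the weights, and the single `log_prior` argument standing in for the location and path-length priors are all assumptions made for illustration. The compact representation is assumed here to be simply a short list of tokens.

```python
import math
from collections import Counter


def score_element(query, full_text, compact_text, collection_counts,
                  collection_total, w_full=0.5, w_compact=0.3, w_coll=0.2,
                  log_prior=0.0):
    """Log-score of one XML element for a query (hypothetical sketch).

    Each query term probability is a weighted mixture of three unigram
    estimates: the element full text, its compact representation, and the
    whole collection. The weights are illustrative, not values from the paper.
    """
    full = Counter(full_text)
    compact = Counter(compact_text)
    n_full = sum(full.values())
    n_compact = sum(compact.values())

    # log_prior is an assumed stand-in for non-content evidence such as
    # element location or path length.
    score = log_prior
    for term in query:
        p_full = full[term] / n_full if n_full else 0.0
        p_compact = compact[term] / n_compact if n_compact else 0.0
        p_coll = (collection_counts.get(term, 0) / collection_total
                  if collection_total else 0.0)
        p = w_full * p_full + w_compact * p_compact + w_coll * p_coll
        if p == 0.0:
            # The term is unseen everywhere, so the element cannot match.
            return float("-inf")
        score += math.log(p)
    return score


# Toy usage: the same full text, with and without query terms in the
# compact representation.
collection = Counter(
    "xml retrieval language model topic model element".split())
total = sum(collection.values())
query = ["xml", "retrieval"]
text = ["xml", "retrieval", "model", "element"]
print(score_element(query, text, ["xml", "retrieval"], collection, total))
print(score_element(query, text, ["topic", "model"], collection, total))
```

In this sketch an element whose compact representation contains the query terms scores higher than one with identical full text but an unrelated compact representation, which is the effect the combined estimate is meant to capture.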
Pages: 94-102
Number of pages: 9
Related papers
50 in total
  • [21] Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models
    Zhu, Hongyi
    Huang, Jia-Hong
    Rudinac, Stevan
    Kanoulas, Evangelos
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 978 - 987
  • [22] Using the INEX environment as a test bed for various user models for XML retrieval
    Mass, Yosi
    Mandelbrod, Matan
    ADVANCES IN XML INFORMATION RETRIEVAL AND EVALUATION, 2006, 3977 : 187 - 195
  • [23] Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora
    Ivan Vulić
    Wim De Smet
    Marie-Francine Moens
    Information Retrieval, 2013, 16 : 331 - 368
  • [24] Cross-language information retrieval models based on latent topic models trained with document-aligned comparable corpora
    Vulic, Ivan
    De Smet, Wim
    Moens, Marie-Francine
    INFORMATION RETRIEVAL, 2013, 16 (03): : 331 - 368
  • [25] Improving the Effectiveness of XML Retrieval with User Navigation Models
    Ali, M. S.
    Consens, Mariano P.
    Helou, Bassam
    ICDE: 2009 IEEE 25TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2009, : 1584 - 1587
  • [26] Detecting polarizing language in Twitter using topic models and ML algorithms
    Gitari N.D.
    Zuping Z.
    Herman W.
    Gitari, Njagi Dennis (gitaden2000@yahoo.com), 1600, Science and Engineering Research Support Society (09): : 211 - 222
  • [27] Adaptation of Language Models for SMT Using Neural Networks with Topic Information
    Zhao, Yinggong
    Huang, Shujian
    Dai, Xin-Yu
    Chen, Jiajun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (03)
  • [28] Document Retrieval Using Entity-Based Language Models
    Raviv, Hadas
    Kurland, Oren
    Carmel, David
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 65 - 74
  • [29] A Comparative Study of Utilizing Topic Models for Information Retrieval
    Yi, Xing
    Allan, James
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 29 - 41
  • [30] Assessment of the Quality of Topic Models for Information Retrieval Applications
    Yuan, Meng
    Lin, Pauline
    Rashidi, Lida
    Zobel, Justin
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 265 - 274