Using Language Models and Topic Models for XML Retrieval

Cited by: 0
Authors
Huang, Fang [1]
Affiliation
[1] Robert Gordon Univ, Sch Comp, Aberdeen AB9 1FR, Scotland
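Source
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract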
This paper presents the results of our participation in the INEX 2007 ad hoc track. We implemented two different models: a mixture language model and a topic model. For the language model, we focused on how shallow features of text display information in an XML document can be used to enhance retrieval effectiveness. Our language model combined estimates based on the element's full text and on a compact representation of the element. We also used non-content priors, including the location where the element appears in the original document and the length of the element path, to boost retrieval effectiveness. For the topic model, we looked at a recent statistical model called Latent Dirichlet Allocation [1] and explored how it could be applied to XML retrieval.
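The abstract does not give the scoring formula, so the following is only an illustrative sketch of how a mixture language model with a non-content prior might be put together. Everything in it is an assumption rather than the paper's method: the function names `smoothed_prob` and `score_element`, the Jelinek-Mercer smoothing, the weights `alpha` and `lam`, and the prior of the form 1 / path length are all hypothetical. The location prior mentioned in the abstract is left out for brevity.

```python
import math
from collections import Counter


def smoothed_prob(term, tokens, coll_counts, coll_size, lam=0.5):
    """Jelinek-Mercer smoothing (assumed): mix element and collection estimates."""
    p_elem = tokens.count(term) / len(tokens) if tokens else 0.0
    p_coll = coll_counts[term] / coll_size
    return (1 - lam) * p_elem + lam * p_coll


def score_element(query, full_text, compact, collection, path_length, alpha=0.6):
    """Log score of one XML element for a query (hypothetical formulation).

    full_text   -- tokens of the element's full text
    compact     -- tokens of a compact representation of the element
    collection  -- tokens of the whole collection, used for smoothing
    path_length -- number of steps in the element's path (must be >= 1)
    """
    coll_counts = Counter(collection)
    coll_size = len(collection)
    # Hypothetical non-content prior: elements with shorter paths score higher.
    log_score = math.log(1.0 / path_length)
    for term in query:
        p_full = smoothed_prob(term, full_text, coll_counts, coll_size)
        p_compact = smoothed_prob(term, compact, coll_counts, coll_size)
        # Mixture of the two element representations.
        p = alpha * p_full + (1 - alpha) * p_compact
        if p == 0.0:
            return float("-inf")  # term unseen anywhere in the collection
        log_score += math.log(p)
    return log_score
```

Under these assumptions, an element whose full text and compact representation both contain the query terms ranks above one that contains neither, and of two otherwise identical elements the one with the shorter path ranks higher.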
Pages: 94-102
Page count: 9
Related papers
50 in total
  • [31] Probabilistic Topic Models for Text Data Retrieval and Analysis
    Zhai, ChengXiang
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1399 - 1401
  • [32] Cross-Language Retrieval Using Link-Based Language Models
    Roth, Benjamin
    Klakow, Dietrich
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 773 - 774
  • [33] Exploiting Temporal Topic Models in Social Media Retrieval
    Tran, Tuan A.
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 999 - 999
  • [34] Differentiating language usage through topic models
    McFarland, Daniel A.
    Ramage, Daniel
    Chuang, Jason
    Heer, Jeffrey
    Manning, Christopher D.
    Jurafsky, Daniel
    POETICS, 2013, 41 (06) : 607 - 625
  • [35] Automated Topic Analysis with Large Language Models
    Kirilenko, Andrei
    Stepchenkova, Svetlana
    INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 2024, ENTER 2024, 2024, : 29 - 34
  • [36] Statistical Language Models for Information Retrieval
    Department of Computer Science, United States
    Unknown
    Synth. Lect. Human Lang. Technol., 2009, 1 (1-141):
  • [37] Inferential language models for information retrieval
    Nie, Jian-Yun
    Cao, Guihong
    Bai, Jing
    ACM Transactions on Asian Language Information Processing, 2006, 5 (04) : 296 - 322
  • [38] Positional Language Models for Information Retrieval
    Lv, Yuanhua
    Zhai, ChengXiang
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 299 - 306
  • [39] Statistical Language Models for Information Retrieval
    Gaussier, Eric
    COMPUTATIONAL LINGUISTICS, 2010, 36 (02) : 279 - 281
  • [40] Analysis of Retrieval Models for Cross Language Information Retrieval
    Ujjwal, Dasu
    Rastogi, Prakhar
    Siddhartha, Siril
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,