Effective keyword query structuring using NER for XML retrieval

被引:2
|
作者
Roko, Abubakar [1 ]
Doraisamy, Shyamala [1 ]
Jantan, Azrul Hazri [1 ]
Azman, Azreen [1 ]
机构
[1] Univ Putra Malaysia, Dept Multimedia, Serdang 43400, Malaysia
关键词
Managing and storing XML data; Indexing and retrieval of XML data; Metadata and ontologies;
D O I
10.1108/IJWIS-06-2014-0022
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - The purpose of this paper is to propose and evaluate XKQSS, a query structuring method that relegates the task of generating structured queries from a user to a search engine while retaining the simple keyword search query interface. A more effective way for searching XML database is to use structured queries. However, using query languages to express queries prove to be difficult for most users since this requires learning a query language and knowledge of the underlying data schema. On the other hand, the success of Web search engines has made many users to be familiar with keyword search and, therefore, they prefer to use a keyword search query interface to search XML data. Design/methodology/approach - Existing query structuring approaches require users to provide structural hints in their input keyword queries even though their interface is keyword base. Other problems with existing systems include their inability to put keyword query ambiguities into consideration during query structuring and how to select the best generated structure query that best represents a given keyword query. To address these problems, this study allows users to submit a schema independent keyword query, use named entity recognition (NER) to categorize query keywords to resolve query ambiguities and compute semantic information for a node from its data content. Algorithms were proposed that find user search intentions and convert the intentions into a set of ranked structured queries. Findings - Experiments with Sigmod and IMDB datasets were conducted to evaluate the effectiveness of the method. The experimental result shows that the XKQSS is about 20 per cent more effective than XReal in terms of return nodes identification, a state-of-art systems for XML retrieval. Originality/value - Existing systems do not take keyword query ambiguities into account. XKSS consists of two guidelines based on NER that help to resolve these ambiguities before converting the submitted query. It also include a ranking function computes a score for each generated query by using both semantic information and data statistic, as opposed to data statistic only approach used by the existing approaches.
引用
收藏
页码:33 / 53
页数:21
相关论文
共 50 条
  • [21] Beyond Bag of Words: A New Model for XML Keyword Query
    Liu, Xiping
    Wan, Changxuan
    2014 INTERNATIONAL CONFERENCE ON MANAGEMENT OF E-COMMERCE AND E-GOVERNMENT (ICMECG), 2014, : 252 - 259
  • [22] MAXLCA: A NEW QUERY SEMANTIC MODEL FOR XML KEYWORD SEARCH
    Gao, Ning
    Deng, Zhi-Hong
    Jiang, Jia-Jian
    Yu, Hang
    JOURNAL OF WEB ENGINEERING, 2012, 11 (02): : 131 - 145
  • [23] Efficient XML keyword query refinement with meaningful results generation
    Huang, Jing
    Lu, Jiaheng
    Meng, Xiaofeng
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (05): : 841 - 848
  • [24] A Novel Two-Phase XML Keyword Query Algorithm
    Lin Xudong
    Wang Ning
    Xu De
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (04): : 613 - 617
  • [25] A fast approach for SLCA in keyword query over XML Document
    Wan, Liyong
    FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE IV, PTS 1-5, 2014, 496-500 : 1779 - 1782
  • [26] The Research of XML Keyword Retrieval Algorithms Based on MapReduce
    Xia, YaoWen
    Xie, Jili
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 3347 - 3349
  • [27] KEYWORD-QUERY EXPANSION USING CITATION CLUSTERS FOR PAPER INFORMATION RETRIEVAL
    Yamaguchi, Kiyohiro
    Mori, Junichiro
    Kajikawa, Yuya
    14TH INTERNATIONAL SOCIETY OF SCIENTOMETRICS AND INFORMETRICS CONFERENCE (ISSI), 2013, : 2034 - 2036
  • [28] Efficient, effective and flexible XML retrieval using summaries
    Ali, M. S.
    Consens, Mariano
    Gu, Xin
    Kanza, Yaron
    Rizzolo, Flavio
    Stasiu, Raquel
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 89 - 103
  • [29] XML keyword retrieval algorithm based on nearest pair
    Ji, Cong-Rui
    Deng, Zhi-Hong
    Tang, Shi-Wei
    Ruan Jian Xue Bao/Journal of Software, 2009, 20 (04): : 910 - 917
  • [30] Structural feedback for keyword-based XML retrieval
    Schenkel, Ralf
    Theobald, Martin
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 326 - 337