Query Subtopic Mining Exploiting Word Embedding for Search Result Diversification

Cited by: 4
Authors
Ullah, Md Zia [1]
Shajalal, Md [1]
Chy, Abu Nowshed [1]
Aono, Masaki [1]
Affiliations
[1] Toyohashi Univ Technol, Dept Comp Sci & Engn, Toyohashi, Aichi, Japan
Keywords
Subtopic mining; Word embedding; Diversification; Novelty
DOI
10.1007/978-3-319-48051-0_24
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Understanding users' search intents by mining query subtopics is a challenging task and a prerequisite for search result diversification. This paper proposes mining query subtopics by exploiting word embeddings and a short-text similarity measure. We extract candidate subtopics from multiple sources and introduce a new ranking method, based on a new novelty estimation, that faithfully represents the possible search intents of the query. To estimate subtopic relevance, we introduce new semantic features based on word embeddings and bipartite-graph-based ranking. To estimate the novelty of a subtopic, we propose a method that combines contextual and categorical similarities. Experimental results on the NTCIR subtopic mining datasets show that the proposed approach outperforms the baselines, known previous methods, and the official participants of the subtopic mining tasks.
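The abstract does not give the paper's formulas, so the following is only a minimal sketch of the general idea of ranking candidate subtopics by relevance and novelty with word embeddings. It is not the authors' method: it swaps in averaged word vectors with cosine similarity as the short-text similarity, and a greedy MMR-style rule in place of the paper's novelty estimation (which combines contextual and categorical similarities and is not reproduced here). The names `TOY_EMBEDDINGS`, `embed`, `cosine`, `rank_subtopics`, and `trade_off`, and all vector values, are hypothetical and chosen for illustration.

```python
import math

# Toy 3-dimensional word vectors. Hypothetical values for illustration only;
# a real system would load pretrained embeddings instead.
TOY_EMBEDDINGS = {
    "jaguar": [0.9, 0.1, 0.0],
    "car": [0.8, 0.2, 0.1],
    "price": [0.7, 0.3, 0.0],
    "dealer": [0.75, 0.25, 0.05],
    "animal": [0.1, 0.9, 0.1],
    "habitat": [0.0, 0.95, 0.2],
}


def embed(text, embeddings):
    """Average the vectors of the known words; return None if none are known."""
    vectors = [embeddings[w] for w in text.lower().split() if w in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity; 0.0 when either vector is missing or has zero norm."""
    if u is None or v is None:
        return 0.0
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_subtopics(query, candidates, embeddings, trade_off=0.5):
    """Greedy MMR-style ranking (an assumed stand-in, not the paper's scoring).

    Each step picks the candidate maximising
    trade_off * relevance - (1 - trade_off) * redundancy,
    where redundancy is the highest similarity to an already ranked subtopic.
    """
    query_vec = embed(query, embeddings)
    vecs = {c: embed(c, embeddings) for c in candidates}
    remaining = list(candidates)
    ranked = []

    def score(candidate):
        relevance = cosine(query_vec, vecs[candidate])
        redundancy = max(
            (cosine(vecs[candidate], vecs[s]) for s in ranked), default=0.0
        )
        return trade_off * relevance - (1 - trade_off) * redundancy

    while remaining:
        best = max(remaining, key=score)
        ranked.append(best)
        remaining.remove(best)
    return ranked


if __name__ == "__main__":
    subtopics = ["jaguar car price", "jaguar car dealer", "jaguar animal habitat"]
    print(rank_subtopics("jaguar", subtopics, TOY_EMBEDDINGS, trade_off=0.3))
```

With a low `trade_off`, the second pick favors a subtopic that differs from the first, so the two car-related candidates are not ranked back to back; with `trade_off=1.0` the ranking reduces to pure relevance to the query.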
Pages: 308-314
Number of pages: 7