Topic-Specific Post Identification in Microblog Streams

Authors
Karunasekera, Shanika [1]
Harwood, Aaron [1]
Samarawickrama, Sameendra [1]
Ramamohanarao, Kotagiri [1]
Robins, Garry [2]
Affiliations
[1] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
[2] Univ Melbourne, Melbourne Sch Psychol Sci, Melbourne, Vic 3010, Australia
Keywords
microblog; topic; keyword; query; document; term
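DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract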
Tracking microblog discussion on a given topic is useful for a wide range of higher-level applications. Microblog services such as Twitter provide a simple keyword-based tracking capability that returns any tweet containing a keyword. Because microblog posts are short, tracking with only a small number of topic-specific query words reduces recall. Good recall generally requires more keywords than regular document retrieval does, but a larger keyword set also returns many off-topic posts, which lowers precision. In our work, we consider the scenario of using a large number of query terms to maintain high recall for automated tracking of microblog streams. The challenge we address is how to score each returned microblog post with respect to the query, online and in an unsupervised manner, so as to identify the posts that are on topic. To this end, we propose a new term-scoring expression, which we call Adjusted Information Gain (AIG), and we compare it with other term-scoring expressions: inverse document frequency, Dice, Jaccard and keyword frequency. Our comparisons consider a selection of document-scoring functions applied to roughly 40 million tweets collected over a 20-day period for each of two topics. Our results show significant improvements over existing term-scoring expressions (from 8% to 40% in the area under the ROC curve), depending on topic and specificity, and provide insight into further work on query expansion techniques.
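To make the baseline term-scoring expressions named in the abstract concrete, the following Python sketch scores query terms by inverse document frequency and by Dice and Jaccard coefficients, then scores a post by summing the scores of the query terms it contains. This is only an illustration under stated assumptions, not the paper's method: the abstract does not define AIG, so it is omitted; measuring Dice and Jaccard as co-occurrence with a single `seed` keyword, the summing document-scoring function, the whitespace tokenizer, and the names `term_scores` and `score_post` are all hypothetical choices made here.

```python
import math
from collections import Counter


def term_scores(posts, query_terms, seed):
    """Score each query term by IDF and by Dice/Jaccard co-occurrence with `seed`.

    Assumption: Dice and Jaccard are computed over post-level co-occurrence
    with one seed keyword. The abstract does not specify this.
    """
    token_sets = [set(p.lower().split()) for p in posts]  # naive tokenizer
    n = len(token_sets)
    df = Counter()  # number of posts containing the term
    co = Counter()  # number of posts containing both the term and the seed
    for tokens in token_sets:
        has_seed = seed in tokens
        for t in (set(query_terms) | {seed}) & tokens:
            df[t] += 1
            if has_seed:
                co[t] += 1

    scores = {}
    for t in query_terms:
        if df[t] == 0:
            # Term never seen in the collection: give it no weight.
            scores[t] = {"idf": 0.0, "dice": 0.0, "jaccard": 0.0}
            continue
        scores[t] = {
            "idf": math.log(n / df[t]),
            "dice": 2 * co[t] / (df[t] + df[seed]),
            "jaccard": co[t] / (df[t] + df[seed] - co[t]),
        }
    return scores


def score_post(post, scores, measure):
    """Hypothetical document score: sum of term scores for query terms in the post."""
    tokens = set(post.lower().split())
    return sum(s[measure] for t, s in scores.items() if t in tokens)


if __name__ == "__main__":
    posts = [
        "flood warning in brisbane",
        "brisbane flood water rising",
        "water bottle sale today",
        "nice weather today",
    ]
    scores = term_scores(posts, {"flood", "water", "brisbane"}, seed="flood")
    for p in posts:
        print(p, "->", score_post(p, scores, "dice"))
```

In this toy collection, a post that mentions only "water" scores lower under Dice than one that also mentions "flood" and "brisbane", which is the kind of separation between on-topic and off-topic posts that a term-scoring expression is meant to provide.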
Pages: 7