Topic discovery from short reviews based on data enhancement

Authors
Zhu, Tingting [1, 2]
Liu, Yezheng [1, 2]
Sun, Jianshan [1, 2]
Sun, Chunhua [1, 2]
Affiliations
[1] Hefei Univ Technol, Sch Management, 193 Tunxi Rd, Hefei 230009, Anhui, Peoples R China
[2] Minist Educ, Proc Optimizat & Intelligent Decis Making, Hefei, Anhui, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Short reviews; topic discovery; data enhancement; clustering; MODEL; INTERNET;
DOI
10.3233/IDA-205715
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the rapid development of social media and the mobile Internet, short reviews, such as posts on Weibo and Twitter, have exploded online. Discovering topics from short reviews is significant for many practical applications: it can not only effectively identify users' attitudes and emotions but also enhance customer satisfaction and the shopping experience. Because reviews are relatively short, their sparsity considerably restricts the quality of topic discovery. To improve the efficiency of topic discovery, we introduce the concept of data enhancement and strengthen the data at the sentence and word levels in short reviews based on importance weights. We then propose a topic model for topic discovery from reviews based on data enhancement (abbreviated as DE-LDA). We verify the rationality and feasibility of DE-LDA on real datasets. Results show that the proposed method outperforms benchmarks in topic discovery and also achieves better clustering performance.
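The abstract does not say how the importance weights are computed or how the data are strengthened, so the following is only a minimal sketch of the general idea, not the DE-LDA procedure itself. It assumes a smoothed IDF score as a stand-in word weight and "strengthens" a short review by repeating each word in proportion to that weight; the function name `enhance_reviews` and the parameter `max_repeat` are hypothetical.

```python
import math
from collections import Counter


def enhance_reviews(reviews, max_repeat=3):
    """Repeat each word according to an assumed importance weight.

    Assumption: smoothed IDF stands in for the paper's weight of
    importance, which the abstract does not define.
    """
    # Naive whitespace tokenization, lowercased.
    tokenized = [review.lower().split() for review in reviews]
    n_docs = len(tokenized)

    # Document frequency: number of reviews that contain each word.
    doc_freq = Counter(word for doc in tokenized for word in set(doc))

    # Smoothed IDF: rarer words receive a larger weight.
    idf = {
        word: math.log((1 + n_docs) / (1 + df)) + 1.0
        for word, df in doc_freq.items()
    }
    max_idf = max(idf.values(), default=1.0)

    enhanced = []
    for doc in tokenized:
        out = []
        for word in doc:
            # Map the weight to between 1 and max_repeat copies.
            copies = max(1, round(max_repeat * idf[word] / max_idf))
            out.extend([word] * copies)
        enhanced.append(out)
    return enhanced


if __name__ == "__main__":
    sample = ["good phone", "good battery", "good screen"]
    for tokens in enhance_reviews(sample):
        print(tokens)
```

The enhanced token lists are longer than the originals and emphasize the rarer words, so they could then be passed to any standard LDA implementation in place of the raw short reviews.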
Pages: 295 - 310
Page count: 16
Related papers
50 in total
  • [1] Torpedo: Topic Periodicity Discovery from Text Data
    Wang, Jingjing
    Deng, Hongbo
    Han, Jiawei
    NEXT-GENERATION ANALYST III, 2015, 9499
  • [2] Adjutant: an R-based tool to support topic discovery for systematic and literature reviews
    Crisan, Anamaria
    Munzner, Tamara
    Gardy, Jennifer L.
    BIOINFORMATICS, 2019, 35 (06) : 1070 - 1072
  • [3] RELFIN - Topic discovery for ontology enhancement and annotation
    Schaal, M
    Müller, RM
    Brunzel, M
    Spiliopoulou, M
    SEMANTIC WEB: RESEARCH AND APPLICATIONS, PROCEEDINGS, 2005, 3532 : 608 - 622
  • [4] TSSE-DMM: Topic Modeling for Short Texts Based on Topic Subdivision and Semantic Enhancement
    Mai, Chengcheng
    Qiu, Xueming
    Luo, Kaiwen
    Chen, Min
    Zhao, Bo
    Huang, Yihua
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 640 - 651
  • [5] A Prerecognition Model for Hot Topic Discovery Based on Microblogging Data
    Zhu, Tongyu
    Yu, Jianjun
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [6] Topic Discovery for Streaming Short Texts with CTM
    Xu, Yunfeng
    Xu, Hua
    Zhu, Longxia
    Hao, Hanyong
    Deng, Junhui
    Sun, Xiaomin
    Bai, Xiaoli
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] Topic Features in Negative Customer Reviews: Evidence Based on Text Data Mining
    Zhen Li
    Fangzhou Li
    Jing Xiao
    Zhi Yang
    The Review of Socionetwork Strategies, 2020, 14 : 19 - 40
  • [8] Topic Features in Negative Customer Reviews: Evidence Based on Text Data Mining
    Li, Zhen
    Li, Fangzhou
    Xiao, Jing
    Yang, Zhi
    REVIEW OF SOCIONETWORK STRATEGIES, 2020, 14 (01) : 19 - 40
  • [9] User group based emotion detection and topic discovery over short text
    Feng, Jiachun
    Rao, Yanghui
    Xie, Haoran
    Wang, Fu Lee
    Li, Qing
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2020, 23 (03) : 1553 - 1587
  • [10] User group based emotion detection and topic discovery over short text
    Jiachun Feng
    Yanghui Rao
    Haoran Xie
    Fu Lee Wang
    Qing Li
    World Wide Web, 2020, 23 : 1553 - 1587