Topic discovery from short reviews based on data enhancement

被引:0
|
作者
Zhu, Tingting [1 ,2 ]
Liu, Yezheng [1 ,2 ]
Sun, Jianshan [1 ,2 ]
Sun, Chunhua [1 ,2 ]
机构
[1] Hefei Univ Technol, Sch Management, 193 Tunxi Rd, Hefei 230009, Anhui, Peoples R China
[2] Minist Educ, Proc Optimizat & Intelligent Decis Making, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Short reviews; topic discovery; data enhancement; clustering; MODEL; INTERNET;
D O I
10.3233/IDA-205715
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of social media and mobile Internet, short reviews, such as Weibo and Twitter, have exploded online. Discovering topics from short reviews is significant for many practical applications. It can effectively not only identify users' attitudes and emotions but also enhance customer satisfaction and shopping experience. Because reviews are relatively short, the sparsity of reviews considerably restricts the quality of topic discovery. To improve the efficiency of topic discovery, we introduce the concept of data enhancement and strengthen the data in sentences and words in short reviews based on the weight of importance. We then propose a topic model for reviews to topic discovery based on data enhancement (shorted as DE-LDA). We verify the rationality and feasibility of DE-LDA on real datasets. Results show that the proposed method outperforms benchmarks in topic discovery and also has better clustering effects.
引用
收藏
页码:295 / 310
页数:16
相关论文
共 50 条
  • [41] On the topic "Experiences from the pandemic - a data-based review"
    Pecks, Ulrich
    ZEITSCHRIFT FUR GEBURTSHILFE UND NEONATOLOGIE, 2024, 228 (01): : 15 - 16
  • [42] ATD: Anomalous Topic Discovery in High Dimensional Discrete Data
    Soleimani, Hossein
    Miller, David J.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (09) : 2267 - 2280
  • [43] Special topic issue: Knowledge Discovery and Data Mining - Introduction
    Raghavan, VV
    Deogun, JS
    Sever, H
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1998, 49 (05): : 397 - 402
  • [44] Automatic Topic Discovery of Online Hospital Reviews Using an Improved LDA with Variational Gibbs Sampling
    de Groof, Richard
    Xu, Haiping
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 4022 - 4029
  • [45] Emerging topic identification from app reviews via adaptive online biterm topic modeling
    Zhou, Wan
    Wang, Yong
    Gao, Cuiyun
    Yang, Fei
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 23 (05) : 678 - 691
  • [46] Customer knowledge discovery from online reviews
    You, Weijia
    Xia, Mu
    Liu, Lu
    Liu, Dan
    ELECTRONIC MARKETS, 2012, 22 (03) : 131 - 142
  • [47] Topic Modeling to Extract Information from Nutraceutical Product Reviews
    John, Deena Liz
    Kim, Ernest
    Kotian, Kunal
    Ong, Ker Yu
    White, Tyler
    Gloukhova, Luba
    Woodbridge, Diane Myung-kyung
    Ross, Nicholas
    2019 16TH IEEE ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2019,
  • [48] Customer knowledge discovery from online reviews
    Weijia You
    Mu Xia
    Lu Liu
    Dan Liu
    Electronic Markets, 2012, 22 : 131 - 142
  • [49] Topology and Semantic based Topic Dependency Structure Discovery
    Zhao, Anping
    Manandhar, Suresh
    Yu, Lei
    FILOMAT, 2018, 32 (05) : 1843 - 1851
  • [50] Burst topic discovery and trend tracing based on Storm
    Huang, Shihang
    Liu, Ying
    Dang, Depeng
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2014, 416 : 331 - 339