Topic discovery from short reviews based on data enhancement

被引:0
|
作者
Zhu, Tingting [1 ,2 ]
Liu, Yezheng [1 ,2 ]
Sun, Jianshan [1 ,2 ]
Sun, Chunhua [1 ,2 ]
机构
[1] Hefei Univ Technol, Sch Management, 193 Tunxi Rd, Hefei 230009, Anhui, Peoples R China
[2] Minist Educ, Proc Optimizat & Intelligent Decis Making, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Short reviews; topic discovery; data enhancement; clustering; MODEL; INTERNET;
D O I
10.3233/IDA-205715
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of social media and mobile Internet, short reviews, such as Weibo and Twitter, have exploded online. Discovering topics from short reviews is significant for many practical applications. It can effectively not only identify users' attitudes and emotions but also enhance customer satisfaction and shopping experience. Because reviews are relatively short, the sparsity of reviews considerably restricts the quality of topic discovery. To improve the efficiency of topic discovery, we introduce the concept of data enhancement and strengthen the data in sentences and words in short reviews based on the weight of importance. We then propose a topic model for reviews to topic discovery based on data enhancement (shorted as DE-LDA). We verify the rationality and feasibility of DE-LDA on real datasets. Results show that the proposed method outperforms benchmarks in topic discovery and also has better clustering effects.
引用
收藏
页码:295 / 310
页数:16
相关论文
共 50 条
  • [31] Utilizing Recurrent Neural Network for topic discovery in short text scenarios
    Lu, Heng-Yang
    Kang, Ning
    Li, Yun
    Zhan, Qian-Yi
    Xie, Jun-Yuan
    Wang, Chong-Jun
    INTELLIGENT DATA ANALYSIS, 2019, 23 (02) : 259 - 277
  • [32] Constructing Pseudo Documents with Semantic Similarity for Short Text Topic Discovery
    Lu, Heng-yang
    Li, Yun
    Tang, Chi
    Wang, Chong-jun
    Xie, Jun-yuan
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 437 - 449
  • [33] Measuring service quality from unstructured data: A topic modeling application on airline passengers' online reviews
    Korfiatis, Nikolaos
    Stamolampros, Panagiotis
    Kourouthanassis, Panos
    Sagiadinos, Vasileios
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 472 - 486
  • [34] Domain-Oriented Topic Discovery Based on Features Extraction and Topic Clustering
    Lu, Xiaofeng
    Zhou, Xiao
    Wang, Wenting
    Lio, Pietro
    Hui, Pan
    IEEE ACCESS, 2020, 8 (08): : 93648 - 93662
  • [35] Topic model-based mass spectrometric data analysis in cancer biomarker discovery studies
    Minkun Wang
    Tsung-Heng Tsai
    Cristina Di Poto
    Alessia Ferrarini
    Guoqiang Yu
    Habtom W. Ressom
    BMC Genomics, 17
  • [36] Topic model-based mass spectrometric data analysis in cancer biomarker discovery studies
    Wang, Minkun
    Tsai, Tsung-Heng
    Di Poto, Cristina
    Ferrarini, Alessia
    Yu, Guoqiang
    Ressom, Habtom W.
    BMC GENOMICS, 2016, 17
  • [37] Online product reviews helpfulness prediction based on topic analysis
    Zhang W.
    Wang Q.
    Du Y.
    Nie K.
    Li J.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2022, 42 (10): : 2757 - 2768
  • [38] User Needs Mining Based on Topic Analysis of Online Reviews
    Liu, Liqiong
    Zhang, Liyi
    Ye, Pinghao
    Liu, Qihua
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (01): : 230 - 235
  • [39] Social media analytics - Challenges in topic discovery, data collection, and data preparation
    Stieglitz, Stefan
    Mirbabaie, Milad
    Ross, Bjorn
    Neuberger, Christoph
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2018, 39 : 156 - 168
  • [40] Study of Multi-source Data Fusion in Topic Discovery
    Xu, Hai Yun
    Wang, Chao
    Ru, Li Jie
    Yue, Zeng Hui
    Wei, Ling
    Fang, Shu
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURETECH & MUE, 2016, 393 : 729 - 735