Analysis of the user behavior and opinion classification based on the BBS

Cited by: 10
Authors
Huang, Weitong [1]
Zhao, Yu [1]
Yang, Shiqiang [1]
Lu, Yuchang [1]
Affiliation
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
Keywords
Data mining; BBS; Frequent-set; Text classification; User behavior; User opinion
DOI
10.1016/j.amc.2008.01.038
Chinese Library Classification number
O29 [Applied Mathematics]
Subject classification code
070104
Abstract
BBS is an electronic information center as well as an emerging medium. Alongside the traditional mass media, BBS forums and other online media have become an important channel for expressing public opinion. In this paper, we study BBS user behavior and opinions using data mining techniques; the results have practical significance for government decision-making, for the purification and prosperity of the network, and for building a harmonious society. To study the frequent behavior of users, this paper mines and analyzes the regular patterns of users visiting BBS. This paper also mines the posts on BBS with the ARC-BC text classification algorithm, which is based on association rules, and divides user opinions into three categories: support, oppose, and neutral. A special generalization process and a frequent-set based analysis method are proposed to suit the particular data type of user visit records. The experimental results show that certain users and the collective exhibit obvious similarities or differences in behavior, and that the algorithm performs well on the BBS text classification problem. The data mining method can also cluster users with similar behavior patterns. This paper shows that the frequent-set based method is very effective for mining BBS user access patterns. (C) 2008 Elsevier Inc. All rights reserved.
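As a rough illustration of the frequent-set idea mentioned in the abstract, the following is a minimal Apriori-style sketch in Python. It is not the authors' implementation: the record layout `(user, board, hour)`, the time-of-day buckets in `generalize`, the sample data, and the support threshold are all assumptions made for the example, since the abstract does not specify the generalization process or the exact mining procedure.

```python
from collections import Counter
from itertools import combinations


def generalize(hour):
    """Map a raw visit hour (0-23) to a coarse period label.

    The bucket boundaries are hypothetical, chosen only for this sketch.
    """
    if 6 <= hour < 12:
        return "morning"
    if 12 <= hour < 18:
        return "afternoon"
    if 18 <= hour < 24:
        return "evening"
    return "night"


def frequent_sets(transactions, min_support):
    """Return {itemset: count} for itemsets found in >= min_support transactions."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items and keep the frequent ones.
    counts = Counter(frozenset([item]) for t in transactions for item in t)
    current = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(current)

    k = 2
    while current:
        # Join step: combine frequent (k-1)-sets into candidate k-sets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        # Count how many transactions contain each surviving candidate.
        counts = Counter()
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        current = {s: n for s, n in counts.items() if n >= min_support}
        result.update(current)
        k += 1
    return result


# Hypothetical visit records: (user, board, hour of visit).
visits = [
    ("u1", "news", 20), ("u1", "sports", 21),
    ("u2", "news", 19), ("u2", "sports", 22),
    ("u3", "news", 9), ("u3", "tech", 10),
]

# Build one transaction per user from the generalized items.
transactions = {}
for user, board, hour in visits:
    items = {f"board={board}", f"period={generalize(hour)}"}
    transactions.setdefault(user, set()).update(items)

frequent = frequent_sets(transactions.values(), min_support=2)
for itemset, count in sorted(frequent.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

Users whose transactions share the same frequent sets (here, the hypothetical `u1` and `u2`, who both visit the news and sports boards in the evening) would be candidates for the same behavior cluster.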
Pages: 668-676
Number of pages: 9