Data mining of web access logs from an academic web site

被引:0
|
作者
Ciesielski, V
Lalani, A
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have used a general purpose data mining tool to determine whether we can find any 'golden nuggets' in the web access logs of a large academic web site. Our goal was to use general purpose data mining algorithms to analyse visitors to the website and somehow characterise or distinguish them in some way. We used two web access logs, one from 2001 and one from 2003. We extracted 4 different feature sets from the web logs and used algorithms for classification (1R, J48/C4.5), clustering (EM), association finding (apriori) and feature selection (correlation based subset evaluation with best first search). We discovered several nuggets, the most significant being that a major difference between visitors from within Australia and visitors from outside Australia is that visitors from outside Australia generally arrive via search engines and are interested in information about postgraduate courses.
引用
收藏
页码:1034 / 1043
页数:10
相关论文
共 50 条
  • [41] Analysis of web access logs for surveillance of influenza
    Johnson, HA
    Wagner, MM
    Hogan, WR
    Chapman, W
    Olszewski, RT
    Dowling, J
    Barnas, G
    MEDINFO 2004: PROCEEDINGS OF THE 11TH WORLD CONGRESS ON MEDICAL INFORMATICS, PT 1 AND 2, 2004, 107 : 1202 - 1206
  • [42] A different approach for the analysis of web access logs
    Schoier, G
    Melfi, G
    New Developments in Classification and Data Analysis, 2005, : 211 - 216
  • [43] Efficient Indexing and Representation of Web Access Logs
    Claude, Francisco
    Konow, Roberto
    Navarro, Gonzalo
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2014, 2014, 8799 : 65 - 76
  • [44] Web stream algorithm for mining web access patterns
    Department of Computer Science and Technology, Fudan University, Shanghai 200433, China
    Moshi Shibie yu Rengong Zhineng, 2007, 6 (757-762): : 757 - 762
  • [45] Mining of Web Server Logs in a Distributed Cluster Using Big Data Technologies
    Savitha, K.
    Vijaya, M. S.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (01) : 137 - 142
  • [46] Mining Web Usage Profiles from Proxy Logs: User Identification
    Xu, Jing
    Xu, Fei
    Ma, Fanshu
    Zhou, Lei
    Jiang, Shuanglin
    Rao, Zhibo
    2021 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (DSC), 2021,
  • [47] An Approach for Mining Web Service Composition Patterns from Execution Logs
    Tang, Ran
    Zou, Ying
    12TH IEEE INTERNATIONAL SYMPOSIUM ON WEB SYSTEMS EVOLUTION (WSE 2010), 2010, : 53 - 62
  • [48] Improving the effectiveness of a web site with web usage mining
    Spiliopoulou, M
    Pohle, C
    Faulstich, LC
    WEB USAGE ANALYSIS AND USER PROFILING, 2000, 1836 : 142 - 162
  • [49] Web data mining
    Wibonele, KJ
    Zhang, YQ
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 241 - 244
  • [50] Data mining on Web
    Zhang, XB
    THIRD INTERNATIONAL CONFERENCE ON ELECTRONIC COMMERCE ENGINEERING: DIGITAL ENTERPRISES AND NONTRADITIONAL INDUSTRIALIZATION, 2003, : 504 - 507