Web Usage Mining: Dynamic Methodology to Preprocessing Web Logs

被引:2
|
作者
Manchanda, Mahesh [1 ]
Gupta, Neena [2 ]
机构
[1] Graph Era Hill Univ, Dehra Dun, Uttar Pradesh, India
[2] Gurukul Kangri Vishwavidyalaya, Haridwar, India
来源
HELIX | 2018年 / 8卷 / 05期
关键词
Web Usage Mining; Data Cleaning; URL Rank; GSPAN; Web Pre-Fetching;
D O I
10.29042/2018-3810-3815
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Internet is a huge source of massive information for retrieving information and searching knowledge from WWW, leading to increase network traffic, access delay & server overload, which results in poor web services. With the use of Web-caching & web prefetching techniques to enhance the performance of web services where web mining techniques play an important role to decide which web object should be pm-fetched from server and stored in proxy cache memory so that the web object with high probability of request, in the next couple of days, serves as the base of the proxy cache. But for efficient web mining and to extract meaningful usage access pattern, the raw log file must be transformed into a meaningful & formatted file. This paper proposed a new dynamic preprocess technique to create a dynamic training dataset for prediction model using web mining, and Graph based substructure Pattern Mining (GSPAN) for improved preprocessing using proxy log. The proposed model would help in minimizing the cache size by 40% thus improving the overall performance.
引用
收藏
页码:3810 / 3815
页数:6
相关论文
共 50 条
  • [1] Preprocessing Web logs: A Critical phase in Web Usage Mining
    Goel, Neha
    Jha, C. K.
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER ENGINEERING AND APPLICATIONS (ICACEA), 2015, : 672 - 676
  • [2] An overview of preprocessing of Web log files for Web usage mining
    Department of Computer Science, SDNB Vaishnav College for Women, Chennai, Tamil Nadu, India
    不详
    不详
    J. Theor. Appl. Inf. Technol., 2 (178-185):
  • [3] Semantic preprocessing of Web request streams for Web usage mining
    Jung, JJ
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2005, 11 (08) : 1383 - 1396
  • [4] Web usage mining: extracting unexpected periods from web logs
    F. Masseglia
    P. Poncelet
    M. Teisseire
    A. Marascu
    Data Mining and Knowledge Discovery, 2008, 16 : 39 - 65
  • [5] Web usage mining: extracting unexpected periods from web logs
    Masseglia, F.
    Poncelet, P.
    Teisseire, M.
    Marascu, A.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (01) : 39 - 65
  • [6] An overview of data preprocessing in data and web usage mining
    Suresh, R. M.
    Padmajavalli, R.
    2006 1ST INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, 2006, : 193 - +
  • [7] Research and development of data preprocessing in Web Usage Mining
    Li Chaofeng
    PROCEEDINGS OF THE 2006 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING, 2006, : 1311 - 1315
  • [8] Advanced data preprocessing for intersites web usage mining
    Tanasa, D
    Trousse, B
    IEEE INTELLIGENT SYSTEMS, 2004, 19 (02) : 59 - 65
  • [9] An effective Data Preprocessing method for Web Usage Mining
    Reddy, K. Sudheer
    Reddy, M. Kantha
    Sitaramulu, V.
    2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013, : 7 - 10
  • [10] Extracting Knowledge from Web Server Logs Using Web Usage Mining
    Eltahir, Mirghani A.
    Dafa-Alla, Anour F. A.
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 413 - 417