Web Usage Mining: Dynamic Methodology to Preprocessing Web Logs

被引:2
|
作者
Manchanda, Mahesh [1 ]
Gupta, Neena [2 ]
机构
[1] Graph Era Hill Univ, Dehra Dun, Uttar Pradesh, India
[2] Gurukul Kangri Vishwavidyalaya, Haridwar, India
来源
HELIX | 2018年 / 8卷 / 05期
关键词
Web Usage Mining; Data Cleaning; URL Rank; GSPAN; Web Pre-Fetching;
D O I
10.29042/2018-3810-3815
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Internet is a huge source of massive information for retrieving information and searching knowledge from WWW, leading to increase network traffic, access delay & server overload, which results in poor web services. With the use of Web-caching & web prefetching techniques to enhance the performance of web services where web mining techniques play an important role to decide which web object should be pm-fetched from server and stored in proxy cache memory so that the web object with high probability of request, in the next couple of days, serves as the base of the proxy cache. But for efficient web mining and to extract meaningful usage access pattern, the raw log file must be transformed into a meaningful & formatted file. This paper proposed a new dynamic preprocess technique to create a dynamic training dataset for prediction model using web mining, and Graph based substructure Pattern Mining (GSPAN) for improved preprocessing using proxy log. The proposed model would help in minimizing the cache size by 40% thus improving the overall performance.
引用
收藏
页码:3810 / 3815
页数:6
相关论文
共 50 条
  • [21] Preprocessing and mining web log data for web personalization
    Baglioni, M
    Ferrara, U
    Romei, A
    Ruggieri, S
    Turini, F
    AI(ASTERISK)IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 237 - 249
  • [22] Mining web logs to locate target web pages
    Guo, Ping
    Yang, Houqun
    Chen, Ting
    Wang, Yanxia
    Journal of Computational Information Systems, 2007, 3 (04): : 1691 - 1698
  • [23] An innovative data collection method to eliminate the preprocessing phase in web usage mining
    Canay, Ozkan
    Kocabicak, Umit
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2023, 40
  • [24] A Review Paper on Data Preprocessing: A Critical Phase in Web Usage Mining Process
    Dwivedi, Sanjay Kumar
    Rawat, Bhupesh
    2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 506 - 510
  • [25] Mining the Query Logs of a Chinese Web Search Engine for Character Usage Analysis
    Lu, Yan
    Chau, Michael
    Fang, Xiao
    PACIFIC ASIA CONFERENCE ON INFORMATION SYSTEMS 2006, SECTIONS 1-8, 2006, : 346 - +
  • [26] Active user-based and ontology-based web log data preprocessing for web usage mining
    Khasawneh, Natheer
    Chan, Chien-Chung
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 325 - +
  • [27] Adaptive Web sites by Web usage mining
    Fu, YJ
    Creado, M
    Shih, MY
    IC'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, VOLS I AND II, 2001, : 28 - 34
  • [28] A web usage mining algorithm for web personalization
    Picariello, Antonio
    Sansone, Carlo
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2008, 2 (04): : 219 - 230
  • [29] Data Preprocessing for Web Data Mining
    Zhang, Wei
    Chen, Tinggui
    ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 2, 2012, 149 : 303 - +
  • [30] DATA PREPROCESSING IN WEB TEXT MINING
    Jiang Yongbo
    FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2012), 2012, : 573 - 581