Early detection and data analysis on web mining for relevance

Cited by: 0
Authors
Lee, CC [1]
Yang, YX [1]
Affiliation
[1] Calif State Univ Hayward, Hayward, CA 94542 USA
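Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract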
Topic-specific search engines such as MathSearch and xCentral, which return results restricted to relevant topics, have recently been developed. However, these engines require intensive human effort to build and maintain, and they visit too many irrelevant pages. In this project, we propose a new approach to topic-specific Web mining for relevance. First, we perform early detection of "candidate topics" while extracting words from the HTML text. Second, we analyze appearance information for the candidate topics, such as how often and where they appear. Together, these two techniques reduce the crawling times and computing cost for candidate topics. We analyze the results and compare them with related research to demonstrate the effectiveness of our approach.
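The two steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no algorithmic detail, so the sketch assumes that "early detection" means flagging a word as a candidate as soon as it is extracted from a prominent place (title, heading, or link text), and that "data analysis" means a weighted count over appearance places. The names `AppearanceCollector`, `score_candidates`, `PLACE_WEIGHTS`, and `TAG_TO_PLACE`, as well as the weight values, are hypothetical.

```python
from collections import Counter, defaultdict
from html.parser import HTMLParser
import re

# Hypothetical weights per appearance place; the abstract gives no values.
PLACE_WEIGHTS = {"title": 3.0, "heading": 2.0, "anchor": 1.5, "body": 1.0}
# Assumed mapping from HTML tags to the "places" a word can appear in.
TAG_TO_PLACE = {"title": "title", "h1": "heading", "h2": "heading",
                "h3": "heading", "a": "anchor"}
WORD_RE = re.compile(r"[a-z]{3,}")  # lowercase words of 3+ letters


class AppearanceCollector(HTMLParser):
    """Records where each word appears while the HTML is being parsed."""

    def __init__(self):
        super().__init__()
        self.open_places = []                    # stack of prominent places currently open
        self.appearances = defaultdict(Counter)  # word -> Counter of places
        self.candidates = set()                  # words flagged during extraction

    def handle_starttag(self, tag, attrs):
        if tag in TAG_TO_PLACE:
            self.open_places.append(TAG_TO_PLACE[tag])

    def handle_endtag(self, tag):
        place = TAG_TO_PLACE.get(tag)
        if place and self.open_places and self.open_places[-1] == place:
            self.open_places.pop()

    def handle_data(self, data):
        place = self.open_places[-1] if self.open_places else "body"
        for word in WORD_RE.findall(data.lower()):
            self.appearances[word][place] += 1
            if place != "body":
                # Assumed "early detection": flag the word immediately,
                # without a second pass over the page.
                self.candidates.add(word)


def score_candidates(html):
    """Return a weighted appearance score for each candidate word."""
    collector = AppearanceCollector()
    collector.feed(html)
    collector.close()
    return {
        word: sum(PLACE_WEIGHTS[place] * count
                  for place, count in collector.appearances[word].items())
        for word in collector.candidates
    }
```

A crawler built along these lines could follow links only from pages whose top-scoring candidates match the target topic, which is one plausible way the crawling and computing savings described above might arise.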
Pages: 62-66
Page count: 5