Early detection and data analysis on web mining for relevance

被引:0
|
作者
Lee, CC [1 ]
Yang, YX [1 ]
机构
[1] Calif State Univ Hayward, Hayward, CA 94542 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic-specific search engines such as MathSearch and xCentral that offered users relevant topics as search results have recently been developed. However, these topic-specific search engines require intensive human efforts to build and maintain. In addition, they visit too many irrelevant pages. In our project, we propose a new approach for topic-specific Web mining for relevance. First, we do early detection for "candidate topics" while extracting words from the HTML text. Secondly, we perform data analysis on the appearance information such as appearance times and places for candidate topics. By these two techniques, we can reduce candidate topics' crawling times and computing cost. Analysis of the results and the comparisons with related research will be made to demonstrate the effectiveness of our approach.
引用
收藏
页码:62 / 66
页数:5
相关论文
共 50 条
  • [31] Data mining on the Web - Response
    Berners-Lee, Tim
    Hall, Wendy
    Hendler, James
    Shadbolt, Nigel
    Weitzner, Daniel J.
    SCIENCE, 2006, 314 (5806) : 1682 - 1682
  • [32] Web data mining trends
    Baeza-Yates, Ricardo
    PROFESIONAL DE LA INFORMACION, 2009, 18 (01): : 5 - 10
  • [33] Data mining for Web intelligence
    Han, JW
    Chang, KCC
    COMPUTER, 2002, 35 (11) : 64 - +
  • [34] Web for data mining applications
    Liu, B
    Ma, YM
    Wong, CK
    24TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COSPSAC 2000), 2000, 24 : 465 - 466
  • [35] The RINGS Resource for Glycome Informatics Analysis and Data Mining on the Web
    Akune, Yukie
    Hosoda, Masae
    Kaiya, Sakiko
    Shinmachi, Daisuke
    Aoki-Kinoshita, Kiyoko F.
    OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY, 2010, 14 (04) : 475 - 486
  • [36] A novel representation of graph structures in web mining and data analysis
    Blazewicz, J
    Pesch, E
    Sterna, M
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2005, 33 (01): : 65 - 71
  • [37] Mining Web data on a budget
    Banks, MA
    ONLINE, 2003, 27 (05): : 32 - 35
  • [38] Analysis of web-based learning systems by data mining
    Villegas-Ch, W.
    Lujan-Mora, S.
    Buenano-Fernandez, Diego
    Roman-Canizares, M.
    2017 IEEE SECOND ECUADOR TECHNICAL CHAPTERS MEETING (ETCM), 2017,
  • [39] The Relevance of Open Data Principles for the Web of Data
    Herrera-Cubides, Jhon Francined
    Gaona-Garcia, Paulo Alonso
    Montenegro-Marin, Carlos Enrique
    Sanchez-Alonso, Salvador
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2023, 2023
  • [40] User Reviews Data Analysis using Opinion Mining on Web
    Dubey, Gaurav
    Rana, Ajay
    Shukla, Naveen Kumar
    2015 1ST INTERNATIONAL CONFERENCE ON FUTURISTIC TRENDS ON COMPUTATIONAL ANALYSIS AND KNOWLEDGE MANAGEMENT (ABLAZE), 2015, : 603 - 612