Early detection and data analysis on web mining for relevance

被引:0
|
作者
Lee, CC [1 ]
Yang, YX [1 ]
机构
[1] Calif State Univ Hayward, Hayward, CA 94542 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Topic-specific search engines such as MathSearch and xCentral that offered users relevant topics as search results have recently been developed. However, these topic-specific search engines require intensive human efforts to build and maintain. In addition, they visit too many irrelevant pages. In our project, we propose a new approach for topic-specific Web mining for relevance. First, we do early detection for "candidate topics" while extracting words from the HTML text. Secondly, we perform data analysis on the appearance information such as appearance times and places for candidate topics. By these two techniques, we can reduce candidate topics' crawling times and computing cost. Analysis of the results and the comparisons with related research will be made to demonstrate the effectiveness of our approach.
引用
收藏
页码:62 / 66
页数:5
相关论文
共 50 条
  • [21] Early DDoS Detection Based on Data Mining Techniques
    Xylogiannopoulos, Konstantinos
    Karampelas, Panagiotis
    Alhajj, Reda
    INFORMATION SECURITY THEORY AND PRACTICE: SECURING THE INTERNET OF THINGS, 2014, 8501 : 190 - 199
  • [22] Metabolomic data mining for early detection of endometrial cancer
    Svecova, M.
    Dubayova, K.
    Blahova, L.
    Kostolny, J.
    Urdzik, P.
    Marekova, M.
    FEBS OPEN BIO, 2024, 14 : 281 - 281
  • [23] Data Mining Techniques for Early Detection of Breast Cancer
    Cruz, Maria Ines
    Bernardino, Jorge
    KDIR: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL 1: KDIR, 2019, : 434 - 441
  • [24] Data Preprocessing for Web Data Mining
    Zhang, Wei
    Chen, Tinggui
    ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 2, 2012, 149 : 303 - +
  • [25] Phishing Website Detection Framework Through Web Scraping and Data Mining
    Park, Andrew J.
    Quadari, Ruhi Naaz
    Tsang, Herbert H.
    2017 8TH IEEE ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON), 2017, : 680 - 684
  • [26] Detection of Malicious Requests on Web Logs Using Data Mining Techniques
    Sahin, Mehmet Emin
    Ozdemir, Suat
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 463 - 468
  • [27] Analysis of Web Site Using Web Log Expert Tool Based on Web Data Mining
    Singh, Satya Prakash
    Meenu
    2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2017,
  • [28] A data mining approach for signal detection and analysis
    Bate, A
    Lindquist, M
    Edwards, IR
    Orre, R
    DRUG SAFETY, 2002, 25 (06) : 393 - 397
  • [29] A Data Mining Approach for Signal Detection and Analysis
    Andrew Bate
    Marie Lindquist
    I. Ralph. Edwards
    Roland Orre
    Drug Safety, 2002, 25 : 393 - 397
  • [30] Web usage data mining
    Ortega, Jose-Luis
    Aguillo, Isidro F.
    PROFESIONAL DE LA INFORMACION, 2009, 18 (01): : 20 - 26