Efficient and effective Web change detection

被引:9
|
作者
Flesca, S
Masciari, E
机构
[1] Univ Calabria, Fac Engn, DEIS, I-87036 Arcavacata Di Rende, Italy
[2] CNR, ICAR, I-87036 Arcavacata Di Rende, Italy
关键词
update monitoring; continuous queries; WWW tools;
D O I
10.1016/S0169-023X(02)00210-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a new technique for detecting changes in Web documents. The technique is based on a new method to measure the similarity of two documents, that represent the actual and the previous version of the monitored page. The technique has been effectively used to discover changes in selected portions of the original document. The proposed technique has been implemented in the CMW system providing a change monitoring service on the Web. The main features of CMW are the detection of changes on selected portions of web documents and the possibility to express complex queries on the changed information. For instance, a query can require to check if the value of a given stock has increased by more than 10%. Several tests on stock exchange and auction web pages proved the effectiveness of the proposed approach. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:203 / 224
页数:22
相关论文
共 50 条
  • [41] Efficient and Effective Duplicate Detection in Hierarchical Data
    Leitao, Luis
    Calado, Pavel
    Herschel, Melanie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (05) : 1028 - 1041
  • [42] RaceTracker:Effective and Efficient Detection of Data Races
    Yang, Zhen
    Yu, Zhen
    Su, Xiaohong
    Ma, Peijun
    2016 17TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2016, : 293 - 300
  • [43] Effective, efficient, and robust packing detection and classification
    Biondi, Fabrizio
    Enescu, Michael A.
    Given-Wilson, Thomas
    Legay, Axel
    Noureddine, Lamine
    Verma, Vivek
    COMPUTERS & SECURITY, 2019, 85 : 436 - 451
  • [44] Towards An Effective And Efficient Malware Detection System
    Chia Tien Dan Lo
    Pablo, Ordonez
    Carlos, Cepeda Mora
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3648 - 3655
  • [45] An Enhanced Architecture and Algorithm for Web Page Change Detection
    Varshney, Naveen Kumar
    Sharma, Dilip Kumar
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND COMPUTER NETWORKS (ISCON), 2013, : 151 - 154
  • [46] Change Detection and Correction Facilitation for Web Applications and Services
    Alba, Alfredo
    Bhagwan, Varun
    Grandison, Tyrone
    Gruhl, Daniel
    Pieper, Jan
    2009 IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, VOLS 1 AND 2, 2009, : 1012 - 1013
  • [47] Parallel crawler architecture and web page change detection
    Computer Science and Information Technology, Jaypee Institute of Information Technology University, Noida, India
    WSEAS Trans. Comput., 2008, 7 (929-940):
  • [48] Change Detection Optimization in Frequently Changing Web Pages
    Meegahapola, L. B.
    Alwis, P. K. D. R. M.
    Nimalarathna, L. B. E. H.
    Mallawaarachchi, V. G.
    Meedeniya, D. A.
    Jayarathna, Sampath
    2017 3RD INTERNATIONAL MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2017, : 111 - 116
  • [49] A Novel Architecture and Algorithm for Web Page Change Detection
    Varshney, Naveen Kumar
    Sharma, Dilip Kumar
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 782 - 787
  • [50] Change detection and correction facilitation for web applications and services
    IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120, United States
    IEEE Int. Conf. Web Serv., ICWS, 2009, (1012-1013):