Efficient and effective Web change detection

被引:9
|
作者
Flesca, S
Masciari, E
机构
[1] Univ Calabria, Fac Engn, DEIS, I-87036 Arcavacata Di Rende, Italy
[2] CNR, ICAR, I-87036 Arcavacata Di Rende, Italy
关键词
update monitoring; continuous queries; WWW tools;
D O I
10.1016/S0169-023X(02)00210-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a new technique for detecting changes in Web documents. The technique is based on a new method to measure the similarity of two documents, that represent the actual and the previous version of the monitored page. The technique has been effectively used to discover changes in selected portions of the original document. The proposed technique has been implemented in the CMW system providing a change monitoring service on the Web. The main features of CMW are the detection of changes on selected portions of web documents and the possibility to express complex queries on the changed information. For instance, a query can require to check if the value of a given stock has increased by more than 10%. Several tests on stock exchange and auction web pages proved the effectiveness of the proposed approach. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:203 / 224
页数:22
相关论文
共 50 条
  • [21] Distributed and Collaborative Web Change Detection System
    Prieto, Victor M.
    Alvarez, Manuel
    Carneiro, Victor
    Cacheda, Fidel
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2015, 12 (01) : 91 - 114
  • [22] Change Detection and Notification of Web Pages: A Survey
    Mallawaarachchi, Vijini
    Meegahapola, Lakmal
    Madhushanka, Roshan
    Heshan, Eranga
    Meedeniya, Dulani
    Jayarathna, Sampath
    ACM COMPUTING SURVEYS, 2020, 53 (01)
  • [23] Efficient on-the-fly Web bot detection
    Suchacka, Grażyna
    Cabri, Alberto
    Rovetta, Stefano
    Masulli, Francesco
    Knowledge-Based Systems, 2021, 223
  • [24] Efficient on-the-fly Web bot detection
    Suchacka, Grazyna
    Cabri, Alberto
    Rovetta, Stefano
    Masulli, Francesco
    KNOWLEDGE-BASED SYSTEMS, 2021, 223
  • [25] Efficient and robust shot change detection
    Lefevre, Sebastien
    Vincent, Nicole
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2007, 2 (01) : 23 - 34
  • [26] Efficient and robust shot change detection
    Sébastien Lefèvre
    Nicole Vincent
    Journal of Real-Time Image Processing, 2007, 2 : 23 - 34
  • [27] Robust and Efficient Change Detection Algorithm
    Yu, Fei
    Chukwu, Michael
    Wu, Q. M. Jonathan
    ACTIVE MEDIA TECHNOLOGY, 2010, 6335 : 338 - 344
  • [28] Efficient Byzantine Sequential Change Detection
    Fellouris, Georgios
    Bayraktar, Erhan
    Lai, Lifeng
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (05) : 3346 - 3360
  • [29] Fixing the Threshold for Effective Detection of Near Duplicate Web Documents in Web Crawling
    Narayana, V. A.
    Premchand, P.
    Govardhan, A.
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I, 2010, 6440 : 169 - 180
  • [30] An effective feature selection method for web spam detection
    Asdaghi, Faeze
    Soleimani, Ali
    KNOWLEDGE-BASED SYSTEMS, 2019, 166 : 198 - 206