A New Hidden Web Crawling Approach

被引:0
|
作者
Saoudi, L. [1 ]
Boukerram, A. [2 ]
Mhamedi, S. [1 ]
机构
[1] Mohammed Boudiaf Univ, Dept Comp Sci, Msila, Algeria
[2] Abderrahmane Mira Univ, Dept Comp Sci, Bejaia, Algeria
关键词
Deep crawler; Hidden Web crawler; SQLI query; form submission; searchable forms;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Traditional search engines deal with the Surface Web which is a set of Web pages directly accessible through hyperlinks and ignores a large part of the Web called hidden Web which is a great amount of valuable information of online database which is "hidden" behind the query forms. To access to those information the crawler have to fill the forms with a valid data, for this reason we propose a new approach which use SQLI technique in order to find the most promising keywords of a specific domain for automatic form submission. The effectiveness of proposed framework has been evaluated through experiments using real web sites and encouraging preliminary results were obtained
引用
收藏
页码:293 / 297
页数:5
相关论文
共 50 条
  • [31] EFFECTS OF CRAWLING STRATEGIES ON THE PERFORMANCE OF FOCUSED WEB CRAWLING
    Pirkola, Ari
    Talvensaari, Tuomas
    WEBIST 2009: PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2009, : 376 - 381
  • [32] Crawling Hidden Objects with kNN Queries
    Yan, Hui
    Gong, Zhiguo
    Zhang, Nan
    Huang, Tao
    Zhong, Hua
    Wei, Jun
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1536 - 1537
  • [33] A Novel and Efficient Approach For Near Duplicate Page Detection in Web Crawling
    Narayana, V. A.
    Premchand, P.
    Govardhan, A.
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1492 - +
  • [34] Mining the web with hierarchical crawlers - A resource sharing based crawling approach
    Kundu, Anirban
    Dutta, Ruma
    Dattagupta, Rana
    Mukhopadhyay, Debajyoti
    International Journal of Intelligent Information and Database Systems, 2009, 3 (01) : 90 - 106
  • [35] Global Trends in Social Prescribing: Web-Based Crawling Approach
    Lee, Hocheol
    Koh, Sang Baek
    Jo, Heui Sug
    Lee, Tae Ho
    Nam, Hae Kweun
    Zhao, Bo
    Lim, Subeen
    Lim, Joo Aeh
    Lee, Ho Hee
    Hwang, Yu Seong
    Kim, Dong Hyun
    Nam, Eun Woo
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [36] An Approach to Incremental Deep Web Crawling Based on Incremental Harvest Model
    Huang, Qiuyan
    Li, Qingzhong
    Li, Hong
    Yan, Zhongmin
    2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 : 1081 - 1087
  • [37] Crawling Hidden Objects with kNN Queries
    Yan, Hui
    Gong, Zhiguo
    Zhang, Nan
    Huang, Tao
    Zhong, Hua
    Wei, Jun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (04) : 912 - 924
  • [38] Web Crawling Technique for Vulnerability Assessment on Web
    Yudha, Fietyata
    Panji, Andi Muhammad T.
    Adiputro, Laksono A. R.
    Ramadhani, Erika
    LECTURE NOTES IN ELECTRICAL, ELECTRONIC AND COMPUTER ENGINEERING, 2019, : 48 - 54
  • [39] An Architecture for Efficient Web Crawling
    Hernandez, Inma
    Rivero, Carlos R.
    Ruiz, David
    Corchuelo, Rafael
    ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS, CAISE 2012, 2012, 112 : 228 - 234
  • [40] Crawling toward a Wiser Web
    Hayes, Brian
    AMERICAN SCIENTIST, 2015, 103 (03) : 184 - 187