A Model-Based Approach for Crawling Rich Internet Applications

被引:14
|
作者
Dincturk, Mustafa Emre [1 ]
Jourdan, Guy-Vincent [1 ]
Bochmann, Gregor V. [1 ]
Onut, Iosif Viorel [2 ]
机构
[1] Univ Ottawa, EECS, Ottawa, ON K1N 6N5, Canada
[2] IBM Corp, IBM Secur AppScan Enterprise, Res & Dev, Ottawa, ON K2V 1C8, Canada
关键词
Design; Algorithms; Experimentation; Crawling; rich Internet applications; AJAX; modeling; dynamic analysis; DOM;
D O I
10.1145/2626371
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
New Web technologies, like AJAX, result in more responsive and interactive Web applications, sometimes called Rich Internet Applications (RIAs). Crawling techniques developed for traditional Web applications are not sufficient for crawling RIAs. The inability to crawl RIAs is a problem that needs to be addressed for at least making RIAs searchable and testable. We present a new methodology, called "model-based crawling", that can be used as a basis to design efficient crawling strategies for RIAs. We illustrate model-based crawling with a sample strategy, called the "hypercube strategy". The performances of our model-based crawling strategies are compared against existing standard crawling strategies, including breadth-first, depth-first, and a greedy strategy. Experimental results show that our model-based crawling approach is significantly more efficient than these standard strategies.
引用
收藏
页数:39
相关论文
共 50 条
  • [1] MODEL-BASED RICH INTERNET APPLICATIONS CRAWLING: "MENU" AND "PROBABILITY" MODELS
    Choudhary, Suryakant
    Dincturk, Emre
    Mirtaheri, Seyed
    v Bochmann, Gregor
    Jourdan, Guy-Vincent
    Onut, Iosif Viorel
    JOURNAL OF WEB ENGINEERING, 2014, 13 (3-4): : 243 - 262
  • [2] A statistical approach for efficient crawling of rich internet applications
    EECS, University of Ottawa, 800 King Edward Avenue, Ottawa, ON K1N 6N5, Canada
    不详
    不详
    Lect. Notes Comput. Sci., (362-369):
  • [3] A Strategy for Efficient Crawling of Rich Internet Applications
    Benjamin, Kamara
    von Bochmann, Gregor
    Dincturk, Mustafa Emre
    Jourdan, Guy-Vincent
    Onut, Iosif Viorel
    WEB ENGINEERING, ICWE 2011, 2011, 6757 : 74 - 89
  • [4] A Strategy for Efficient Crawling of Rich Internet Applications
    SITE, University of Ottawa, 800 King Edward Avenue, Ottawa
    ON
    K1N 6N5, Canada
    不详
    ON, Canada
    Lect. Notes Comput. Sci., (74-89):
  • [6] Indexing Rich Internet Applications Using Components-Based Crawling
    Moosavi, Ali
    Hooshmand, Salman
    Baghbanzadeh, Sara
    Jourdan, Guy-Vincent
    Bochmann, Gregor V.
    Onut, Iosif Viorel
    WEB ENGINEERING, ICWE 2014, 2014, 8541 : 200 - 217
  • [7] Using a model-based engineering approach for developing Industrial Internet of Things applications
    Binder, Christoph
    Polanec, Katharina
    Schweiberer, Franz
    Neureiter, Christian
    Lastro, Goran
    Lueder, Arndt
    2020 6TH IEEE INTERNATIONAL SYMPOSIUM ON SYSTEMS ENGINEERING (IEEE ISSE 2020), 2020,
  • [8] Engineering Rich Internet Applications with a Model-Driven Approach
    Fraternali, Piero
    Comai, Sara
    Bozzon, Alessandro
    Carughi, Giovanni Toffetti
    ACM TRANSACTIONS ON THE WEB, 2010, 4 (02)
  • [9] A Behavioral model for rich Internet applications
    Comai, Sara
    Carughi, Giovanni Toffetti
    WEB ENGINEERING, PROCEEDINGS, 2007, 4607 : 364 - +
  • [10] Dialogue-based modeling of rich internet applications: the Rich-IDM approach
    Mainetti, Luca
    Paiano, Roberto
    Bolchini, Davide
    Pandurino, Andrea
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2012, 8 (02) : 157 - +