Implementing Cost-effective Data Collection and Extraction Processes with CollaMine

被引:1
|
作者
Lu, Kenny Zhuo Ming [1 ]
Heng, Belson [1 ]
机构
[1] Nanyang Polytech, Sch Informat Technol, 180 Ang Mo Kio Ave 8, Singapore 569830, Singapore
关键词
D O I
10.1109/ICCCRI.2016.22
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present the CollaMine framework which aims to reduce the cost of data collection and data extraction. The framework consists of two key solutions, namely, a collaborative system for web crawlers, and an automated regular expression diagnosis system for web content extractors. The empirical results show that the systems help to reduce the costs of data collection and extraction.
引用
收藏
页码:92 / 99
页数:8
相关论文
共 50 条
  • [31] Cost-Effective Transfer Learning for Data Streams
    Wu, Ocean
    Koh, Yun Sing
    Dobbie, Gillian
    Lacombe, Thomas
    2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 1233 - 1238
  • [32] Operating Dedicated Data Centers - Is It Cost-Effective?
    Ernst, M.
    Hogue, R.
    Hollowell, C.
    Strecker-Kellog, W.
    Wong, A.
    Zaytsev, A.
    20TH INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2013), PARTS 1-6, 2014, 513
  • [34] Use of Smartphone Panels for Viable and Cost-Effective GPS Data Collection for Small and Medium Planning Agencies
    Flake, Leah
    Lee, Michelle
    Hathaway, Kevin
    Greene, Elizabeth
    TRANSPORTATION RESEARCH RECORD, 2017, 2643 (01) : 160 - 165
  • [35] A Virtual Reality Framework for Human-Driver Interaction Research: Safe and Cost-Effective Data Collection
    Crosato, Luca
    Wei, Chongfeng
    Ho, Edmond S. L.
    Shum, Hubert P. H.
    Sun, Yuzhu
    PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024, 2024, : 167 - 174
  • [36] A Cost-Effective Distributed Framework for Data Collection in Cloud-Based Mobile Crowd Sensing Architectures
    Capponi, Andrea
    Fiandrino, Claudio
    Kliazovich, Dzmitry
    Bouvry, Pascal
    Giordano, Stefano
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2017, 2 (01): : 3 - 16
  • [37] Cost-effective method of DNA extraction from taeniid eggs
    V. Dyachenko
    E. Beck
    N. Pantchev
    C. Bauer
    Parasitology Research, 2008, 102 : 811 - 813
  • [38] Cost-effective method of DNA extraction from taeniid eggs
    Dyachenko, V.
    Beck, E.
    Pantchev, N.
    Bauer, C.
    PARASITOLOGY RESEARCH, 2008, 102 (04) : 811 - 813
  • [39] Flattening the carbon extraction path in unilateral cost-effective action
    Eichner, Thomas
    Pethig, Ruediger
    JOURNAL OF ENVIRONMENTAL ECONOMICS AND MANAGEMENT, 2013, 66 (02) : 185 - 201
  • [40] Rapid and Cost-Effective RNA Extraction of Rat Pancreatic Tissue
    Dastghaib, Sanaz
    Shahsavar, Zahra
    Karimian, Zahra
    Mokarram, Pooneh
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2020, (163): : 1 - 8