Implementing Cost-effective Data Collection and Extraction Processes with CollaMine

Cited by: 1
Authors
Lu, Kenny Zhuo Ming [1]
Heng, Belson [1]
Affiliations
[1] Nanyang Polytechnic, School of Information Technology, 180 Ang Mo Kio Ave 8, Singapore 569830, Singapore
Source
2016 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING RESEARCH AND INNOVATION - ICCCRI 2016 | 2016
DOI
10.1109/ICCCRI.2016.22
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
We present the CollaMine framework, which aims to reduce the cost of data collection and data extraction. The framework consists of two key components: a collaborative system for web crawlers and an automated regular expression diagnosis system for web content extractors. The empirical results show that both systems help reduce the costs of data collection and extraction.
Pages: 92-99
Page count: 8