easIE: Easy-to-Use Information Extraction for Constructing CSR Databases From the Web

Cited by: 5
Authors
Gkatziaki, Vasiliki [1 ]
Papadopoulos, Symeon [1 ]
Mills, Richard [2 ]
Diplaris, Sotiris [1 ]
Tsampoulatidis, Ioannis [1 ]
Kompatsiaris, Ioannis [1 ]
Affiliations
[1] ITI, CERTH ITI, Thessaloniki, Greece
[2] Univ Cambridge, Cambridge, England
Funding
EU Horizon 2020;
Keywords
Information extraction; Web wrapper; corporate social responsibility (CSR); environmental, social, and governance (ESG); CORPORATE SOCIAL-RESPONSIBILITY; DISCLOSURE; PERFORMANCE;
DOI
10.1145/3155807
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Public awareness of and concerns about companies' social and environmental impacts have seen a marked increase over recent decades. In parallel, the quantity of relevant information has increased, as states pass laws requiring certain forms of reporting, researchers investigate companies' performance, and companies themselves seek to gain a competitive advantage by being seen to operate fairly and transparently. However, this information is typically dispersed and non-standardized, making it complicated to collect and analyze. To address this challenge, the WikiRate platform aims to collect this information and store it in a standardized format within a centralized public repository, making it much more amenable to analysis. In the context of WikiRate, this article introduces easIE, an easy-to-use information extraction (IE) framework that leverages general Web IE principles for building datasets with environmental, social, and governance information from the Web. To demonstrate the flexibility and value of easIE, we built a large-scale corporate social responsibility database comprising 654,491 metrics related to 49,009 companies, spending less than 16 hours on data engineering, collection, and indexing. Finally, a data collection exercise involving 12 subjects was performed to showcase the ease of use of the developed framework.
Pages: 21