easIE: Easy-to-Use Information Extraction for Constructing CSR Databases From the Web

被引:5
|
作者
Gkatziaki, Vasiliki [1 ]
Papadopoulos, Symeon [1 ]
Mills, Richard [2 ]
Diplaris, Sotiris [1 ]
Tsampoulatidis, Ioannis [1 ]
Kompatsiaris, Ioannis [1 ]
机构
[1] ITI, CERTH ITI, Thessaloniki, Greece
[2] Univ Cambridge, Cambridge, England
基金
欧盟地平线“2020”;
关键词
Information extraction; Web wrapper; corporate social responsibility (CSR); environmental; social; and governance (ESG); CORPORATE SOCIAL-RESPONSIBILITY; DISCLOSURE; PERFORMANCE;
D O I
10.1145/3155807
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Public awareness of and concerns about companies' social and environmental impacts have seen a marked increase over recent decades. In parallel, the quantity of relevant information has increased, as states pass laws requiring certain forms of reporting, researchers investigate companies' performance, and companies themselves seek to gain a competitive advantage by being seen to operate fairly and transparently. However, this information is typically dispersed and non-standardized, making it complicated to collect and analyze. To address this challenge, the WikiRate platform aims to collect this information and store it in a standardized format within a centralized public repository, making it much more amenable to analysis. In the context of WikiRate, this article introduces easIE, an easy-to-use information extraction (IE) framework that leverages general Web IE principles for building datasets with environmental, social, and governance information from the Web. To demonstrate the flexibility and value of easIE, we built a large-scale corporate social responsibility database comprising 654,491 metrics related to 49,009 companies spending less than 16 hours for data engineering, collection, and indexing. Finally, a data collection exercise involving 12 subjects was performed to showcase the ease of use of the developed framework.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] VisualMLTCGA: An Easy-to-Use Web Tool for the Visualization, Processing and Classification of Clinical and Genomic TCGA Data
    Garin-Muga, Alba
    Maria Sucre, Aurora
    Torres, Jordi
    Kerexeta, Jon
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 5: HEALTHINF, 2020, : 413 - 420
  • [22] Easy-to-use multimedia tools and scalable distributed architectures for web-based teaching and learning
    Jesshope, CR
    DCABES 2001 PROCEEDINGS, 2001, : 52 - 60
  • [23] Captioning on-demand and real-time web multimedia - Easy-to-use tools for educators
    Smith, J
    ED-MEDIA 2004: World Conference on Educational Multimedia, Hypermedia & Telecommunications, Vols. 1-7, 2004, : 5221 - 5223
  • [24] Web Services for information extraction from the Web
    Habegger, B
    Quafafou, M
    IEEE INTERNATIONAL CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2004, : 279 - 286
  • [25] Advantages of an easy-to-use DNA extraction method for minimal-destructive analysis of collection specimens
    Patzold, Franziska
    Zilli, Alberto
    Hundsdoerfer, Anna K.
    PLOS ONE, 2020, 15 (07):
  • [26] Use of relational databases to improve Web access to climate information
    Collins, JA
    Ray, A
    14TH INTERNATIONAL CONFERENCE ON INTERACTIVE INFORMATION AND PROCESSING SYSTEM (IIPS) FOR METEOROLOGY, OCEANOGRAPHY, AND HYDROLOGY, 1998, : 413 - 416
  • [27] Uncertainty indication in soil function maps - transparent and easy-to-use information to support sustainable use of soil resources
    Greiner, Lucie
    Nussbaum, Madlene
    Papritz, Andreas
    Zimmermann, Stephan
    Gubler, Andreas
    Gret-Regamey, Adrienne
    Keller, Armin
    SOIL, 2018, 4 (02) : 123 - 139
  • [28] SENSE tool: easy-to-use web-based tool to calculate food product environmental impact
    Saioa Ramos
    Lohitzune Larrinaga
    Unai Albinarrate
    Niels Jungbluth
    Gyda Mjöll Ingolfsdottir
    Eva Yngvadottir
    Birgit Landquist
    Anna Woodhouse
    Gudrun Olafsdottir
    Aintzane Esturo
    Jaime Zufía
    Begoña Perez-Villareal
    The International Journal of Life Cycle Assessment, 2016, 21 : 710 - 721
  • [29] PrecisePrimer: an easy-to-use web server for designing PCR primers for DNA library cloning and DNA shuffling
    Pauthenier, Cyrille
    Faulon, Jean-Loup
    NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) : W205 - W209
  • [30] Building databases with information extracted from web documents
    Gutiérrez, A
    Motz, R
    Viera, D
    XX INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY - PROCEEDINGS, 2000, : 41 - 49