easIE: Easy-to-Use Information Extraction for Constructing CSR Databases From the Web

被引:5
|
作者
Gkatziaki, Vasiliki [1 ]
Papadopoulos, Symeon [1 ]
Mills, Richard [2 ]
Diplaris, Sotiris [1 ]
Tsampoulatidis, Ioannis [1 ]
Kompatsiaris, Ioannis [1 ]
机构
[1] ITI, CERTH ITI, Thessaloniki, Greece
[2] Univ Cambridge, Cambridge, England
基金
欧盟地平线“2020”;
关键词
Information extraction; Web wrapper; corporate social responsibility (CSR); environmental; social; and governance (ESG); CORPORATE SOCIAL-RESPONSIBILITY; DISCLOSURE; PERFORMANCE;
D O I
10.1145/3155807
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Public awareness of and concerns about companies' social and environmental impacts have seen a marked increase over recent decades. In parallel, the quantity of relevant information has increased, as states pass laws requiring certain forms of reporting, researchers investigate companies' performance, and companies themselves seek to gain a competitive advantage by being seen to operate fairly and transparently. However, this information is typically dispersed and non-standardized, making it complicated to collect and analyze. To address this challenge, the WikiRate platform aims to collect this information and store it in a standardized format within a centralized public repository, making it much more amenable to analysis. In the context of WikiRate, this article introduces easIE, an easy-to-use information extraction (IE) framework that leverages general Web IE principles for building datasets with environmental, social, and governance information from the Web. To demonstrate the flexibility and value of easIE, we built a large-scale corporate social responsibility database comprising 654,491 metrics related to 49,009 companies spending less than 16 hours for data engineering, collection, and indexing. Finally, a data collection exercise involving 12 subjects was performed to showcase the ease of use of the developed framework.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] An easy-to-use, comprehensive information service for designers
    Woishnis, W
    PLASTICS ENGINEERING, 1996, 52 (06) : 33 - &
  • [2] An easy-to-use eLearning web authoring system for educators
    Fong, Joseph
    Yeung, Yin Fei
    ADVANCES IN WEB BASED LEARNING - ICWL 2006, 2006, 4181 : 154 - +
  • [3] Easy-to-use programming model for Web Services Security
    Yamaguchi, Yumi
    Chung, Hyen-Vui
    Teraguchi, Masayoshi
    Uramoto, Naohiko
    2ND IEEE ASIA-PACIFIC SERVICES COMPUTING CONFERENCE, PROCEEDINGS, 2007, : 275 - +
  • [4] Comparison of Various Easy-to-Use Procedures for Extraction of Phenols from Apricot Fruits
    Zitka, Ondrej
    Sochor, Jiri
    Rop, Otakar
    Skalickova, Sylvie
    Sobrova, Pavlina
    Zehnalek, Josef
    Beklova, Miroslava
    Krska, Boris
    Adam, Vojtech
    Kizek, Rene
    MOLECULES, 2011, 16 (04): : 2914 - 2936
  • [5] IMDE: an Easy-to-use Web Server for Missing Data Estimation
    Chiu, Chia-Chun
    Wu, Wei-Sheng
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 511 - 514
  • [6] OntoQuery: easy-to-use web-based OWL querying
    Tudose, Ilinca
    Hastings, Janna
    Muthukrishnan, Venkatesh
    Owen, Gareth
    Turner, Steve
    Dekker, Adriano
    Kale, Namrata
    Ennis, Marcus
    Steinbeck, Christoph
    BIOINFORMATICS, 2013, 29 (22) : 2955 - 2957
  • [7] MIFE: An Easy-to-Use Web-Based Tool for Standardized Radiomics Features Extraction in Medical Images
    de Avila-Armenta, Eduardo
    Celaya-Padilla, Jose M.
    Galvan-Tejada, Jorge I.
    Soto-Murillo, Manuel A.
    Hernandez-Guitierrez, Andres
    Alvarado-Padilla, Jose J.
    Rios-Rios, Jose I.
    Martinez-Torteya, Antonio
    18TH INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND COMMUNICATIONS, FNC 2023/20TH INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS AND PERVASIVE COMPUTING, MOBISPC 2023/13TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY, SEIT 2023, 2023, 224 : 106 - 113
  • [8] Towards an Easy-to-Use Web Application Server and Cloud PaaS for Web Development Education
    Brune, Philipp
    Leisner, Michael
    Janke, Erica
    2014 IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2014 IEEE 6TH INTL SYMP ON CYBERSPACE SAFETY AND SECURITY, 2014 IEEE 11TH INTL CONF ON EMBEDDED SOFTWARE AND SYST (HPCC,CSS,ICESS), 2014, : 1113 - 1116
  • [9] DiscoRhythm: an easy-to-use web application and R package for discovering rhythmicity
    Carlucci, Matthew
    Krisciunas, Algimantas
    Li, Haohan
    Gibas, Povilas
    Koncevicius, Karolis
    Petronis, Art
    Oh, Gabriel
    BIOINFORMATICS, 2020, 36 (06) : 1952 - 1954
  • [10] Alice: Model, paint & animate - Easy-to-use interactive graphics for the web
    Pausch, R
    Forlines, C
    COMPUTER GRAPHICS-US, 2000, 34 (02): : 42 - 43