An Open Data Repository for Engineering Design: Using Text Mining with Open Government Data

被引:5
|
作者
Giordano, Vito [1 ,3 ]
Coli, Elena [1 ,3 ]
Martini, Antonella [2 ,3 ]
机构
[1] Dept Informat Engn, Via Girolamo Caruso 16, I-56122 Pisa, Italy
[2] Dept Energy Syst Terr & Construct Engn, Largo Lucio Lazzarino 2, I-56122 Pisa, Italy
[3] Business Engn Data Sci B4DS Res Lab, Pisa, Italy
关键词
Engineering Design; Natural Language Processing; Open Data; Open Government Data; Open Data Repository; BIG DATA; INNOVATION; BARRIERS; ANALYTICS; EDUCATION;
D O I
10.1016/j.compind.2022.103738
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Engineering Design (ED) is a complex process in which the reuse of knowledge is crucial: applying the knowledge consolidated in previous design activities to future design activities means performing them in a better way. The relevance of data in ED is even more crucial in a business context in which Data Science (DS) is literally revolutionizing the way companies operate and therefore also the way data are analyzed. Despite having been recognized as crucial for ED processes, data still remain closed in the domain and accessible only to their owners due to several constraints related to the private and proprietary nature of the acquired data. An answer to these challenges could be found in Open Data, but at the state of the art an operational Engineering Design framework to embrace them is still far to be achieved by both academia and industry. Given these issues, the aim of this paper is to give evidence that Text Mining can help to make a complex open database more effective to be used for the ED process, taking U.S. Open Government Data (OGD) repository as a case study. Open access to methods and data used within this research is provided. The results of this study allow us to understand for which purposes it is possible to apply the datasets and to comprehend the expertise and the data science methods needed for processing different data for-mats. Moreover, this work opens relevant implications and challenges for researchers, practitioners and policy makers operating in ED and DS domains that could become opportunities for future research and industrial applications. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Open data and open government
    Goda Szilard
    INFORMACIOS TARSADALOM, 2011, 11 (1-4): : 181 - +
  • [2] Open Government, Open Data and Digital Government
    Luna-Reyes, Luis Felipe
    Bertot, John C.
    Mellouli, Sehl
    GOVERNMENT INFORMATION QUARTERLY, 2014, 31 (01) : 4 - 5
  • [3] Mining Open Government Data Used in Scientific Research
    Yan, An
    Weber, Nicholas
    TRANSFORMING DIGITAL WORLDS, ICONFERENCE 2018, 2018, 10766 : 303 - 313
  • [4] SPATIAL GRID BASED OPEN GOVERNMENT DATA MINING
    Zhang, Chenxiao
    Yue, Peng
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 192 - 193
  • [5] Barriers to Using Open Government Data
    Wieczorkowski, Jedrzej
    3RD INTERNATIONAL CONFERENCE ON E-COMMERCE, E-BUSINESS AND E-GOVERNMENT, ICEEG 2019, 2019, : 15 - 20
  • [6] To open or not to open? Determinants of open government data
    Yang, Tung-Mou
    Lo, Jin
    Shiang, Jing
    JOURNAL OF INFORMATION SCIENCE, 2015, 41 (05) : 596 - 612
  • [7] DATA-DRIVEN ENGINEERING DESIGN RESEARCH: OPPORTUNITIES USING OPEN DATA
    Parraguez, Pedro
    Maier, Anja
    DS87-7 PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON ENGINEERING DESIGN (ICED 17), VOL 7: DESIGN THEORY AND RESEARCH METHODOLOGY, 2017, : 41 - 50
  • [8] Using Text Mining and Linked Open Data to assist the Mashup of Educational Resources
    Vallejo-Figueroa, Santa
    Rodriguez-Artacho, Miguel
    Castro-Gil, Manuel
    San Cristobal, Elio
    PROCEEDINGS OF 2018 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON) - EMERGING TRENDS AND CHALLENGES OF ENGINEERING EDUCATION, 2018, : 1606 - 1611
  • [9] Connecting Repositories in the Open Access Domain Using Text Mining and Semantic Data
    Knoth, Petr
    Robotka, Vojtech
    Zdrahal, Zdenek
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, TPDL 2011, 2011, 6966 : 483 - 487
  • [10] DESIGN AND SPECIFICATIONS OF A REPOSITORY FOR REAL-TIME OPEN DATA
    Lutchman, Sudesh
    Hosein, Patrick
    PROCEEDINGS OF THE 2014 ITU KALEIDOSCOPE ACADEMIC CONFERENCE: LIVING IN A CONVERGED WORLD: IMPOSSIBLE WITHOUT STANDARDS?, 2014,