Towards Flexible Retrieval, Integration and Analysis of JSON']JSON Data Sets through Fuzzy Sets: A Case Study

被引:11
|
作者
Fosci, Paolo [1 ]
Psaila, Giuseppe [1 ]
机构
[1] Univ Bergamo, Dept Management Informat & Prod Engn, I-24044 Dalmine, BG, Italy
关键词
retrieving [!text type='JSON']JSON[!/text] documents from the web; open data portals; fuzzy sets and soft selection conditions; platform-independent framework; case study; INFORMATION; DATABASES; AGGREGATION; FRAMEWORK; QUERIES; SYSTEMS;
D O I
10.3390/info12070258
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How to exploit the incredible variety of JSON data sets currently available on the Internet, for example, on Open Data portals? The traditional approach would require getting them from the portals, then storing them into some JSON document store and integrating them within the document store. However, once data are integrated, the lack of a query language that provides flexible querying capabilities could prevent analysts from successfully completing their analysis. In this paper, we show how the J-CO Framework, a novel framework that we developed at the University of Bergamo (Italy) to manage large collections of JSON documents, is a unique and innovative tool that provides analysts with querying capabilities based on fuzzy sets over JSON data sets. Its query language, called J-CO-QL, is continuously evolving to increase potential applications; the most recent extensions give analysts the capability to retrieve data sets directly from web portals as well as constructs to apply fuzzy set theory to JSON documents and to provide analysts with the capability to perform imprecise queries on documents by means of flexible soft conditions. This paper presents a practical case study in which real data sets are retrieved, integrated and analyzed to effectively show the unique and innovative capabilities of the J-CO Framework.
引用
收藏
页数:41
相关论文
共 50 条
  • [21] Towards effective analysis of large grain boundary data sets
    Glowinski, K.
    Morawiec, A.
    17TH INTERNATIONAL CONFERENCE ON TEXTURES OF MATERIALS (ICOTOM 17), 2015, 82
  • [22] Towards Integrative Causal Analysis of Heterogeneous Data Sets and Studies
    Tsamardinos, Ioannis
    Triantafillou, Sofia
    Lagani, Vincenzo
    JOURNAL OF MACHINE LEARNING RESEARCH, 2012, 13 : 1097 - 1157
  • [23] Towards integrative causal analysis of heterogeneous data sets and studies
    Tsamardinos, Ioannis
    Triantafillou, Sofia
    Lagani, Vincenzo
    Journal of Machine Learning Research, 2012, 13 : 1097 - 1157
  • [24] A case study for medical decision making with the fuzzy soft sets
    Murat Kirişci
    Afrika Matematika, 2020, 31 : 557 - 564
  • [25] A case study for medical decision making with the fuzzy soft sets
    Kirisci, Murat
    AFRIKA MATEMATIKA, 2020, 31 (3-4) : 557 - 564
  • [26] Rough sets and few-objects-many-attributes problem: The case study analysis of gene expression data sets
    Slezak, Dominik
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 437 - 440
  • [27] DeepBlue epigenomic data server: programmatic data retrieval and analysis of epigenome region sets
    Albrecht, Felipe
    List, Markus
    Bock, Christoph
    Lengauer, Thomas
    NUCLEIC ACIDS RESEARCH, 2016, 44 (W1) : W581 - W586
  • [28] PANDORA: analysis of protein and peptide sets through the hierarchical integration of annotations
    Rappoport, Nadav
    Fromer, Menachem
    Schweiger, Regev
    Linial, Michal
    NUCLEIC ACIDS RESEARCH, 2010, 38 : W84 - W89
  • [29] FUZZY-SETS IN INFORMATION-RETRIEVAL AND CLUSTER-ANALYSIS - MIYAMOTO,S
    HATHAWAY, RJ
    JOURNAL OF CLASSIFICATION, 1991, 8 (02) : 284 - 286
  • [30] A generalized fuzzy data envelopment analysis with restricted fuzzy sets and determined constraint condition
    Meng, Xiao-Li
    Shi, Fu-Gui
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 33 (03) : 1895 - 1905