Towards Flexible Retrieval, Integration and Analysis of JSON']JSON Data Sets through Fuzzy Sets: A Case Study

被引:11
|
作者
Fosci, Paolo [1 ]
Psaila, Giuseppe [1 ]
机构
[1] Univ Bergamo, Dept Management Informat & Prod Engn, I-24044 Dalmine, BG, Italy
关键词
retrieving [!text type='JSON']JSON[!/text] documents from the web; open data portals; fuzzy sets and soft selection conditions; platform-independent framework; case study; INFORMATION; DATABASES; AGGREGATION; FRAMEWORK; QUERIES; SYSTEMS;
D O I
10.3390/info12070258
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How to exploit the incredible variety of JSON data sets currently available on the Internet, for example, on Open Data portals? The traditional approach would require getting them from the portals, then storing them into some JSON document store and integrating them within the document store. However, once data are integrated, the lack of a query language that provides flexible querying capabilities could prevent analysts from successfully completing their analysis. In this paper, we show how the J-CO Framework, a novel framework that we developed at the University of Bergamo (Italy) to manage large collections of JSON documents, is a unique and innovative tool that provides analysts with querying capabilities based on fuzzy sets over JSON data sets. Its query language, called J-CO-QL, is continuously evolving to increase potential applications; the most recent extensions give analysts the capability to retrieve data sets directly from web portals as well as constructs to apply fuzzy set theory to JSON documents and to provide analysts with the capability to perform imprecise queries on documents by means of flexible soft conditions. This paper presents a practical case study in which real data sets are retrieved, integrated and analyzed to effectively show the unique and innovative capabilities of the J-CO Framework.
引用
收藏
页数:41
相关论文
共 50 条
  • [31] Variable precision fuzzy rough sets model in the analysis of process data
    Mieszkowicz-Rolka, A
    Rolka, L
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 1, PROCEEDINGS, 2005, 3641 : 354 - 363
  • [32] Fuzzy Sets in Data Analysis: From Statistical Foundations to Machine Learning
    Couso, Ina
    Borgelt, Christian
    Huellermeier, Eyke
    Kruse, Rudolf
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2019, 14 (01) : 31 - 44
  • [33] Towards Privacy-Preserving Evaluation for Information Retrieval Models Over Industry Data Sets
    Yang, Peilin
    Zhou, Mianwei
    Chang, Yi
    Zhai, Chengxiang
    Fang, Hui
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2017, 2017, 10648 : 210 - 221
  • [34] A CASE STUDY ON EVALUATING PERSONNEL AND JOBS JOINTLY WITH FUZZY DISTANCE SETS
    Ozdaban, Ismail
    Ozkan, Coskun
    INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING-THEORY APPLICATIONS AND PRACTICE, 2011, 18 (04): : 169 - 179
  • [35] True degradation estimation of industrial equipment with fuzzy sets: a case study
    Kumar, E. V.
    Chaturvedi, S. K.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART O-JOURNAL OF RISK AND RELIABILITY, 2009, 223 (O2) : 167 - 179
  • [36] Compilation of food composition data sets: an analysis of user needs through the Use Case approach
    Egan, B.
    Hodgkins, C.
    Fragodt, A.
    Raats, M.
    EUROPEAN JOURNAL OF CLINICAL NUTRITION, 2011, 65 (06) : 757 - 760
  • [37] Compilation of food composition data sets: an analysis of user needs through the Use Case approach
    B Egan
    C Hodgkins
    A Fragodt
    M Raats
    European Journal of Clinical Nutrition, 2011, 65 : 757 - 760
  • [38] AHP–TOPSIS integration extended with Pythagorean fuzzy sets for information security risk analysis
    M. Fatih Ak
    Muhammet Gul
    Complex & Intelligent Systems, 2019, 5 : 113 - 126
  • [39] Interpreting Industrial IoT Data Streams Through Fuzzy Querying With Hysteretic Fuzzy Sets on Apache Kafka
    Malysiak-Mrozek, Bozena
    Ryba, Bartlomiej
    Moleda, Marek
    Hung, Che-Lun
    Pedrycz, Witold
    Ding, Weiping
    Mrozek, Dariusz
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (08) : 4671 - 4684
  • [40] A First Study on the Use of Interval-Valued Fuzzy Sets with Genetic Tuning for Classification with Imbalanced Data-Sets
    Sanz, J.
    Fernandez, A.
    Bustince, H.
    Herrera, F.
    HYBRID ARTIFICIAL INTELLIGENCE SYSTEMS, 2009, 5572 : 581 - +