An Automated Metadata Generation Method for Data Lake of Industrial WoT Applications

被引:5
|
作者
Yu, Han [1 ]
Cai, Hongming [1 ]
Liu, Zhiyuan [1 ]
Xu, Boyi [2 ]
Jiang, Lihong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Software, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Coll Econ & Management, Shanghai 200052, Peoples R China
基金
中国国家自然科学基金;
关键词
Metadata; Semantics; Runtime; Data mining; Ontologies; Text recognition; Conferences; Data lake (DL); data modeling; entity recognition; metadata generation; stream processing; Web of Things (WoT); ACQUISITION; EXTRACTION;
D O I
10.1109/TSMC.2021.3119871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent trends in the Web of Things (WoT) have led to data explosion. Data lake (DL), as a flexible on-demand heterogeneous data management architecture, has become a feasible solution in data management. Metadata modeling for DLs is the key basis for smart analysis and processing. However, the varieties in structures and semantics of industrial WoT data hinder metadata modeling and maintenance. Moreover, the lack of textual descriptions and the semantics hidden in value streams make it hard to automatically construct semantic metadata. The dynamic nature of WoT requires on-time evolution on metadata. To overcome these challenges, we propose an automated bottom-up metadata generation approach for DL of WoT applications. Applying a data-driven framework, raw data are notated as linked data and self-organizing map-based online clustering is applied to real timely extract data characteristics. To recognize entities, concepts and relations, semantics-based entity discovery approach from short texts is proposed according to the feature of WoT data. The numerical analysis is performed to find the hidden relations from raw values. Full-dimensional metadata with rich semantic knowledge are finally built. Experiments on a real-world dataset are conducted to verify the effectiveness of methods and a case study on an energy WoT system is provided to demonstrate the feasibility of the approach.
引用
收藏
页码:5235 / 5248
页数:14
相关论文
共 50 条
  • [21] An Automated Test Data Generation Method for On-board Subsystem
    Xu, Ning
    Zhao, Xiao-Yu
    Li, Yi-Nan
    Zhang, Yu
    Liu, Ya-Qing
    Wang, Fei
    IAENG International Journal of Computer Science, 2023, 50 (02)
  • [22] A Data Lake Metadata Enrichment Mechanism via Semantic Blueprints
    Pingos, Michalis
    Andreou, Andreas S.
    ENASE: PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, 2022, : 186 - 196
  • [23] Towards Information Profiling: Data Lake Content Metadata Management
    Alserafi, Ayman
    Abello, Alberto
    Romero, Oscar
    Calders, Toon
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 178 - 185
  • [24] Industrial applications of automated process research
    Kirchhoff, E
    Cassidy, CS
    CHIMICA OGGI-CHEMISTRY TODAY, 2001, 19 (06) : 69 - 71
  • [25] Automated Quality Assessment of Metadata across Open Data Portals
    Neumaier, Sebastian
    Umbrich, Jurgen
    Polleres, Axel
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2016, 8 (01):
  • [26] Automated diagnosis of data-model conflicts using metadata
    Chen, RO
    Altman, RB
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, 6 (05) : 374 - 392
  • [27] Metadata for managing grid resources in data mining applications
    Mastroianni C.
    Talia D.
    Trunfio P.
    Journal of Grid Computing, 2004, 2 (1) : 85 - 102
  • [28] A concept for providing and utilizing metadata in data analytics applications
    Li, Wan
    Kleinert, Tobias
    AT-AUTOMATISIERUNGSTECHNIK, 2023, 71 (01) : 44 - 55
  • [29] A New Metadata Model to Uniformly Handle Heterogeneous Data Lake Sources
    Diamantini, Claudia
    Lo Giudice, Paolo
    Musarella, Lorenzo
    Potena, Domenico
    Storti, Emanuele
    Ursino, Domenico
    NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2018, 2018, 909 : 165 - 177
  • [30] Data provenance, metadata, and reflexivity: Comments on method
    Curtis, Bruce
    ENCOUNTERS IN THEORY AND HISTORY OF EDUCATION, 2014, 15 : 43 - 61