An Automated Metadata Generation Method for Data Lake of Industrial WoT Applications

Cited by: 5
Authors
Yu, Han [1]
Cai, Hongming [1]
Liu, Zhiyuan [1]
Xu, Boyi [2]
Jiang, Lihong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Software, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Coll Econ & Management, Shanghai 200052, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Metadata; Semantics; Runtime; Data mining; Ontologies; Text recognition; Conferences; Data lake (DL); data modeling; entity recognition; metadata generation; stream processing; Web of Things (WoT); ACQUISITION; EXTRACTION
DOI
10.1109/TSMC.2021.3119871
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recent trends in the Web of Things (WoT) have led to a data explosion. The data lake (DL), a flexible, on-demand architecture for managing heterogeneous data, has become a feasible solution for data management. Metadata modeling for DLs is the key basis for smart analysis and processing. However, the variety in structures and semantics of industrial WoT data hinders metadata modeling and maintenance. Moreover, the lack of textual descriptions and the semantics hidden in value streams make it hard to construct semantic metadata automatically. The dynamic nature of WoT also requires metadata to evolve in a timely manner. To overcome these challenges, we propose an automated bottom-up metadata generation approach for DLs of WoT applications. Within a data-driven framework, raw data are notated as linked data, and self-organizing-map-based online clustering is applied to extract data characteristics in real time. To recognize entities, concepts, and relations, a semantics-based approach to entity discovery from short texts is proposed, tailored to the features of WoT data. Numerical analysis is performed to find hidden relations in raw values. Full-dimensional metadata with rich semantic knowledge are finally built. Experiments on a real-world dataset verify the effectiveness of the methods, and a case study on an energy WoT system demonstrates the feasibility of the approach.
Pages: 5235-5248
Number of pages: 14