Classification framework and semantic labeling for Big Earth Data

被引:4
|
作者
Wang, Juanle [1 ,2 ]
Bu, Kun [3 ]
Yan, Dongmei [4 ]
Wang, Jingyue [1 ,5 ]
Duan, Bowen [1 ]
Zhang, Min [1 ,6 ]
He, Guojin [4 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing, Peoples R China
[2] Collaborat Innovat Ctr Dev & Utilizat Geog Inform, Nanjing, Peoples R China
[3] Chinese Acad Sci, Northeast Inst Geog & Agroecol, Changchun, Peoples R China
[4] Int Res Ctr Big Data Sustainable Dev Goals, Beijing, Peoples R China
[5] Shandong Univ Technol, Sch Civil & Architectural Engn, Zibo, Peoples R China
[6] Univ Chinese Acad Sci, Coll Resource & Environm, Beijing, Peoples R China
关键词
Big Earth Data; CASEarth; scientific engineering; data classification; data labeling; data management; SUSTAINABLE DEVELOPMENT GOALS;
D O I
10.1080/20964471.2022.2123946
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big Earth Data refers to the multidimensional integration and association of scientific data, including geography, resources, environment, ecology, and biology. An effective data classification system and label management strategy are important foundations for long-term management of data resources. The objective of this study was to construct a classification system and realize multidimensional semantic data label management for the Big Earth Data Science Engineering Program (CASEarth). This study constructed two sets of classification and coding systems that realize classification by mapping each other; namely, the geosphere-level and Sustainable Development Goals (SDGs) indicator classifications. This technique was based on natural language processing technology and solved problems with subject-word segmentation, weight calculation, and dynamic matching. A prototype system for classification and label management was constructed based on existing CASEarth datasets of more than 1,100. Furthermore, we expect our study to provide the methodology and technical support for user-oriented classification and label management services for Big Earth Data.
引用
收藏
页码:886 / 903
页数:18
相关论文
共 50 条
  • [41] Big Data Framework
    Tekiner, Firat
    Keane, John A.
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1494 - 1499
  • [42] Earth Observation and Biodiversity Big Data for Forest Habitat Types Classification and Mapping
    Agrillo, Emiliano
    Filipponi, Federico
    Pezzarossa, Alice
    Casella, Laura
    Smiraglia, Daniela
    Orasi, Arianna
    Attorre, Fabio
    Taramelli, Andrea
    REMOTE SENSING, 2021, 13 (07)
  • [43] Smart4Job: A Big Data Framework for Intelligent Job Offers Broadcasting Using Time Series Forecasting and Semantic Classification
    Benabderrahmane, Sidahmed
    Mellouli, Nedra
    Lamolle, Myriam
    Paroubek, Patrick
    BIG DATA RESEARCH, 2017, 7 : 16 - 30
  • [44] Towards a Semantic Extract-Transform-Load (ETL) framework for Big Data Integration
    Bansal, Srividya K.
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 521 - 528
  • [45] A Distributed Arabic Text Classification Approach Using Latent Semantic Analysis for Big data
    Alazzam, Hadeel
    Alsmady, Abdulsalam
    PROCEEDINGS OF THE 2017 12TH INTERNATIONAL SCIENTIFIC AND TECHNICAL CONFERENCE ON COMPUTER SCIENCES AND INFORMATION TECHNOLOGIES (CSIT 2017), VOL. 1, 2017, : 58 - 61
  • [46] SEMANTIC HMC: A PREDICTIVE MODEL USING MULTI-LABEL CLASSIFICATION FOR BIG DATA
    Peixoto, Rafael
    Hassan, Thomas
    Cruz, Christophe
    Bertaux, Aurelie
    Silva, Nuno
    2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 2, 2015, : 173 - 179
  • [47] The challenges of a Big Data Earth
    Boulton, Geoffrey
    BIG EARTH DATA, 2018, 2 (01) : 1 - 7
  • [48] Multi-class Classification for Semantic Labeling of Places
    Shi, Lei
    Kodagoda, Sarath
    Dissanayake, Gamini
    11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2010), 2010, : 2307 - 2312
  • [49] Semantic HMC for Big Data Analysis
    Hassan, Thomas
    Peixoto, Rafael
    Cruz, Christophe
    Bertaux, Aurlie
    Silva, Nuno
    2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014,
  • [50] CCCa Framework - Classification System in Big Data Environment with Clustering and Cache Concepts
    Subramanian, Sabitha Malli
    Vijayalakshmi, S.
    Venkataraman, Balaji
    Venkumar, P.
    Sre, R. M. Rathikaa
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 44 - 53