Classification framework and semantic labeling for Big Earth Data

被引:4
|
作者
Wang, Juanle [1 ,2 ]
Bu, Kun [3 ]
Yan, Dongmei [4 ]
Wang, Jingyue [1 ,5 ]
Duan, Bowen [1 ]
Zhang, Min [1 ,6 ]
He, Guojin [4 ]
机构
[1] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing, Peoples R China
[2] Collaborat Innovat Ctr Dev & Utilizat Geog Inform, Nanjing, Peoples R China
[3] Chinese Acad Sci, Northeast Inst Geog & Agroecol, Changchun, Peoples R China
[4] Int Res Ctr Big Data Sustainable Dev Goals, Beijing, Peoples R China
[5] Shandong Univ Technol, Sch Civil & Architectural Engn, Zibo, Peoples R China
[6] Univ Chinese Acad Sci, Coll Resource & Environm, Beijing, Peoples R China
关键词
Big Earth Data; CASEarth; scientific engineering; data classification; data labeling; data management; SUSTAINABLE DEVELOPMENT GOALS;
D O I
10.1080/20964471.2022.2123946
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big Earth Data refers to the multidimensional integration and association of scientific data, including geography, resources, environment, ecology, and biology. An effective data classification system and label management strategy are important foundations for long-term management of data resources. The objective of this study was to construct a classification system and realize multidimensional semantic data label management for the Big Earth Data Science Engineering Program (CASEarth). This study constructed two sets of classification and coding systems that realize classification by mapping each other; namely, the geosphere-level and Sustainable Development Goals (SDGs) indicator classifications. This technique was based on natural language processing technology and solved problems with subject-word segmentation, weight calculation, and dynamic matching. A prototype system for classification and label management was constructed based on existing CASEarth datasets of more than 1,100. Furthermore, we expect our study to provide the methodology and technical support for user-oriented classification and label management services for Big Earth Data.
引用
收藏
页码:886 / 903
页数:18
相关论文
共 50 条
  • [31] A Framework for Automated Scraping of Structured Data Records From the Deep Web Using Semantic Labeling: Semantic Scraper
    Kumaresan, Umamageswari
    Ramanujam, Kalpana
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2022, 12 (01)
  • [32] A Classification and Predication Framework for Taxi-Hailing Based on Big Data
    Yin, Changqing
    Lin, Yiwei
    Yang, Chen
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 747 - 758
  • [33] An Efficient, Ensemble-Based Classification Framework for Big Medical Data
    Khan, Firoz
    Prasad, Balusupati Veera Venkata Siva
    Syed, Salman Ali
    Ashraf, Imran
    Ramasamy, Lakshmana Kumar
    BIG DATA, 2022, 10 (02) : 151 - 160
  • [34] A Dynamic Spark-based Classification Framework for Imbalanced Big Data
    Nahla B. Abdel-Hamid
    Sally ElGhamrawy
    Ali El Desouky
    Hesham Arafat
    Journal of Grid Computing, 2018, 16 : 607 - 626
  • [35] A Dynamic Spark-based Classification Framework for Imbalanced Big Data
    Abdel-Hamid, Nahla B.
    ElGhamrawy, Sally
    El Desouky, Ali
    Arafat, Hesham
    JOURNAL OF GRID COMPUTING, 2018, 16 (04) : 607 - 626
  • [36] Semantic labeling of data by using the Web
    Rigutini, Leonardo
    Di Iorio, Ernesto
    Ernandes, Marco
    Maggini, Marco
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS PROCEEDINGS, 2006, : 638 - +
  • [37] Tracking and semantic labeling of boundary data
    Wallace, AM
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1999, 13 (06) : 859 - 879
  • [38] Semantic Models for Labeling Spectrum Data
    Nagpure, Vaishali
    Hood, Cynthia
    Vaccaro, Stephanie
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2018, 2018, 520 : 3 - 12
  • [39] An efficient ACO-PSO-based framework for data classification and preprocessing in big data
    Ashutosh Kumar Dubey
    Abhishek Kumar
    Rashmi Agrawal
    Evolutionary Intelligence, 2021, 14 : 909 - 922
  • [40] An efficient ACO-PSO-based framework for data classification and preprocessing in big data
    Dubey, Ashutosh Kumar
    Kumar, Abhishek
    Agrawal, Rashmi
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 909 - 922