Converting unstructured and semi-structured data into knowledge

Cited by: 0
Authors
Rusu, Octavian [1]
Halcu, Ionela [2]
Grigoriu, Oana [2]
Neculoiu, Giorgian [2]
Sandulescu, Virginia [2]
Marinescu, Mariana [2]
Marinescu, Viorel [2]
Affiliations
[1] Alexandru Ioan Cuza Univ, Agcy ARNIEC RoEduNet, Iasi, Romania
[2] Tech Univ Civil Engn Bucharest, Bucharest, Romania
Keywords
unstructured data; semi-structured data; knowledge; knowledge discovery; KDD
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
With the rapid growth in the number and size of databases and database applications in business, administrative, industrial and other fields, it is necessary to examine the automatic extraction of knowledge from these large databases. Through knowledge extraction, databases have become rich and reliable sources for generating and verifying knowledge, and knowledge discovery can be applied in software management, query processing, decision making, process control and many other fields of interest. At the same time, managing unstructured data remains a challenge. Organizations with a large concentration of unstructured information tend to devote more resources to this kind of data. Acquiring knowledge from unstructured data is often difficult and expensive. Some possible solutions for extracting useful information (knowledge) from unstructured data are provided. Knowledge extraction is the process of creating knowledge from structured, unstructured and semi-structured data. The objective of this paper is to present the possibilities of extracting knowledge from unstructured and semi-structured data in particular. The theories and tools for knowledge extraction are the subject of the emerging field of knowledge discovery in databases (KDD). Definitions of KDD are provided and the general multistep KDD process is outlined. A brief summary of recent real-world KDD applications is also provided. Finally, the article enumerates challenges for future research and development in KDD systems.
Pages: 4