Improved Algorithm Based on Decision Tree for Semantic Information Retrieval

被引:2
|
作者
Wang, Zhe [1 ,2 ]
Zhao, Yingying [1 ]
Dong, Hai [3 ]
Xu, Yulong [1 ]
Lv, Yali [1 ]
机构
[1] Henan Univ Chinese Med, Sch Informat Technol, Zhengzhou 450046, Peoples R China
[2] Donghua Univ, Sch Informat Sci & Technol, Shanghai 201620, Peoples R China
[3] RMIT Univ, Sch Comp Technol, Melbourne, Vic 3001, Australia
来源
关键词
Semantic; information retrieval; decision tree; SIMILARITY; DESIGN; FUSION;
D O I
10.32604/iasc.2021.016434
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quick retrieval of target information from a massive amount of information has become a core research area in the field of information retrieval. Semantic information retrieval provides effective methods based on semantic comprehension, whose traditional models focus on multiple rounds of detection to differentiate information. Since a large amount of information must be excluded, retrieval efficiency is low. One of the most common methods used in classification, the decision tree algorithm, first selects attributes with higher information entropy to construct a decision tree. However, the tree only matches words on the grammatical level and does not consider the semantic of the information and lacks understanding of the information; meanwhile, it increases the amount of calculation and the complexity of the algorithm on synonymous fields, and the classification quality is not high. We investigate the retrieval method, unstructured processing with different semantic data, extracting the attribute features of semantic information, creating a multi-layered structure for the attribute features, calculating the window function according to the theory of multi-level analytic fusion, and fusing different levels of data. Then, we calculate the expected entropy of semantic information, undertake the boundary treatment of the attributes, calculate the information gain and information gain ratio of the attributes, and set the largest gain ratio of semantic data as the nodes of the decision tree. Our results reveal the algorithm's superior effectiveness in semantic information retrieval. Experimental results verify that the algorithm improves the expressing ability of knowledge in the information retrieval system and improves the time efficiency of semantic information retrieval.
引用
收藏
页码:419 / 429
页数:11
相关论文
共 50 条
  • [21] Research on the Application of Improved Decision Tree Algorithm based on Information Entropy in the Financial Management of Colleges and Universities
    Zhao, Huirong
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 704 - 714
  • [22] An Decision Tree Algorithm Based on Dispersion Measure of Attribute Information
    He Dengchao
    Hao Wenning
    Gan Wenyan
    Chen Gang
    Jin Dawei
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 84 - 89
  • [23] Research of A Scalable Decision Tree Algorithm Based on Information Entropy
    Wei, XianMin
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 2, 2011, : 309 - 310
  • [24] Semantic similarity in geographic information retrieval for decision making
    Machado, Neili
    Balmaseda, Carlos
    Montoyo, Andres
    PROCEEDINGS OF THE 2014 9TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2014), 2014,
  • [25] Information retrieval based on semantic networks
    Gerick, T
    NFD INFORMATION-WISSENSCHAFT UND PRAXIS, 1999, 50 (04): : 205 - 209
  • [26] Ontology Based Semantic Information Retrieval
    Mustafa, Jibran
    Khan, Sharifullah
    Latif, Khalid
    2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 840 - 845
  • [27] Geospatial Information Semantic Retrieval Algorithm Research Based on Geo-ontology
    Liu, Wei
    Du, Yunyan
    Zhang, Hairong
    Xiao, Rulin
    2009 17TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS, VOLS 1 AND 2, 2009, : 23 - +
  • [28] Similarity Matching Algorithm for Ontology-Based Semantic Information Retrieval Model
    Gao, Qian
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 758 - 763
  • [29] An Improved Learning Algorithm of Decision Tree Based on Entropy Uncertainty Deviation
    Sun, Huaining
    Hu, Xuegang
    PROCEEDINGS OF 2012 IEEE 14TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, 2012, : 799 - 803
  • [30] An improved network traffic classification algorithm based on Hadoop decision tree
    Yuan, Zhengwu
    Wang, Chaozheng
    2016 IEEE INTERNATIONAL CONFERENCE OF ONLINE ANALYSIS AND COMPUTING SCIENCE (ICOACS), 2016, : 53 - 56