Agricultural Ontology Based Feature Optimization for Agricultural Text Clustering

被引:6
|
作者
Su Ya-ru [1 ,2 ]
Wang Ru-jing [1 ,2 ]
Chen Peng [1 ,2 ]
Wei Yuan-yuan [1 ,2 ]
Li Chuan-xi [1 ,2 ]
Hu Yi-min [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Sch Informat Sci & Technol, Hefei 230026, Peoples R China
基金
中国国家自然科学基金;
关键词
agricultural ontology; feature optimization; agricultural text clustering; CONSTRUCTION;
D O I
10.1016/S2095-3119(12)60064-1
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Feature optimization is important to agricultural text mining. Usually, the vector space model is used to represent text documents. However, this basic approach still suffers from two drawbacks: the curse of dimension and the lack of semantic information. In this paper, a novel ontology-based feature optimization method for agricultural text was proposed. First, terms of vector space model were mapped into concepts of agricultural ontology, which concept frequency weights are computed statistically by term frequency weights; second, weights of concept similarity were assigned to the concept features according to the structure of the agricultural ontology. By combining feature frequency weights and feature similarity weights based on the agricultural ontology, the dimensionality of feature space can be reduced drastically. Moreover, the semantic information can be incorporated into this method. The results showed that this method yields a significant improvement on agricultural text clustering by the feature optimization.
引用
收藏
页码:752 / 759
页数:8
相关论文
共 50 条
  • [41] Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering
    Laith Mohammad Abualigah
    Ahamad Tajudin Khader
    The Journal of Supercomputing, 2017, 73 : 4773 - 4795
  • [42] A comparative study of optimization algorithms for feature selection on ML-based classification of agricultural data
    Garip, Zeynep
    Ekinci, Ekin
    Cimen, Murat Erhan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3341 - 3362
  • [43] COSUM: Text summarization based on clustering and optimization
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    Isazade, Nijat R.
    Abdi, Asad
    Idris, Norisma
    EXPERT SYSTEMS, 2019, 36 (01)
  • [44] Feature-based visual tracking for agricultural implements
    Stanhope, Trevor P.
    Adamchuk, Viacheslav I.
    IFAC PAPERSONLINE, 2016, 49 (16): : 359 - 364
  • [45] Performance Evaluation of Semantic Based and Ontology Based Text Document Clustering Techniques
    Punitha, S. C.
    Punithavalli, M.
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 100 - 106
  • [46] Multi-label Text Categorization Based on Feature Optimization using Ant Colony Optimization and Relevance Clustering Technique
    Nema, Puneet
    Sharma, Vivek
    2015 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, AND SYSTEMS (ICCCS), 2015, : 1 - 5
  • [47] An Implementation of Clustering Project Proposals on Ontology based Text Mining Approach
    Preethi, T.
    Lakshmi, R.
    2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013, : 547 - 550
  • [48] Research on Intelligent Retrieval System for agricultural information resources based on ontology
    Qin, Xiaojing
    Zhang, Hui
    Zheng, Huaiguo
    2018 INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SCIENCE AND APPLICATION TECHNOLOGY, 2019, 1168
  • [49] Research on decision support system based on agricultural risk management ontology
    Liu C.
    Wang Y.
    Yang D.
    International Journal of Digital Content Technology and its Applications, 2011, 5 (12) : 290 - 297
  • [50] Research on Knowledge Base Construction of Agricultural Ontology Based on HNC Theory
    Hao Xinning
    Xie Nengfu
    Sunwei
    Zhong Xiaochun
    Zhang Xuefu
    COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE IX, CCTA 2015, PT I, 2016, 478 : 437 - 445