A Study of the Techniques of Automatic Abstracting and Knowledge Acquisition Systems

Authors
SUN Chun kui directed by ZHONG Yi xin
Institution
Keywords
automatic abstracting; automatic knowledge acquisition; machine learning; natural language processing;
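DOI
Not available
Chinese Library Classification number
TP181 [Automated reasoning, machine learning];
Discipline classification code
Abstract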
One of the most important signs of the information society is the explosion of information. Information on the Internet is unordered and is mostly written in natural language, so it must be handled with natural language processing technology. When you search the Internet for specific information through a search engine, you may be overwhelmed by the huge number of results it returns. If a search engine embeds an Automatic Abstracting (AA) system, however, you can locate the information quickly, or obtain more information within a limited time. AA technology is therefore valuable in both science and application.
The work of this thesis began when we took over a project called "The Key Technology Research of Computer Networks Providing Intelligent Information Services", which belongs to the national 863 plan. One of its tasks is "The Key Technology Research of Automatic Abstracting Systems of Chinese Text". As a member of this research group, I took part in designing and implementing an AA system called Literature Abstract and Digest Information Extract System (LADIES). I have worked in this field ever since, and this thesis summarizes that work.
The main topic of the thesis is AA technology, and it has two parts. The first covers research on understanding-based AA systems; the second investigates Automatic Knowledge Acquisition (AKA) in AA systems. In the first part, AA technology is introduced and an understanding-based AA model is put forward. LADIES is implemented on this model. It has two major features: (1) it understands text using the grammatical, semantic, and pragmatic information of words; (2) it chunks words into relatively independent entities with chunking rules, which substitute for syntactic analysis rules. The results demonstrate that it performs better than statistical-based AA systems. However, the application of LADIES is limited by its knowledge bases, and it is difficult to use in other fields because the knowledge bases are set up manually. We therefore investigate automatic knowledge acquisition techniques in order to address these problems to some extent.
In the second part, we introduce the basic ideas of AKA and some of the Machine Learning (ML) methods it applies. We then propose a comprehensive dictionary model that contains the grammatical, semantic, and pragmatic information of words, and investigate a strategy for automatically learning pragmatic information for words. We also put forward a strategy for automatically learning rules for salient sentences in texts and, based on it, build an AA system, LADIES NEW. Finally, we propose an AKA-based AA system model called the hierarchical feature extracting AA system model.
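For readers unfamiliar with the statistical-based AA systems that the abstract uses as a point of comparison, the following is a minimal sketch of one common approach of that kind: scoring sentences by average word frequency and keeping the top few. It is not the LADIES method and is not taken from the thesis. The function names, the stop-word list, and the English-only tokenization are all assumptions made for illustration; Chinese text, which the thesis targets, would first need word segmentation.

```python
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "in", "to", "it", "by", "are"}


def split_sentences(text):
    """Split on sentence-final punctuation followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Return lowercase alphabetic tokens with stop words removed."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOP_WORDS]


def summarize(text, k=1):
    """Return the k highest-scoring sentences, kept in their original order."""
    sentences = split_sentences(text)
    # Word frequencies over the whole text act as a crude salience signal.
    freq = Counter(w for s in sentences for w in tokenize(s))

    def score(sentence):
        words = tokenize(sentence)
        # Average frequency, so long sentences are not favored by length alone.
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    return [sentences[i] for i in sorted(ranked[:k])]


if __name__ == "__main__":
    sample = "Cats chase mice. Cats sleep. Dogs bark loudly at night."
    print(summarize(sample, k=2))
```

A scorer like this uses no grammatical, semantic, or pragmatic knowledge, which is the gap the understanding-based model described in the abstract aims to fill.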
Pages: 82 / 82
Page count: 1