A Support Vector Machines Approach to Vietnamese Key Phrase Extraction

被引:0
|
作者
Nguyen, Chau Q. [1 ]
Hong, Luan T. [2 ]
Phan, Tuoi T. [2 ]
机构
[1] Ho Chi Minh Univ Ind, 12 Nguyen Van Bao St, Go Vap Dist, Hcmc, Vietnam
[2] HCMC Univ Technol, Go Vap Dist, Hcmc, Vietnam
关键词
Key phrase; Vietnamese key phrase extraction; natural language processing; part-of-speech; word segmentation; support vector machines;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic key phrase extraction is the task of automatically selecting a set of phrases that describe the content of a simple sentence. That a key phrase is extracted means that it is present verbatim the sentence to which it is assigned. Accurate key phrase extraction is fundamental to the success of many recent digital library applications, clustering, and semantic information retrieval techniques. The present research discusses; a support vector machines (SVMs) approach for Vietnamese key phrase extraction and presents a number of experiments in which performance is incrementally improved. In general, the Vietnamese key Phrase extracting process consists of three steps: word segmentation for identifying lexical units in an input sentence, part-of-speech tagging for words, and key phrase extraction for phrases. The performance of Vietnamese key phrase extraction systems is generally measured by the precision rate attained. This depends strongly on the nature and the size of it training set of key phrases. Most results are superior to 70.30% with a training set of 9,006 Vietnamese key phrases with of 2,000 sentences which was selected from the corpus of Vietnamese Lexicography Center (www.vietlex.com.vn).
引用
收藏
页码:131 / +
页数:2
相关论文
共 50 条
  • [1] Efficient and robust phrase chunking using support vector machines
    Wu, Yu-Chieh
    Yang, Jie-Chi
    Lee, Yuc-Shi
    Yen, Show-Jane
    INFORMATION RETRIEVAL TECHNOLOGY, PROCEEDINGS, 2006, 4182 : 350 - 361
  • [2] Symbolic Knowledge Extraction from Support Vector Machines: A Geometric Approach
    Ren, Lu
    Garcez, Artur d'Avila
    ADVANCES IN NEURO-INFORMATION PROCESSING, PT II, 2009, 5507 : 335 - 343
  • [3] Rule extraction from support vector machines: A sequential covering approach
    Barakat, Nahla H.
    Bradley, Andrew P.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (06) : 729 - 741
  • [4] Rule extraction from support vector machines
    Wang, Qiang
    Shen, Yong-Ping
    Chen, Ying-Wu
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2006, 28 (02): : 106 - 110
  • [5] Feature Extraction Using Support Vector Machines
    Tajiri, Yasuyuki
    Yabuwaki, Ryosuke
    Kitamura, Takuya
    Abe, Shigeo
    NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 : 108 - 115
  • [6] Fast Extraction Strategy of Support Vector Machines
    Wu, Wei
    Yang, Qiang
    Yan, Wenjun
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 49 - 54
  • [7] Contribution to the detection of key words by Vector Support Machines
    Benayed, Yassine
    2008, Lavoisier, 14 rue de Provigny, Cachan Cedex, F-94236, France (22)
  • [8] Automatic Key Phrase Extraction
    Almutiry, Omar
    2021 7TH INTERNATIONAL CONFERENCE ON ENGINEERING AND EMERGING TECHNOLOGIES (ICEET 2021), 2021, : 436 - 442
  • [9] A rule extraction approach from support vector machines for diagnosing hypertension among diabetics
    Singh, Namrata
    Singh, Pradeep
    Bhagat, Deepika
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 130 : 188 - 205
  • [10] A Unified Approach to the Extraction of Rules from Artificial Neural Networks and Support Vector Machines
    Guerreiro, Joao
    Trigueiros, Duarte
    ADVANCED DATA MINING AND APPLICATIONS (ADMA 2010), PT II, 2010, 6441 : 34 - 42