Multi-word collocation extraction by syntactic composition of collocation bigrams

被引:0
|
作者
Seretan, V [1 ]
Nerima, L [1 ]
Wehrli, E [1 ]
机构
[1] Univ Geneva, Language Technol Lab, CH-1211 Geneva 4, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method of multi-word collocation extraction, which is based on the syntactic composition of two-word collocations previously identified in text. We describe a procedure of word linking that iteratively builds up longer expressions, which constitute multi-word collocation candidates. We then present several measures used for candidates ranking according to the collocational strength, and show the results of a trigram extraction experiment. The methodology used is particularly suited for the extraction of flexible collocations, which can undergo complex syntactical transformations such as passivization, relativization and dislocation.
引用
收藏
页码:91 / 100
页数:10
相关论文
共 50 条
  • [41] Collocation extraction with multiple hybrid strategies
    Wang, Daliang
    Tu, Xuyan
    Zheng, Xuefeng
    Tong, Zijian
    2008, Press of Tsinghua University (48):
  • [42] The Role Collocation in Corpus Word Sense Annotation
    Liu, Jing
    Yang, Li-jiao
    Liu, Zhi-ying
    INTERNATIONAL CONFERENCE ON HUMANITY AND SOCIAL SCIENCE (ICHSS 2014), 2014, : 100 - 104
  • [43] Unsupervised Word Sense Disambiguation based on Word Embedding and Collocation
    Han, Shangzhuang
    Shirai, Kiyoaki
    ICAART: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2021, : 1218 - 1225
  • [44] Lexical association measures and collocation extraction
    Pavel Pecina
    Language Resources and Evaluation, 2010, 44 : 137 - 158
  • [45] Improving xtract for chinese collocation extraction
    Lu, Q
    Li, Y
    Xu, RF
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 333 - 338
  • [46] Outlier Detection in Automatic Collocation Extraction
    Santana Suarez, Octavio
    Sanchez-Berriel, Isabel
    Perez Aguiar, Jose
    Gutierrez Rodriguez, Virginia
    CURRENT WORK IN CORPUS LINGUISTICS: WORKING WITH TRADITIONALLY- CONCEIVED CORPORA AND BEYOND (CILC2015), 2015, 198 : 433 - 441
  • [47] Syntax-Based Collocation Extraction
    Tutin, Agnes
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2011, 52 (03): : 288 - 292
  • [48] Syntax-Based Collocation Extraction
    Williams, Geoffrey
    INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2013, 26 (01) : 90 - 94
  • [49] Automatic extraction of multilingual collocation equivalents
    Garcia, Marcos
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2018, (61): : 131 - 134
  • [50] Syntax-Based Collocation Extraction
    Villavicencio, Aline
    NATURAL LANGUAGE ENGINEERING, 2012, 18 : 575 - 579