Multi-word collocation extraction by syntactic composition of collocation bigrams

被引:0
|
作者
Seretan, V [1 ]
Nerima, L [1 ]
Wehrli, E [1 ]
机构
[1] Univ Geneva, Language Technol Lab, CH-1211 Geneva 4, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a method of multi-word collocation extraction, which is based on the syntactic composition of two-word collocations previously identified in text. We describe a procedure of word linking that iteratively builds up longer expressions, which constitute multi-word collocation candidates. We then present several measures used for candidates ranking according to the collocational strength, and show the results of a trigram extraction experiment. The methodology used is particularly suited for the extraction of flexible collocations, which can undergo complex syntactical transformations such as passivization, relativization and dislocation.
引用
收藏
页码:91 / 100
页数:10
相关论文
共 50 条
  • [21] A Study on Multi-word Extraction from Chinese Documents
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, 2008, 4977 : 42 - +
  • [22] A Combined Approach for the Extraction of the Multi-word and Nested Biomedical
    Gong, Lejun
    Feng, Jiacheng
    Yang, Ronggen
    Yang, Geng
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 708 - 711
  • [23] A multi-word term extraction program for Arabic language
    Boulaknadel, Siham
    Daille, Beatrice
    Aboutajdine, Driss
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1485 - 1488
  • [24] Induction of Syntactic Collocation Patterns from Generic Syntactic Relations
    Seretan, Violeta
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1698 - 1699
  • [25] English–Arabic collocation extraction to enhance Arabic collocation identification
    Chiraz Ben Othmane Zribi
    Knowledge and Information Systems, 2020, 62 : 2439 - 2459
  • [27] Association measures for collocation extraction
    Su, Qi
    Gu, Chen
    Liu, Pengyuan
    INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2024, 29 (01) : 59 - 86
  • [28] A Hierachical Collocation Extraction Tool
    Li, Dan
    Cao, Jingxiang
    Huang, Degen
    PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 51 - 55
  • [29] TermeX: A Tool for Collocation Extraction
    Delac, Davor
    Krleza, Zoran
    Snajder, Jan
    Basic, Bojana Dalbelo
    Saric, Frane
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2009, 5449 : 149 - 157
  • [30] English-Arabic collocation extraction to enhance Arabic collocation identification
    Zribi, Chiraz Ben Othmane
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (06) : 2439 - 2459