共 1 条
EXPLOITING CORPORA FOR EXTRACTING AND DESCRIBING SPECIALIZED LEXICON: TOWARDS A SOLID AND SUSTAINED METHODOLOGY
被引:1
|作者:
Barbero, Chiara
[1
]
Amaro, Raquel
[1
]
机构:
[1] Univ NOVA Lisboa, Lisbon, Portugal
来源:
关键词:
Specialized Lexicon Extraction;
Methodology;
Corpora;
Concordances;
Collocations;
D O I:
10.11606/issn.2236-4242.v33i1p69-104
中图分类号:
H [语言、文字];
学科分类号:
05 ;
摘要:
The use of corpora for specialized lexicon extraction is a common and consensual method for building lexical resources. However, the methodologies used to achieve this are not openly discussed, rendering the comparison and determination of robust approaches difficult. In order to fill in this gap, in this paper we present and discuss a detailed methodology for extracting specialized lexicon from corpus, combining linguistic and statistical approaches. The proposed method uses specialized and monitor corpora and comprises i) frequency information analyses; ii) concordances and collocations extraction; and iii) textual organization information; accounting for core single and multiword expressions and salient semantic relations extraction. This way, our goal is the determination of a solid and accurate list of potential specialized lexical units that will allow for a swifter final validation and for maximizing the informational value of the interaction with the experts.
引用
收藏
页码:69 / 104
页数:36
相关论文