TWO RESOURCES DEVELOPED IN THE PROJECT SEMANTICS-DRIVEN SYNTACTIC PARSER FOR ROMANIAN

被引:0
|
作者
Irimia, Elena [1 ]
Mititelu, Verginica Barbu [1 ]
机构
[1] Romanian Acad, Mihai Draganescu Res Inst Artificial Intelligence, Bucharest, Romania
关键词
treebank; valency frames; Romanian;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The paper describes in detail two essential resources created for the purpose of developing a performant hybrid (statistical and rule-based) parser for Romanian: 1. a treebank, designed by merging two existent dependency treebanks and by mapping their existent annotation to the one adopted in this project; it contains 9522 sentences and will be used as a benchmark for testing the parser and a valuable resource to promote within the community; given the value of such a linguistic resource, it is publicly released and maintained for research purposes; 2) a collection of valency frames for all the verbs in the treebank (and others), which contain, for each sense of a verb, the possible arguments and the morpho-syntactic (case), lexical (selected preposition, conjunction) and semantic restrictions on their lexicalization (formulated using top concepts from the Romanian wordnet).
引用
收藏
页码:69 / 78
页数:10
相关论文
empty
未找到相关数据