Irreducible ambiguities in monoids of words

Cited by: 1
Authors
Del Vigna, C
Berment, V
Affiliations
[1] Ctr Anal & Math Sociales, CNRS, F-75014 Paris, France
[2] Grp Etude Traduct Automat, CLIPS 385, F-38040 Grenoble 9, France
DOI
10.36045/bbms/1074791326
Chinese Library Classification number
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
The starting point of this study is that the written form of certain South-East Asian languages does not use spaces between words, thus complicating their automatic processing. This is particularly the case on the syllabic level, where generally the text cannot be cut up uniquely. From the formal combinatorics point of view, the syllabic system of these languages is not a code. We will focus on the origin of the splitting ambiguities, more specifically on the inventory of the so-called irreducible ambiguities, in the sense that all others originate from them. We prove that the language of irreducible ambiguities is rational. Then we present a method to compute one of its regular expressions, illustrating the method with the experience of its application to the Lao language.
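To make the phrase "is not a code" concrete: a set of words is a code when every text has at most one factorization over it. The snippet below is a minimal sketch under stated assumptions, not the method of the paper. The function name `segmentations` and the toy set `toy_units` are hypothetical, and the toy set stands in for a syllable inventory (it is not Lao data). It simply enumerates every way of cutting a string into units from a given set, so a result with more than one entry exhibits a splitting ambiguity.

```python
from functools import lru_cache


def segmentations(text, units):
    """Return every way to write `text` as a concatenation of elements of `units`."""
    units = frozenset(units)

    @lru_cache(maxsize=None)
    def split_from(start):
        # Exactly one way to segment the empty suffix: the empty segmentation.
        if start == len(text):
            return ((),)
        results = []
        for end in range(start + 1, len(text) + 1):
            piece = text[start:end]
            if piece in units:
                # Extend each segmentation of the remaining suffix with `piece`.
                for rest in split_from(end):
                    results.append((piece,) + rest)
        return tuple(results)

    return [list(s) for s in split_from(0)]


# Hypothetical toy inventory, chosen only because it is not a code.
toy_units = {"a", "ab", "ba"}
print(segmentations("aba", toy_units))
# → [['a', 'ba'], ['ab', 'a']]
```

The two results for "aba" show that the toy set admits two distinct factorizations of the same text, which is the kind of ambiguity the abstract refers to. Enumeration is exponential in the worst case, so this is suitable only for small illustrations.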
Pages: 693-706
Page count: 14