Extracting paraphrases of Japanese action word of sentence ending part from Web and mobile news articles

被引:0
|
作者
Nakagawa, H
Masuda, H
机构
[1] Univ Tokyo, Ctr Informat Technol, Tokyo 1130033, Japan
[2] Tokyo Denki Univ, Tokyo 1018457, Japan
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this research, we extract paraphrases from Japanese Web news articles that are long and aimed at displaying on personal computer screens and mobile news articles that are short and compact and aimed at mobile terminals' small screens. We have collected them for more than two years, and aligned them at article level and then at sentence level. As the result, we got more than 88,000 pairs of aligned sentences. Next, we extract paraphrases of the final part of sentences from this aligned corpus. The paraphrases that we try to extract are the sentence final nouns of mobile article sentences and their counterpart expressions of Web article sentences. We extract character strings and word sequences for paraphrases based on branching factor, frequency and length of string. The precision is 90% for highest ranked candidate and 83% to 59% for each top three candidates of 100 most frequently used action nouns.
引用
收藏
页码:94 / 105
页数:12
相关论文
empty
未找到相关数据