CLEQS——基于知识图谱构建的跨语言实体查询系统

被引:7
作者
苏永浩
张驰
程文亮
钱卫宁
机构
[1] 华东师范大学数据科学与工程研究院
关键词
跨语言实体链接; 知识图谱; 实体消歧义; 语义查询; 维基百科;
D O I
暂无
中图分类号
TP391.1 [文字信息处理];
学科分类号
081203 ; 0835 ;
摘要
针对中英文知识图谱在实体规模和关系质量上存在很大差异的问题,提出了一个基于英文知识图谱YAGO构建的跨语言实体查询系统CLEQS,即在英文知识图谱中查询对应中文实体。CLEQS包含两个模块:实体消歧义模块和跨语言实体链接模块。首先,实体消歧义模块依据中文查询实体和上下文信息,准确地将中文实体映射到中文维基百科中的无歧义词条;然后,跨语言实体链接模块构造跨语言实体链接模型(RSVM),将中文维基百科与英文知识图谱中描述相同概念的实体进行链接;最后,形成一个实体关系网。首次提出跨语言实体查询问题,并获得了82.3%的查询准确度。CLEQS系统能够提供准确、高效的跨语言中文实体查询,还能够发现中英文知识图谱中未知的跨语言实体链接。
引用
收藏
页码:204 / 206+223 +223
页数:4
相关论文
共 12 条
[1]  
Challenges in Chinese knowledge graph construction. WANG C,GAO M,HE X,et al. Proceedings of the 2015 31st IEEE International Conference on Data Engineering Workshops . 2015
[2]  
An effective TF/IDFbased text-to-text semantic similarity measure for text classification. ALBITAR S,FOURNIER S,ESPINASSE B. Web Information Systems Engineering-WISE 2014 . 2014
[3]  
WordNet[J] . George A. Miller. &nbspCommunications of the ACM . 1995 (11)
[4]  
Optimizing Search Engines using Clickthrough Data. Thorsten Joachims. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD2002 . 2002
[5]   Lecture Notes in Computer Science [C]. 
The First International Conference on Web-Age Information Management
,1600
[6]  
Probase:a probabilistic taxonomy for text understanding. WU W,LI H,WANG H,et al. ACM SIGMOD International Conference on Management of Data . 2012
[7]  
"Cross-lingual knowledge linking across wiki knowledgebases.". Wang,Zhichun,et al. Proceedings of the21st international conference on World Wide Web . 2012
[8]  
Enriching the crosslingual link structure of wikipedia-a classification-based approach. SORG P,CIMIANO P. Proceedings of the AAAI 2008 Workshop on Wikipedia and Artifical Intelligence . 2008
[9]  
LINDEN:Linking Named Entities with Knowledge Base via Semantic Knowledge. Shen W,Wang J,Luo P,et al. Proceedings of the 21st International Conference on World Wide Web . 2012
[10]  
Finding similar sentences across multiple languages in Wikipedia. ADAFRE S,de RIJKE M. Proceedings of the 11th Conference of the European Chapter of the Association for Computational Longuistics . 2006