CogNet: a Large-Scale Cognate Database

Cited by: 0
Authors
Batsuren, Khuyagbaatar [1]
Bella, Gabor [1]
Giunchiglia, Fausto [1,2]
Affiliations
[1] Univ Trento, DISI, Trento, Italy
[2] Jilin Univ, Changchun, Jilin, Peoples R China
Funding
EU Horizon 2020
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper introduces CogNet, a new, large-scale lexical database that provides cognates (words of common origin and meaning) across languages. The database currently contains 3.1 million cognate pairs across 338 languages using 35 writing systems. The paper also describes the automated method by which cognates were computed from publicly available wordnets, with an accuracy evaluated at 94%. Finally, statistics and early insights about the cognate data are presented, hinting at possible future exploitation of the resource by various fields of linguistics.
Pages: 3136-3145
Number of pages: 10