A large and evolving cognate database

被引:11
|
作者
Batsuren, Khuyagbaatar [1 ]
Bella, Gabor [2 ]
Giunchiglia, Fausto [2 ,3 ]
机构
[1] Natl Univ Mongolia, Dept Informat & Comp Sci, Ikh Surguuliin Gudamj 1, Ulaanbaatar 14200, Mongolia
[2] Univ Trento, Dept Informat Engn & Comp Sci, Via Sommar 5, I-38123 Trento, Italy
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
关键词
Cognate; Lexical semantics; Lexical database; INFERENCE; WORDNET;
D O I
10.1007/s10579-021-09544-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present CogNet, a large-scale, automatically-built database of sense-tagged cognates-words of common origin and meaning across languages. CogNet is continuously evolving: its current version contains over 8 million cognate pairs over 338 languages and 35 writing systems, with new releases already in preparation. The paper presents the algorithm and input resources used for its computation, an evaluation of the result, as well as a quantitative analysis of cognate data leading to novel insights on language diversity. Furthermore, as an example on the use of large-scale cross-lingual knowledge bases for improving the quality of multilingual applications, we present a case study on the use of CogNet for bilingual lexicon induction in the framework of cross-lingual transfer learning.
引用
收藏
页码:165 / 189
页数:25
相关论文
共 50 条
  • [31] Change Detection in Large Evolving Networks
    Namayanja, Josephine M.
    Janeja, Vandana P.
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2019, 15 (02) : 62 - 79
  • [32] STRUCTURAL MODEL CORRELATION USING LARGE ADMISSIBLE PERTURBATIONS IN COGNATE SPACE
    BERNITSAS, MM
    TAWEKAL, RL
    AIAA JOURNAL, 1991, 29 (12) : 2222 - 2232
  • [33] Preliminary Meeting for Evolving a Comprehensive Database on Deccan Volcanic Province
    Kale, Vivek S.
    Krishnamurthy, P.
    Sangode, S. J.
    JOURNAL OF THE GEOLOGICAL SOCIETY OF INDIA, 2024, 100 (06) : 901 - 902
  • [34] Understanding Large Database Studies
    Maltenfort, Mitchell G.
    JOURNAL OF SPINAL DISORDERS & TECHNIQUES, 2015, 28 (06): : 221 - 221
  • [35] Regression Analysis for a Large Database
    Bishop, Michael J.
    Henderson, William G.
    Domino, Karen B.
    ANESTHESIA AND ANALGESIA, 2008, 107 (06): : 2090 - 2090
  • [36] Infevers: An evolving mutation database for auto-inflammatory syndromes
    Touitou, I
    Lesage, S
    McDermott, M
    Cuisset, L
    Hoffman, H
    Dode, C
    Shoham, N
    Aganna, E
    Hugot, JP
    Wise, C
    Waterham, H
    Pugnere, D
    Demaille, J
    de Menthiere, CS
    HUMAN MUTATION, 2004, 24 (03) : 194 - 198
  • [37] The Sardinian large elasmobranch database
    Storai, Tiziano
    Cristo, Benedetto
    Zuffa, Marco
    Zinzula, Luca
    Floris, Antonello
    Campanile, Arcangela Tiziana
    CYBIUM, 2006, 30 (04): : 141 - 144
  • [38] EVOLVING A LEGACY SYSTEM - RESTRUCTURING THE MENDELIAN INHERITANCE IN MAN DATABASE
    LI, P
    KRAMER, L
    PINEO, S
    KULP, D
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, : 344 - 348
  • [39] Evolving an information logistics database for geospatial early warning systems
    Hammitzsch, Martin
    Lendholt, Matthias
    GEOMATICS NATURAL HAZARDS & RISK, 2011, 2 (02) : 95 - 109
  • [40] MANAGING THE VERY LARGE DATABASE
    POLK, WJ
    BYRD, K
    DATAMATION, 1981, 27 (10): : 115 - +