A large and evolving cognate database

被引:11
|
作者
Batsuren, Khuyagbaatar [1 ]
Bella, Gabor [2 ]
Giunchiglia, Fausto [2 ,3 ]
机构
[1] Natl Univ Mongolia, Dept Informat & Comp Sci, Ikh Surguuliin Gudamj 1, Ulaanbaatar 14200, Mongolia
[2] Univ Trento, Dept Informat Engn & Comp Sci, Via Sommar 5, I-38123 Trento, Italy
[3] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
关键词
Cognate; Lexical semantics; Lexical database; INFERENCE; WORDNET;
D O I
10.1007/s10579-021-09544-6
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present CogNet, a large-scale, automatically-built database of sense-tagged cognates-words of common origin and meaning across languages. CogNet is continuously evolving: its current version contains over 8 million cognate pairs over 338 languages and 35 writing systems, with new releases already in preparation. The paper presents the algorithm and input resources used for its computation, an evaluation of the result, as well as a quantitative analysis of cognate data leading to novel insights on language diversity. Furthermore, as an example on the use of large-scale cross-lingual knowledge bases for improving the quality of multilingual applications, we present a case study on the use of CogNet for bilingual lexicon induction in the framework of cross-lingual transfer learning.
引用
收藏
页码:165 / 189
页数:25
相关论文
共 50 条
  • [21] SFARI Gene: an evolving database for the autism research community
    Banerjee-Basu, Sharmila
    Packer, Alan
    DISEASE MODELS & MECHANISMS, 2010, 3 (3-4) : 133 - 135
  • [22] The case for mesodata: An empirical investigation of an evolving database system
    de Vries, Denise
    Roddick, John F.
    INFORMATION AND SOFTWARE TECHNOLOGY, 2007, 49 (9-10) : 1061 - 1072
  • [23] FLOPROS: an evolving global database of flood protection standards
    Scussolini, Paolo
    Aerts, Jeroen C. J. H.
    Jongman, Brenden
    Bouwer, Laurens M.
    Winsemius, Hessel C.
    de Moel, Hans
    Ward, Philip J.
    NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2016, 16 (05) : 1049 - 1061
  • [24] THE TEACHERS' CHOICES COGNATE DATABASE FOR K-3 TEACHERS OF LATINO ENGLISH LEARNERS
    Montelongo, Jose A.
    Hernandez, Anita C.
    READING TEACHER, 2013, 67 (03): : 187 - 192
  • [25] Cooperating evolving components - A rigorous approach to evolving large software systems
    Greenwood, RM
    Warboys, BC
    Sa, J
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 1996, : 428 - 437
  • [26] Evolving social influence in large populations
    Bentley, R. Alexander
    Ormerod, Paul
    Batty, Michael
    BEHAVIORAL ECOLOGY AND SOCIOBIOLOGY, 2011, 65 (03) : 537 - 546
  • [27] Description of evolving anisotropy at large strains
    Harrysson, Magnus
    Ristinmaa, Matti
    MECHANICS OF MATERIALS, 2007, 39 (03) : 267 - 282
  • [28] Evolving code with a large language model
    Hemberg, Erik
    Moskal, Stephen
    O'Reilly, Una-May
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2024, 25 (02)
  • [29] Evolving methods for the assembly of large genomes
    Gibbs, RA
    Weinstock, GM
    COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY, 2003, 68 : 189 - 194
  • [30] Evolving social influence in large populations
    R. Alexander Bentley
    Paul Ormerod
    Michael Batty
    Behavioral Ecology and Sociobiology, 2011, 65 : 537 - 546