The Corpus of American Danish: a language resource of spoken immigrant Danish in North and South America

Cited by: 2
Authors
Kuhl, Karoline [1]
Petersen, Jan Heegard [1]
Hansen, Gert Foget [1]
Affiliations
[1] Univ Copenhagen, Dept Nord Studies & Linguist, Emil Holms Kanal 2, DK-2300 Copenhagen, Denmark
Keywords
Corpus documentation; Spoken language resource; Validation procedures; Heritage language; Danish; Multilingual spoken language; Language contact;
DOI
10.1007/s10579-019-09473-5
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
This paper describes the 'Corpus of American Danish' (CoAmDa), a newly established corpus of spoken immigrant Danish in North and South America. The CoAmDa amounts to approx. 1.7 million tokens, making it one of the largest corpora of heritage language at present. With regard to text type, the CoAmDa is a non-standard multilingual spoken language resource as Danish is mixed with American English, Canadian English or Argentine Spanish, respectively, in every recording. The aim of this note is to document relevant aspects and specifications of the CoAmDa, viz. the audio data, the sociodemographic metadata of the speakers, the digitization process of analog data, the transcription procedures, the format and tagging of the speech files and the internal validation procedures. In so doing, we wish to share our experience and best practices with regard to achieving a spoken language resource of high quality with the interested public, in particular other researchers working on and with multilingual speech corpora.
Pages: 831-849
Page count: 19