The Italian Crowdsourcing Project: Visual word recognition times for 130,495 Italian words

被引:0
|
作者
Simona Amenta [1 ]
Andrea Gregor de Varda [1 ]
Pawel Mandera [2 ]
Emmanuel Keuleers [3 ]
Marc Brysbaert [4 ]
Marco Marelli [1 ]
机构
[1] University of Milano-Bicocca,Department of Psychology
[2] Lingvist Technologies,Department of Cognitive Science and Artificial Intelligence
[3] University of Tilburg,Department of Experimental Psychology
[4] Ghent University,undefined
关键词
Word recognition; Lexical decision; Megastudy; Crowdsourcing; Prevalence;
D O I
10.3758/s13428-024-02548-4
中图分类号
学科分类号
摘要
Despite being largely spoken and studied by language and cognitive scientists, Italian lacks large resources of language processing data. The Italian Crowdsourcing Project (ICP) is a dataset of word recognition times and accuracy including responses to 130,465 words, which makes it the largest dataset of its kind item-wise. The data were collected in an online word knowledge task in which over 156,000 native speakers of Italian took part. We validated the ICP dataset by (1) showing that ICP reaction times correlate strongly (r = .78) with lexical decision latencies collected in a traditional lab experiment, (2) showing that the effect of major psycholinguistic variables (e.g., frequency, length, etc.) can be replicated in this dataset, and (3) replicating the effect of word prevalence, which we compute here for the first time for Italian. Given the inclusion of many inflectional forms of verbs, adjectives, and nouns, we further showcase the potential of this dataset by exploring two phenomena (inflectional entropy in verb paradigms and the clitic effect in isolated word recognition) that build on the peculiar properties of Italian. In this paper we present the ICP resource and release response times, accuracy, and prevalence estimates for all the words included.
引用
收藏
相关论文
共 50 条
  • [1] The role of orthographic cues to stress in Italian visual word recognition
    Colombo, Lucia
    Sulpizio, Simone
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2021, 74 (09): : 1631 - 1641
  • [2] Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project
    Paweł Mandera
    Emmanuel Keuleers
    Marc Brysbaert
    Behavior Research Methods, 2020, 52 : 741 - 760
  • [3] Recognition Times for 54 Thousand Dutch Words: Data from the Dutch Crowdsourcing Project
    Brysbaert, Marc
    Keuleers, Emmanuel
    Mandera, Pawel
    PSYCHOLOGICA BELGICA, 2019, 59 (01) : 281 - 300
  • [4] Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project
    Mandera, Pawel
    Keuleers, Emmanuel
    Brysbaert, Marc
    BEHAVIOR RESEARCH METHODS, 2020, 52 (02) : 741 - 760
  • [5] Visual word recognition of multisyllabic words
    Yap, Melvin J.
    Balota, David A.
    JOURNAL OF MEMORY AND LANGUAGE, 2009, 60 (04) : 502 - 529
  • [6] Prefixes as access units in visual word recognition: A comparison of Italian and Dutch data
    Egbert M.H. Assink
    Caroline Vooijs
    Paul P.N.A. Knuijt
    Reading and Writing, 2000, 12 : 149 - 168
  • [7] Prefixes as access units in visual word recognition: A comparison of Italian and Dutch data
    Assink, EMH
    Vooijs, C
    Knuijt, PPNA
    READING AND WRITING, 2000, 12 (3-4) : 149 - 168
  • [8] Word recognition in Italian infants: preliminary results
    Majorano, M.
    Corsano, P.
    15TH EUROPEAN CONFERENCE ON DEVELOPMENTAL PSYCHOLOGY, 2011, : 307 - 311
  • [9] Word naming times and psycholinguistic norms for Italian nouns
    Laura Barca
    Cristina Burani
    Lisa S. Arduino
    Behavior Research Methods, Instruments, & Computers, 2002, 34 : 424 - 434
  • [10] Word naming times and psycholinguistic norms for Italian nouns
    Barca, L
    Burani, C
    Arduino, LS
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 2002, 34 (03): : 424 - 434