The Italian Crowdsourcing Project: Visual word recognition times for 130,495 Italian words

被引：0

作者：

Simona Amenta ^{[1
]}

Andrea Gregor de Varda ^{[1
]}

Pawel Mandera ^{[2
]}

Emmanuel Keuleers ^{[3
]}

Marc Brysbaert ^{[4
]}

Marco Marelli ^{[1
]}

机构：

[1] University of Milano-Bicocca,Department of Psychology

[2] Lingvist Technologies,Department of Cognitive Science and Artificial Intelligence

[3] University of Tilburg,Department of Experimental Psychology

[4] Ghent University,undefined

来源：

Behavior Research Methods | / 57卷 / 1期

关键词：

Word recognition; Lexical decision; Megastudy; Crowdsourcing; Prevalence;

D O I：

10.3758/s13428-024-02548-4

中图分类号：

学科分类号：

摘要：

Despite being largely spoken and studied by language and cognitive scientists, Italian lacks large resources of language processing data. The Italian Crowdsourcing Project (ICP) is a dataset of word recognition times and accuracy including responses to 130,465 words, which makes it the largest dataset of its kind item-wise. The data were collected in an online word knowledge task in which over 156,000 native speakers of Italian took part. We validated the ICP dataset by (1) showing that ICP reaction times correlate strongly (r = .78) with lexical decision latencies collected in a traditional lab experiment, (2) showing that the effect of major psycholinguistic variables (e.g., frequency, length, etc.) can be replicated in this dataset, and (3) replicating the effect of word prevalence, which we compute here for the first time for Italian. Given the inclusion of many inflectional forms of verbs, adjectives, and nouns, we further showcase the potential of this dataset by exploring two phenomena (inflectional entropy in verb paradigms and the clitic effect in isolated word recognition) that build on the peculiar properties of Italian. In this paper we present the ICP resource and release response times, accuracy, and prevalence estimates for all the words included.

引用

共 50 条

[1] The role of orthographic cues to stress in Italian visual word recognition
Colombo, Lucia
Sulpizio, Simone
QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2021, 74 (09): : 1631 - 1641
[2] Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project
Paweł Mandera
Emmanuel Keuleers
Marc Brysbaert
Behavior Research Methods, 2020, 52 : 741 - 760
[3] Recognition Times for 54 Thousand Dutch Words: Data from the Dutch Crowdsourcing Project
Brysbaert, Marc
Keuleers, Emmanuel
Mandera, Pawel
PSYCHOLOGICA BELGICA, 2019, 59 (01) : 281 - 300
[4] Recognition times for 62 thousand English words: Data from the English Crowdsourcing Project
Mandera, Pawel
Keuleers, Emmanuel
Brysbaert, Marc
BEHAVIOR RESEARCH METHODS, 2020, 52 (02) : 741 - 760
[5] Visual word recognition of multisyllabic words
Yap, Melvin J.
Balota, David A.
JOURNAL OF MEMORY AND LANGUAGE, 2009, 60 (04) : 502 - 529
[6] Prefixes as access units in visual word recognition: A comparison of Italian and Dutch data
Egbert M.H. Assink
Caroline Vooijs
Paul P.N.A. Knuijt
Reading and Writing, 2000, 12 : 149 - 168
[7] Prefixes as access units in visual word recognition: A comparison of Italian and Dutch data
Assink, EMH
Vooijs, C
Knuijt, PPNA
READING AND WRITING, 2000, 12 (3-4) : 149 - 168
[8] Word recognition in Italian infants: preliminary results
Majorano, M.
Corsano, P.
15TH EUROPEAN CONFERENCE ON DEVELOPMENTAL PSYCHOLOGY, 2011, : 307 - 311
[9] Word naming times and psycholinguistic norms for Italian nouns
Laura Barca
Cristina Burani
Lisa S. Arduino
Behavior Research Methods, Instruments, & Computers, 2002, 34 : 424 - 434
[10] Word naming times and psycholinguistic norms for Italian nouns
Barca, L
Burani, C
Arduino, LS
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 2002, 34 (03): : 424 - 434

← 1 2 3 4 5 →