A database of orthography-semantics consistency (OSC) estimates for 15,017 English words

Cited by: 42
Authors
Marelli, Marco [1]
Amenta, Simona [2]
Affiliations
[1] Univ Milano Bicocca, Dept Psychol, Pzza Ateneo Nuovo 1, I-20126 Milan, MI, Italy
[2] Univ Ghent, Dept Expt Psychol, Ghent, Belgium
Keywords
Orthography-semantics consistency; Form-meaning mapping; Word recognition; Lexical resources; Distributional semantic models; LEXICAL DECISION; FEEDBACK SEMANTICS; FREQUENCY; NEIGHBORHOOD; RECOGNITION; SPACE; TRANSPARENCY; ACTIVATION; INDUCTION; RICHNESS
DOI
10.3758/s13428-018-1017-8
Chinese Library Classification (CLC) number
B841 [Research methods in psychology]
Discipline classification code
040201
Abstract
Orthography-semantics consistency (OSC) is a measure that quantifies the degree of semantic relatedness between a word and its orthographic relatives. OSC is computed as the frequency-weighted average semantic similarity between the meaning of a given word and the meanings of all the words containing that very same orthographic string, as captured by distributional semantic models. We present a resource including optimized estimates of OSC for 15,017 English words. In a series of analyses, we provide a progressive optimization of the OSC variable. We show that computing OSC from word-embeddings models (in place of traditional count models), limiting preprocessing of the corpus used for inducing semantic vectors (in particular, avoiding part-of-speech tagging and lemmatization), and relying on a wider pool of orthographic relatives provide better performance for the measure in a lexical-processing task. We further show that OSC is an important and significant predictor of reaction times in visual word recognition and word naming, one that correlates only weakly with other psycholinguistic variables (e.g., family size, word frequency), indicating that it captures a novel source of variance in lexical access. Finally, we discuss some theoretical and methodological implications of adopting OSC as one of the predictors of reaction times in studies of visual word recognition.
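The definition in the abstract (a frequency-weighted average of semantic similarity between a word and the words containing its orthographic string) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `cosine` and `osc`, the toy vectors, and the toy frequencies are all hypothetical, cosine similarity is assumed as the similarity measure, and the target word is assumed to be excluded from its own set of relatives.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def osc(target, vectors, freqs):
    """Frequency-weighted mean similarity between `target` and its relatives.

    Relatives are taken to be all other words that contain `target` as a
    substring (an assumption made for this sketch). Returns None when the
    word has no relatives or their total frequency is zero.
    """
    relatives = [w for w in vectors if w != target and target in w]
    total_freq = sum(freqs[w] for w in relatives)
    if total_freq == 0:
        return None
    weighted = sum(
        freqs[w] * cosine(vectors[target], vectors[w]) for w in relatives
    )
    return weighted / total_freq


# Toy two-dimensional "semantic" vectors and corpus frequencies (made up).
vectors = {
    "corn": [1.0, 0.0],
    "corner": [0.0, 1.0],      # orthographic relative, semantically unrelated
    "corns": [1.0, 0.0],       # orthographic relative, semantically identical
    "cornfield": [1.0, 1.0],   # orthographic relative, partly related
}
freqs = {"corn": 50, "corner": 30, "corns": 10, "cornfield": 10}

score = osc("corn", vectors, freqs)
print(f"OSC(corn) = {score:.3f}")
```

In this toy example the frequent but unrelated relative "corner" pulls the score down, which is the intuition behind the measure: a word whose orthographic relatives mostly share its meaning gets a high OSC, and one with frequent unrelated relatives gets a low one.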
Pages: 1482-1495
Page count: 14
Related papers
2 in total
  • [1] A database of orthography-semantics consistency (OSC) estimates for 15,017 English words
    Marelli, Marco
    Amenta, Simona
    Behavior Research Methods, 2018, 50: 1482-1495
  • [2] Semantic transparency in free stems: The effect of Orthography-Semantics Consistency on word recognition
    Marelli, Marco
    Amenta, Simona
    Crepaldi, Davide
    Quarterly Journal of Experimental Psychology, 2015, 68 (8): 1571-1583