Statistical word segmentation succeeds given the minimal amount of exposure

被引:0
|
作者
Hao Wang, Felix [1 ]
Luo, Meili [1 ]
Wang, Suiping [2 ]
机构
[1] Nanjing Normal Univ, Sch Psychol, Nanjing, Jiangsu, Peoples R China
[2] South China Normal Univ, Philosophy & Social Sci Lab Reading & Dev Children, Minist Educ, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Statistical learning; Word segmentation; Exposure amount; NONADJACENT DEPENDENCIES; PROBABILITY; PERFORMANCE; FREQUENCY; ADJACENT; INFANTS; IMPACT;
D O I
10.3758/s13423-023-02386-z
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
One of the first tasks in language acquisition is word segmentation, a process to extract word forms from continuous speech streams. Statistical approaches to word segmentation have been shown to be a powerful mechanism, in which word boundaries are inferred from sequence statistics. This approach requires the learner to represent the frequency of units from syllable sequences, though accounts differ on how much statistical exposure is required. In this study, we examined the computational limit with which words can be extracted from continuous sequences. First, we discussed why two occurrences of a word in a continuous sequence is the computational lower limit for this word to be statistically defined. Next, we created short syllable sequences that contained certain words either two or four times. Learners were presented with these syllable sequences one at a time, immediately followed by a test of the novel words from these sequences. We found that, with the computationally minimal amount of two exposures, words were successfully segmented from continuous sequences. Moreover, longer syllable sequences providing four exposures to words generated more robust learning results. The implications of these results are discussed in terms of how learners segment and store the word candidates from continuous sequences.
引用
收藏
页码:1172 / 1180
页数:9
相关论文
共 50 条
  • [1] Modeling human performance in statistical word segmentation
    Frank, Michael C.
    Goldwater, Sharon
    Griffiths, Thomas L.
    Tenenbaum, Joshua B.
    COGNITION, 2010, 117 (02) : 107 - 125
  • [2] Statistical Speech Segmentation and Word Learning in Parallel
    Yurovsky, Daniel
    Yu, Chen
    Smith, Linda B.
    PROCEEDINGS OF THE 34TH ANNUAL BOSTON UNIVERSITY CONFERENCE ON LANGUAGE DEVELOPMENT, VOLS 1 AND 2, 2010, : 491 - 502
  • [3] Testing the limits of statistical learning for word segmentation
    Johnson, Elizabeth K.
    Tyler, Michael D.
    DEVELOPMENTAL SCIENCE, 2010, 13 (02) : 339 - 345
  • [4] Statistical-based approach to word segmentation
    Wang, YL
    Phillips, IT
    Haralick, R
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS: APPLICATIONS, ROBOTICS SYSTEMS AND ARCHITECTURES, 2000, : 555 - 558
  • [5] Statistical word segmentation: Anchoring learning across contexts
    Antovich, Dylan M. M.
    Estes, Katharine Graf
    INFANCY, 2023, 28 (02) : 257 - 276
  • [6] Mongolian word segmentation based on statistical language model
    Hou, Hong-Xu
    Liu, Qun
    Nasanurtu
    Murengaowa
    Li, Jin-Tao
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2009, 22 (01): : 108 - 112
  • [7] Statistical cues in language acquisition: Word segmentation by infants
    Saffran, JR
    Aslin, RN
    Newport, EL
    PROCEEDINGS OF THE EIGHTEENTH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1996, : 376 - 380
  • [8] Statistical Hypothesis Testing for Handwritten Word Segmentation Algorithms
    Haji, Mehdi
    Sahoo, Kalyan Asis
    Bui, Tien. D.
    Suen, Ching Y.
    Ponson, Dominique
    13TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2012), 2012, : 114 - 119
  • [9] The link between statistical segmentation and word learning in adults
    Mirman, Daniel
    Magnuson, James S.
    Estes, Katharine Graf
    Dixon, James A.
    COGNITION, 2008, 108 (01) : 271 - 280
  • [10] Perceptual intake explains variability in statistical word segmentation
    Wang, Felix Hao
    Luo, Meili
    Wang, Suiping
    COGNITION, 2023, 241