Co‐occurrence statistics as a language‐dependent cue for speech segmentation

Amanda Saksida, Alan Langus, Marina Nespor

Published online on May 04, 2016

Abstract

To what extent can language acquisition be explained in terms of different associative learning mechanisms? It has been hypothesized that distributional regularities in spoken languages are strong enough to elicit statistical learning about dependencies among speech units. Distributional regularities could be a useful cue for word learning even without rich language‐specific knowledge. However, it is not clear how strong and reliable the distributional cues are that humans might use to segment speech. We investigate cross‐linguistic viability of different statistical learning strategies by analyzing child‐directed speech corpora from nine languages and by modeling possible statistics‐based speech segmentations. We show that languages vary as to which statistical segmentation strategies are most successful. The variability of the results can be partially explained by systematic differences between languages, such as rhythmical differences. The results confirm previous findings that different statistical learning strategies are successful in different languages and suggest that infants may have to primarily rely on non‐statistical cues when they begin their process of speech segmentation. Co‐occurrence statistics are not equally informative about word boundaries in all languages. A possible source of statistical variance between languages is linguistic rhythm – in stress‐timed languages (English, Polish, Dutch), co‐occurrence statistics are more informative when an absolute threshold is selected, and in mora‐timed languages (Japanese, Tamil), relative thresholding yields better word segmentation. Infants might therefore use language‐specific information about rhythm to narrow down possible associative strategies to segment speech.