Patterns of probabilistic segment deletion/reduction in English and Japanese
Abstract
Probabilistic phonetic reduction is widely attested in a variety of languages, acoustic domains, and interpretations of predictability. Less well-studied is the categorical effect of probabilistic segment deletion, which in principle is subject to similar pressures. This paper presents the results of an exploratory study into patterns of segment deletion in corpora of spontaneous speech in English and Japanese. Analysis at the word level reveals that words with more phonemes and higher-frequency words tend to have more of their segments deleted. Analysis at the phoneme level reveals that high-probability phonemes are more likely to be deleted than low-probability phonemes. For Japanese only, this analysis also shows effects of word length, frequency, and neighborhood density on deletion probability. Taken together, these results suggest that several large-scale patterns of probabilistic segment deletion mirror the processes of phonetic reduction and apply to both languages. Some patterns, though, appear to be language-specific, and it is not clear to what extent languages can and do differ in this regard. These findings are discussed in terms of our understanding of the universality of proposed predictability effects, and in terms of probabilistic reduction more broadly.