token frequency
Recently Published Documents


TOTAL DOCUMENTS: 89 (FIVE YEARS: 22)

H-INDEX: 14 (FIVE YEARS: 1)

2021 ◽  
pp. 1-37
Author(s):  
HELEN SIMS-WILLIAMS

This paper demonstrates that morphological change tends to involve the replacement of low frequency forms in inflectional paradigms by innovative forms based on high frequency forms, using Greek data involving the diachronic reorganisation of verbal inflection classes. A computational procedure is outlined for generating a possibility space of morphological changes which can be represented as analogical proportions, on the basis of synchronic paradigms in ancient Greek. I then show how supplementing analogical proportions with token frequency information can help to predict whether a hypothetical change actually took place in the language’s subsequent development. Because of the crucial role of inflected surface forms serving as analogical bases in this model, I argue that the results support theories in which inflected forms can be stored whole in the lexicon.
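As a rough illustration of what an analogical proportion (a : b :: c : x) looks like computationally, the sketch below solves a proportion by simple suffix replacement and then applies a frequency check. It is not the procedure outlined in the paper: the function names, the suffix-only matching, the English toy forms, and the rule "the base must be more frequent than the replaced form" are all assumptions made for illustration.

```python
import os


def solve_proportion(a, b, c):
    """Solve a : b :: c : x by suffix replacement (a simplifying assumption).

    Returns the innovative form x, or None if c does not end in the
    suffix that distinguishes a from b.
    """
    prefix_len = len(os.path.commonprefix([a, b]))
    old_suffix, new_suffix = a[prefix_len:], b[prefix_len:]
    if not c.endswith(old_suffix):
        return None
    stem = c[:len(c) - len(old_suffix)]
    return stem + new_suffix


def favours_change(base_freq, replaced_freq):
    """Hypothetical heuristic: a change is favoured when the analogical
    base has a higher token frequency than the form it would replace."""
    return base_freq > replaced_freq


# Toy English forms, not Greek data from the study.
print(solve_proportion("walk", "walked", "talk"))  # suffix "" -> "ed"
print(solve_proportion("sing", "sang", "ring"))    # suffix "ing" -> "ang"
print(solve_proportion("sing", "sang", "walk"))    # proportion does not apply
```

A fuller model would need to handle prefixes, stem alternations and accent, which plain suffix replacement ignores.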


2021 ◽  
Vol 24 (1) ◽  
pp. 1-37
Author(s):  
Hanne Surkyn ◽  
Reinhild Vandekerckhove ◽  
Dominiek Sandra

The present study examines unintentional spelling errors in past participles produced by Flemish teenagers in private online writing. Previous psycholinguistic research on verb spelling errors in Dutch mainly focused on identical homophones (Bosman 2005; Frisson & Sandra 2002; Sandra et al. 1999). The present study, however, deals with past participles that are only partially homophonous with other forms in the inflectional paradigm and investigates whether the spelling of these verbs is affected by whole-word frequency, paradigmatic and bigram support for the correct spelling and the token frequency of the past participles’ morphological family. The error rates reflect the effect of both paradigmatic and bigram support. Moreover, the unique database makes it possible to analyze the impact of three social factors (Gender, Educational track and Age). Our results reveal an effect of all three social variables on the error rates. Finally, these social factors do not interact with paradigmatic and bigram support.


Author(s):  
Jiaqiang Zhu ◽  
Xiaoxiang Chen ◽  
Fei Chen ◽  
Seth Wiener

Purpose: Individuals with congenital amusia exhibit degraded speech perception. This study examined whether adult Chinese Mandarin listeners with amusia were still able to extract the statistical regularities of Mandarin speech sounds, despite their degraded speech perception. Method: Using the gating paradigm with monosyllabic syllable–tone words, we tested 19 Mandarin-speaking amusics and 19 musically intact controls. Listeners heard increasingly longer fragments of the acoustic signal across eight duration-blocked gates. The stimuli varied in syllable token frequency and syllable–tone co-occurrence probability. The correct syllable–tone word, correct syllable-only, correct tone-only, and correct syllable–incorrect tone responses were compared respectively between the two groups using mixed-effects models. Results: Amusics were less accurate than controls in terms of the correct word, correct syllable-only, and correct tone-only responses. Amusics, however, showed consistent patterns of top-down processing, as indicated by more accurate responses to high-frequency syllables, high-probability tones, and tone errors all in manners similar to those of the control listeners. Conclusions: Amusics are able to learn syllable and tone statistical regularities from the language input. This extends previous work by showing that amusics can track phonological segment and pitch cues despite their degraded speech perception. The observed speech deficits in amusics are therefore not due to an abnormal statistical learning mechanism. These results support rehabilitation programs aimed at improving amusics' sensitivity to pitch.


2021 ◽  
pp. 1-28
Author(s):  
Mine NAKİPOĞLU ◽  
Berna A. UZUNDAG ◽  
Özge SARIGÜL

Children's remarkable ability to generalize beyond the input and the resulting overregularizations/irregularizations provide a platform for a discussion of whether morphology learning uses analogy-based, rule-based, or statistical learning procedures. The present study, testing 115 children (aged 3 to 10) on an elicited production task, investigated the acquisition of the irregular distribution in the Turkish causative. Results showed that in early acquisition, to pin down the four causative suffixes, children engaged in comparisons between analogous exemplars. Thereafter, to tackle the irregularity in two of the suffixes, children entertained competing hypotheses that yielded overregularizations and irregularizations. Overregularizations were instances of abstraction across the input based on type frequency; irregularizations were attempts to default to erroneous micro-generalizations. A negative correlation between errors and verb frequency suggested that recovery from errors was sensitive to token frequency. The overgeneralize-then-recover pattern that emerged in the acquisition of the causative supported an integrated account of the roles of analogy, abstraction, and frequency in morphology learning.


2021 ◽  
Vol 137 (4) ◽  
pp. 961-1013
Author(s):  
Sabine Heinemann

Normally, verbs showing allomorphy due to phonological changes become more regular over time through intraparadigmatic analogy. There are verbs, though, such as auxiliaries, modals or basic verbs of movement, which stay irregular, evidently due to their extremely high token frequency. Studies of Germanic languages in particular show that irregularity often relates to a development leading to a clear(er) distinction of single forms within a paradigm. The article examines the applicability of the concept of relevance developed by Bybee (1985), the significance of token frequency and other factors discussed in studies anchored in the usage-based approach with regard to the French and Italian verbs aller/andare, venir/venire and – for Old French – ester (cf. the suppletive paradigm of être) and Italian stare, considering the development from Latin to modern French and Italian.


2021 ◽  
Vol 6 (1) ◽  
Author(s):  
Hannah Lutzenberger ◽  
Connie de Vos ◽  
Onno Crasborn ◽  
Paula Fikkert

Sign language lexicons incorporate phonological specifications. Evidence from emerging sign languages suggests that phonological structure emerges gradually in a new language. In this study, we investigate variation in the form of signs across 20 deaf adult signers of Kata Kolok, a sign language that emerged spontaneously in a Balinese village community. Combining methods previously used for sign comparisons, we introduce a new numeric measure of variation. Our nuanced yet comprehensive approach to form variation integrates three levels (iconic motivation, surface realisation, feature differences) and allows for refinement through weighting the variation score by token and signer frequency. We demonstrate that variation in the form of signs appears in different degrees at different levels. Token frequency in a given dataset greatly affects how much variation can surface, suggesting caution in interpreting previous findings. Different sign variants have different scopes of use among the signing population, with some more widely used than others. Both frequency weightings (token and signer) identify dominant sign variants, i.e., sign forms that are produced frequently or by many signers. We argue that variation does not equal the absence of conventionalisation. Indeed, especially in micro-community sign languages, variation may be key to understanding patterns of language emergence.
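The two frequency weightings mentioned above can be illustrated with a small counting sketch. It assumes a hypothetical data layout, a list of (signer, variant) observations for one sign, and only shows how token frequency and signer frequency can single out different dominant variants; it does not reproduce the study's variation score.

```python
from collections import defaultdict


def variant_frequencies(observations):
    """Map each variant to (token_frequency, signer_frequency).

    observations: iterable of (signer_id, variant) pairs for one sign
    (a hypothetical layout, not the study's data format).
    """
    tokens = defaultdict(int)
    signers = defaultdict(set)
    for signer_id, variant in observations:
        tokens[variant] += 1             # every production counts
        signers[variant].add(signer_id)  # each signer counts once
    return {v: (tokens[v], len(signers[v])) for v in tokens}


def dominant_variant(observations, by="token"):
    """Return the most frequent variant by token or by signer count.
    Ties go to the variant seen first."""
    freqs = variant_frequencies(observations)
    index = 0 if by == "token" else 1
    return max(freqs, key=lambda v: freqs[v][index])


# Toy data: one signer produces variant A often; three signers use B.
toy = [("s1", "A")] * 4 + [("s2", "B"), ("s3", "B"), ("s4", "B")]
print(variant_frequencies(toy))
print(dominant_variant(toy, by="token"), dominant_variant(toy, by="signer"))
```

In the toy data, variant A wins on token frequency while variant B wins on signer frequency, which is why the two weightings are worth reporting separately.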


2021 ◽  
Vol 21 (2) ◽  
pp. 391-402
Author(s):  
Gede Primahadi Wijaya Rajeg ◽  
Utei Charaleghy Pamphila

This paper investigates the quantitative distribution (type and token frequencies, and type-per-token ratio [TTR]) of motion verbs found in English and Indonesian versions of the novel Twilight (Meyer, 2005; Sari, 2008). The study is contextualized within two divergent views on the typological characteristics of Indonesian lexicalization patterns of motion events. One study (Son, 2009) suggests that Indonesian behaves like English, representing a satellite-framed pattern (i.e., lexicalizing Manner of motion in the main verb) while another study (Wienold, 1995) argues for the verb-framed nature of Indonesian (i.e., lexicalizing Path of motion in the main verb). We seek to offer a quantitative perspective on these two proposals. Our study shows that, compared to English, Indonesian has a significantly higher number (i.e., types) and more occurrences (i.e., tokens) of Path verbs (reflecting the verb-framed pattern). Moreover, the higher TTR value of Path verbs for Indonesian shows a greater lexical diversity in the inventory of Indonesian Path verbs compared to English. In contrast, the English Manner verbs are significantly higher in number and in token frequency than the Indonesian ones (suggesting the satellite-framed pattern), and show greater lexical diversity given the higher TTR value. While these findings lean toward supporting the verb-framed pattern of Indonesian (Wienold, 1995), we caution that our conclusion has limitations and offer suggestions for future study.
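For readers unfamiliar with the three measures, the following minimal sketch computes type frequency, token frequency and TTR from a list of verb tokens. The function name and the toy verb list are assumptions for illustration and are not taken from the study's data.

```python
from collections import Counter


def type_token_stats(tokens):
    """Return (type count, token count, type-per-token ratio)."""
    counts = Counter(tokens)
    n_types = len(counts)             # number of distinct verbs
    n_tokens = sum(counts.values())   # total number of occurrences
    ttr = n_types / n_tokens if n_tokens else 0.0
    return n_types, n_tokens, ttr


# Hypothetical toy sample of Path verbs, not data from the novel.
path_verbs = ["enter", "exit", "enter", "rise", "enter", "exit"]
n_types, n_tokens, ttr = type_token_stats(path_verbs)
print(n_types, n_tokens, ttr)
```

A higher TTR means that the same number of tokens is spread over more distinct verbs, which is the sense of "lexical diversity" used in the abstract.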


Author(s):  
Niek Van Wettere

This paper examines the productivity of the subject complement slot in a set of French and Dutch (semi-)copular micro-constructions. The presumed counterpart of productivity, conventionalization in the form of high token frequency, will also be taken into account in the analysis of the productivity complex. On the one hand, it will be shown that prototypical copulas generally have a higher productivity than semi-copulas, although there are some semi-copulas that can rival the productivity of prototypical copulas. On the other hand, it will be demonstrated that high token frequency is in general detrimental to productivity, on the level of the entire subject complement slot and on the level of the different semantic classes. However, the shape of the frequency distribution also seems to play a role: multiple highly frequent types are, in my data, more detrimental to productivity than one extremely frequent type, although the semantic connectedness of the types in the distribution might also be an explanatory factor.


2021 ◽  
pp. 1-30
Author(s):  
Marta SZREDER ◽  
Laura E. DE RUITER ◽  
Dimitrios NTELITHEOS

This study investigates the acquisition of the Imperfective verb inflection paradigm in Emirati Arabic (EA), to determine whether the learning process is sensitive to the phonological and typological properties of the input. We collected data from 48 participants aged 2;7 to 5;9 years, using an elicited production paradigm. Input frequencies of inflectional contexts, verb types and tokens were obtained from corpora of child-directed and adult EA. Children's accuracy was inversely related to the input frequency of inflectional contexts, but not related to type and token frequency or phonological neighborhood density. Token frequency interacted with age, such that younger children performed considerably worse on low-frequency tokens, but older children performed equally well on high- and low-frequency tokens. We conclude that learning is input-driven, but that a sufficiently regular paradigm allows children to eventually generalise across all items earlier than in previously studied European languages.


2021 ◽  
Vol 9 (2) ◽  
pp. 31-52
Author(s):  
Jiang Liu ◽  
Seth Wiener

This study examined beginner-level Chinese as a Foreign Language (CFL) learners’ production of newly learned words in an image naming and pinyin-reading task. Fifteen L1-English CFL learners learned 10 tonal monosyllabic minimal pairs (e.g., shu1 and shu3) in a three-day sound-image word learning experiment. Ten of the 20 words were homophonous with previously learned words (e.g., participants already knew that shu1 means ‘book’), while the other 10 were not (e.g., no shu3 words had been learned). Ten of the 20 words had frequent phonology that participants were familiar with (e.g., shi is a high token frequency syllable), while the other 10 had infrequent phonology (e.g., ku is a low token frequency syllable). On the last day of learning, participants performed an image naming task followed by a pinyin-reading task. The recorded word tokens from both tasks were then played to 10 native Chinese speakers who were asked to transcribe the words in pinyin. The results showed that overall word production in the pinyin-reading task was more accurate than in image naming. The pinyin-reading advantage was robust, but homophone status and syllable token frequency also interacted with task type: learners produced syllables with high token frequency but without homophones equally well in the pinyin-reading and naming tasks. These results suggest that phonological encoding in long-term memory based on pinyin orthography can be affected by learners’ prior phonological and lexical knowledge. Pedagogical applications and limitations of the study are also discussed.

