Towards a computational morphological analysis of Setswana compounds

The development of a computational morphological analyser for Setswana necessitates the accurate modelling and implementation of, among others, compounding as a word formation process. Compounding is known to be an area of Setswana morphology that has sadly been neglected and still requires much investigation and research. The main purpose of this article is to investigate the formation of noun + noun compounds by computational morphological means in order to understand how this process should be formalised, modelled and subsequently implemented. In particular, an empirical study based on a collection of Setswana noun + noun compounds is reported on. The computational morphological analysis of these compounds revealed linguistic deviations from the standard morphological rules governing the formation of nouns and deverbatives. Examples, computational results and a discussion of the main findings are included.

Download Full-text

Adjective–noun compounds in Mandarin: a study on productivity

Corpus Linguistics and Linguistic Theory ◽

10.1515/cllt-2020-0059 ◽

2021 ◽

Vol 0 (0) ◽

Author(s):

Tian Shen ◽

R. Harald Baayen

Keyword(s):

Formation Process ◽

Word Formation ◽

Distributional Semantics ◽

Semantic Transparency ◽

Noun Compounds ◽

Hapax Legomena

Abstract In structuralist linguistics, compounds are argued not to constitute morphological categories, due to the absence of systematic form-meaning correspondences. This study investigates subsets of compounds for which systematic form-meaning correspondences are present: adjective–noun compounds in Mandarin. We show that there are substantial differences in the productivity of these compounds. One set of productivity measures (the count of types, the count of hapax legomena, and the estimated count of unseen types) reflect compounds’ profitability. By contrast, the category-conditioned degree of productivity is found to correlate with the internal semantic transparency of the words belonging to a morphological category. Greater semantic transparency, gauged by distributional semantics, predicts greater category-conditioned productivity. This dovetails well with the hypothesis that semantic transparency is a prerequisite for a word formation process to be productive.

Download Full-text

WORD FORMATION IN SHARENA DELON’S INSTAGRAM POSTS: A MORPHOLOGICAL ANALYSIS

KLAUSA (Kajian Linguistik Pembelajaran Bahasa dan Sastra) ◽

10.33479/klausa.v4i02.310 ◽

2021 ◽

Vol 4 (02) ◽

pp. 89-106

Author(s):

Crisnova Katalonika Siahaan

Keyword(s):

Formation Process ◽

Morphological Analysis ◽

Word Formation ◽

Conversion Process ◽

Descriptive Research ◽

Part Of Speech ◽

Qualitative Descriptive ◽

Data Source ◽

Derivation Process

This paper is a qualitative descriptive research which investigates the processes of word formation and their influence to part of speech of the data. The data source in this study is Instagram captions of Sharena Delon. The method of collecting data is by observation. The method of analyzing data is agih and padan method from Sudaryanto. The author uses the theory ofO’Grady which divides word formation process into 10 types, i.e.derivation, compounding, conversion, clipping, blends, backformation, acronym, onomatopoeia, coinage, and inflection. The research uses the theory of Carter & McCarthy to show the change of part of speech of data. The result of this research shows that based on O’Grady’s and Carter & McCarthy’s theory. There are 30 types of word formation process found in Instagram of Sharena Delon. Three types are by derivation process, 2 types by compounding process, 2 types by conversion process, 6 types by clipping process, 6 types by blends process, 2 types by backformation process, 3 types by acronym process, 3 types by onomatopoeia process, 1 type by coinage process, and two types by inflection process. Some data have change of part of speech because of word formation process and some do not.

Download Full-text

The Word Class Adjective in English Business Magazines Online

Journal of Language and Cultural Education ◽

10.2478/jolace-2018-0018 ◽

2018 ◽

Vol 6 (2) ◽

pp. 99-115

Author(s):

Borislav Marušić ◽

Sanda Katavić-Čaušić

Keyword(s):

Empirical Study ◽

Empirical Analysis ◽

Word Formation ◽

Word Class ◽

Business English ◽

Starting Point ◽

Comprehensive Picture ◽

English Syntax ◽

The Empirical Analysis ◽

Insight Into

Abstract The aim of this paper is to research the word class adjective in one sequence of the ESP: Business English, more precisely English business magazines online. It is an empirical study on the corpus taken from a variety of business magazines online. The empirical analysis allows a comprehensive insight into the word class adjective in this variety of Business English and makes its contribution to English syntax, semantics and word formation. The syntactic part analyses the adjective position in the sentence. The semantic part of the study identifies the most common adjectives that appear in English business magazines online. Most of the analysis is devoted to the word formation of the adjectives found in the corpus. The corpus is analysed in such a way that it enables its division into compounds, derivatives and conversions. The results obtained in this way will give a comprehensive picture of the word class adjective in this type of Business English and can act as a starting point for further research of the word class adjective.

Download Full-text

Joint Semantic Synthesis and Morphological Analysis of the Derived Word

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00003 ◽

2018 ◽

Vol 6 ◽

pp. 33-48 ◽

Cited By ~ 4

Author(s):

Ryan Cotterell ◽

Hinrich Schütze

Keyword(s):

Neural Networks ◽

Recurrent Neural Networks ◽

Probabilistic Model ◽

Morphological Analysis ◽

Semantic Representation ◽

English Word ◽

Word Formation ◽

Additive Models ◽

Segmentation Accuracy ◽

Morphological Productivity

Much like sentences are composed of words, words themselves are composed of smaller units. For example, the English word questionably can be analyzed as question+ able+ ly. However, this structural decomposition of the word does not directly give us a semantic representation of the word’s meaning. Since morphology obeys the principle of compositionality, the semantics of the word can be systematically derived from the meaning of its parts. In this work, we propose a novel probabilistic model of word formation that captures both the analysis of a word w into its constituent segments and the synthesis of the meaning of w from the meanings of those segments. Our model jointly learns to segment words into morphemes and compose distributional semantic vectors of those morphemes. We experiment with the model on English CELEX data and German DErivBase (Zeller et al., 2013) data. We show that jointly modeling semantics increases both segmentation accuracy and morpheme F1 by between 3% and 5%. Additionally, we investigate different models of vector composition, showing that recurrent neural networks yield an improvement over simple additive models. Finally, we study the degree to which the representations correspond to a linguist’s notion of morphological productivity.

Download Full-text

The Linguistic Features of Acronyms in Arabic

SAGE Open ◽

10.1177/2158244017746710 ◽

2017 ◽

Vol 7 (4) ◽

pp. 215824401774671

Author(s):

Abdel Rahman Mitib Altakhaineh

Keyword(s):

Formation Process ◽

Relevance Theory ◽

Contextual Effects ◽

Word Formation ◽

Linguistic Features ◽

Radio Stations ◽

Depth Analysis ◽

The Media ◽

Processing Effort ◽

Over Time

This study investigates the phonological, semantic, and pragmatic features of acronyms in Arabic. Acronyms in Arabic have appeared quite recently as a result of globalization and exposure to or contact with, mainly, English via radio stations and TV channels, which are broadcasting in English and in some countries, for example, Morocco in both English and French. Through in-depth analysis, it has been observed that acronyms in Arabic are subject to different restrictions: (a) The phonological combinations are formed on the basis of Arabic templates; hence, should be compatible with Arabic phonotactics, for example, consonant clusters should be broken up by vowels; (b) the connotation of the acronyms should not be negative; and (c) in conformity with relevance theory, when the acronyms are homophonous to existing words, the former maximize contextual effects with minimum processing effort. The fact that they appear in certain contexts also reduces the processing effort. It has also become evident that the period between the establishment of the movement or party and the first use of the acronym decreases over time, provided that the acronyms are frequently mentioned in the media. The examination of acronyms in different languages shows that acronymization is quite pervasive cross-linguistically; this may suggest that not any word-formation process can easily spread; it needs to be prevalent and potentially universal.

Download Full-text

The paradigmatic aspect of compounding and derivation

Journal of Linguistics ◽

10.1017/s0022226718000518 ◽

2018 ◽

Vol 55 (3) ◽

pp. 563-609

Author(s):

AYSUN KUNDURACI

Keyword(s):

Current System ◽

Word Formation ◽

Morphological Operations ◽

Dynamic Aspect ◽

Noun Compounds ◽

Autonomous Morphology

This study aims to show the dynamic aspect of word-formation paradigms in autonomous morphology by examining the compound marker in Turkish Noun–Noun compounds, as in buz paten-i ‘ice-skate (ice skate-cm)’, and its relation to derivational suffixes. The study proposes a process-based morphological paradigm structure which involves compounding and derivational operations. In this system, the compound marker has a formal paradigmatic function: it creates correct lexeme forms based on bare Noun–Noun compounds, which would otherwise serve as input to certain derivational operations. The current system thus accounts for both permitted and unpermitted suffix combinations involving compounding and the optionality in certain combinations, such as buz paten-ci (-si) ‘a/the ice skater (ice skate-agt-cm)’, where the compound marker may (not) appear in combination with the (derivational) agentive -CI. The study also presents a survey which implies that a group of derivational affixes is in a paradigmatic relation with the compound marker, and all of these affixations constitute alternative paths in a dynamic paradigm structure. The findings of the study are considered to contribute to the understanding of the nature of the autonomous morphological operations and paradigms, which cannot be restricted to the lexicon or manipulated by syntax.

Download Full-text

Word-Formation Process on COVID-19 Related Terms as Japanese Language Variation

IZUMI ◽

10.14710/izumi.10.2.304-315 ◽

2021 ◽

Vol 10 (2) ◽

pp. 304-315

Author(s):

Nadya Inda Syartanti

Keyword(s):

Formation Process ◽

Word Formation ◽

Formation Processes ◽

Preventive Action ◽

Distribution Method ◽

Disease Information ◽

Multiple Processes ◽

Source Data ◽

Data Source ◽

Observation Method

This research aims to identify the types of vocabulary in Japanese, to explore the word-formation process, and to analyse the function of COVID-19 related terms conveyed by various content creators on YouTube channels, from Japan and Indonesia. The seven YouTube channel accounts are Aki no Sora (Indonesia), po.n.go_id or Pocket Nihongo (Indonesia), Sakura Pinku (Indonesia), Wagomu (Indonesia), Omoshiroi Nihongo (Japan), Japanese Ammo with Misa (Japan), and Coto Academy (Japan that used as the data source. Data were collected using the observation method and analysed by the distribution method. The results showed that the COVID-19 related terms are dominated by the type of vocabulary kango as the form or type of vocabulary that is most widely used in matching the COVID-19 related term. Among all word-formation processes, the COVID-19 terms in Japanese identified as borrowing, compounding, and multiple processes. However, the multiple word-formation processes are dominated on COVID-19 terms in Japanese. It related to the domination of the type of kango used. From that, there are multiple functions that are included in COVID-19 terms in Japanese, namely disease information, preventive action, symptom, and announcement. Therefore, this research can be contributed to data analysis, which used morphological analysis in Japanese terms.

Download Full-text

Exploring the word formation process on Rich Brian�s official music video comments

EduLite Journal of English Education Literature and Culture ◽

10.30659/e.6.1.198-212 ◽

2021 ◽

Vol 6 (1) ◽

pp. 198

Author(s):

Syifa Dwi Mutiah ◽

Didin Nuruddin Hidayat ◽

Alek Alek

Keyword(s):

Formation Process ◽

Secondary Data ◽

Social Function ◽

Word Formation ◽

Primary Data ◽

Music Video ◽

Secondary Source ◽

Self Identity ◽

Standard Data ◽

Depth Analysis

This study attempted to explore the process of word formation and its social function of slang words on Rich Brian's official music video, precisely in the comment section of the newest song of him, D.O.A that released on 25 August 2020. This study employed discourse and sociolinguistics aspects; hence, this study's research design was a descriptive qualitative. Additionally, this study's object was a one-month comment section of Rich Brian's newest song official music video. Furthermore, there were two sources of data, primary and secondary data sources. The primary data were a comment section of Rich Brian's newest official song music video, while the secondary source was from available works of literature and urban dictionary. Therefore, to collect the data, this study did the documentation from the D.O.A. official music video from one-month duration comments. As the data obtained, there were 49 slang words found. Then, they were analyzed through Yule (2010)�s theory as the basis of word formation process standard, while Zhou & Fan (2013)�s theory was used as the basis of word formation function standard. Data analysis is divided into several steps: reading, collecting, classifying, and analyzing the data. The result showed that derivation was the highest frequency of the word-formation process. It was 22%, the acronym was 18%, coinage was 16%, the conversion was 14%, blending was 12%, compounding was 8%, clipping was 6%, borrowing and multiple processes were 2% of each. Besides, the highest social function was to express emotive feeling with 49 % quantity of the frequency. The second higher was to pursue self-identity, achieving politeness was 8%. This study suggested a more in-depth analysis of non-standard languages, such as swear words or taboo words and emoticon that can be done through a social semiotic approach.

Download Full-text

The comparative study between Indonesian blend words and English blend words

LingTera ◽

10.21831/lt.v6i1.22785 ◽

2019 ◽

Vol 6 (1) ◽

pp. 1-7

Author(s):

Mia Rahmannia ◽

Pratomo Widodo

Keyword(s):

Comparative Study ◽

Formation Process ◽

Qualitative Method ◽

Word Formation ◽

Journal Articles ◽

Similarities And Differences ◽

The Difference ◽

The Comparative Study ◽

Descriptive Qualitative

Generally, blending can be defined as combining two clipped words. Kvetko in Bednarova defines blending as a word formation process similar to shortening combined with merging two different words. There are many types of blending words both in Indonesian and English context. Therefore, the aim of this research is to analyze the comparison between Indonesian blend words and English blend words in terms of its types, its similarities and differences and also how its formed. In this paper the researcher use descriptive qualitative method as the method of the research. The source of the data is taken from some journal articles that relevant with blend words both in Indonesian and English context. The result showed that Indonesian blend words and English blend words not only have similarities in the forming word, but also have their differences that make them unique to each other. The writer hopes that the reader of this article gets more information about both Indonesian and English blend words based on its types and the similarities and the difference between them.

Download Full-text

Analisis Ragam Bahasa Istilah Dalam Iklan Pariwisata Di Media Digital Pada Masa Pandemi Covid-19

Journey : Journal of Tourismpreneurship, Culinary, Hospitality, Convention and Event Management ◽

10.46837/journey.v4i1.66 ◽

2021 ◽

Vol 4 (1) ◽

pp. 19-42

Keyword(s):

Social Media ◽

Digital Media ◽

Formation Process ◽

Qualitative Method ◽

Formal Languages ◽

Word Formation ◽

Characteristic Pattern ◽

Language Varieties ◽

Language Style ◽

Theory Of Language

This study aimed ro describe and determine the characteristic patterns and the word formation process of various language terms used in tourism advertisements on digital media during the COVID-19 pandemic. There are several new terms that are widely used by people all over the globe, and tourism adevertisements are no excpetion. This study used descriptive qualitative method. The data were taken from advertisements found on social media Facebook and Instagram. The thoery used in this research is the theory of language style proposed by Chaer & Agustina (2010) and theory of word formation process by Chaer (2008). It can be concluded that: (1) the characteristic pattern of the variety of languages used in tourism advertisements on social media Facebook and Instagram during the COVID-19 pandemic is a variety of non-formal languages, and (2) the variety of terms used tourism advertisements on social media Facebook and Instagram during the COVID-19 pandemic are formed from blending, abbreviations and compounding. Keywords: Language Varieties, Terms, Advertisement, COVID-19

Download Full-text