Acoustic covariants of length contrast in Japanese stops

This study explores acoustic correlates of the singleton vs. geminate stop length contrast in Japanese. The proposal examined is that multiple acoustic features covary with the stop length distinction and that these features are available in the signal as potential secondary cues. The results support the proposal, revealing the presence of several acoustic features covarying with the singleton vs. geminate contrast in both durational and non-durational domains. Specifically, the preceding vowel is longer, the following vowel is shorter, there are greater fundamental frequency and intensity changes from the preceding to the following vowel, and there is evidence of more creakiness in voice quality for geminate than singleton consonants. It is also demonstrated that the vowel durations, as well as fundamental frequency and intensity changes, have fairly strong categorization power.


Spectral- and Cepstral-Based Acoustic Features of Dysphonic, Strained Voice Quality

Annals of Otology Rhinology & Laryngology ◽

10.1177/000348941212100808 ◽

2012 ◽

Vol 121 (8) ◽

pp. 539-548 ◽

Cited By ~ 44

Author(s):

Soren Y. Lowell ◽

Richard T. Kelley ◽

Shaheen N. Awan ◽

Raymond H. Colton ◽

Natalie H. Chan

Keyword(s):

Voice Quality ◽

Acoustic Features


Frequency, Intensity, and Target Matching Effects on Photoglottographic Measures of Open Quotient and Speed Quotient

Journal of Speech Language and Hearing Research ◽

10.1044/jshr.3301.45 ◽

1990 ◽

Vol 33 (1) ◽

pp. 45-50 ◽

Cited By ~ 31

Author(s):

David G. Hanson ◽

Bruce R. Gerratt ◽

Gerald S. Berke

Keyword(s):

Fundamental Frequency ◽

Significant Interaction ◽

Normal Male ◽

Similar Frequency ◽

Intensity Changes ◽

Male Subjects ◽

Target Matching ◽

Frequency Intensity

Measurements of Open Quotient (OQ) and Speed Quotient (SQ) were made from photoglottographic signals of normal male subjects during phonation. Samples were obtained at spontaneous levels of fundamental frequency and intensity, and at nine specified frequency/intensity combinations. OQ increased with fundamental frequency. OQ change was not significant for change in intensity and there was no significant interaction between frequency and intensity. Changes in SQ with variations of frequency and intensity were not significant. However, SQ did increase significantly when spontaneous phonation was compared to target matching phonation at similar frequency/intensity. Changes in both OQ and SQ across comfortable frequency and intensity ranges were relatively small in comparison to changes in OQ and SQ reported for pathological phonation.
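The two measures named in this abstract can be illustrated with a minimal sketch. It assumes the commonly used definitions (OQ as the open phase divided by the cycle period, SQ as the opening phase divided by the closing phase); the abstract does not state the authors' exact measurement procedure, and the function names and timing values below are hypothetical.

```python
def open_quotient(t_open, t_close, period):
    """Open phase duration divided by the full glottal cycle period.

    Assumed definition; all times must share one unit (e.g. ms).
    """
    if period <= 0:
        raise ValueError("period must be positive")
    return (t_close - t_open) / period


def speed_quotient(t_open, t_peak, t_close):
    """Opening phase duration divided by closing phase duration.

    Assumed definition; t_peak is the instant of maximum glottal opening.
    """
    closing = t_close - t_peak
    if closing <= 0:
        raise ValueError("t_close must come after t_peak")
    return (t_peak - t_open) / closing


# Hypothetical cycle: opens at 0 ms, peaks at 3 ms, closes at 5 ms, period 8 ms.
print(open_quotient(0.0, 5.0, 8.0))
print(speed_quotient(0.0, 3.0, 5.0))
```

Both values are unitless ratios, so cycles of different fundamental frequency can be compared directly.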


Acoustic correlates of voice quality improvement by voice training

10.21437/interspeech.2010-750 ◽

2010 ◽

Author(s):

Kiyoaki Aikawa ◽

Junko Uenuma ◽

Tomoko Akitake

Keyword(s):

Quality Improvement ◽

Voice Quality ◽

Voice Training ◽

Acoustic Correlates


A Comparison of Acoustic Correlates of Voice Quality Across Different Recording Devices: A Cautionary Tale

10.21437/interspeech.2021-729 ◽

2021 ◽

Author(s):

Joshua Penney ◽

Andy Gibson ◽

Felicity Cox ◽

Michael Proctor ◽

Anita Szakay

Keyword(s):

Voice Quality ◽

Cautionary Tale ◽

Acoustic Correlates


Acoustic correlates of stress in Turkish Kabardian

Journal of the International Phonetic Association ◽

10.1017/s0025100309990259 ◽

2010 ◽

Vol 40 (1) ◽

pp. 35-58 ◽

Cited By ~ 7

Author(s):

Matthew Gordon ◽

Ayla Applebaum

Keyword(s):

Fundamental Frequency ◽

Vowel Quality ◽

Acoustic Correlates ◽

Acoustic Study

This paper reports results of an acoustic study of stress in the Turkish dialect of the Northwest Caucasian language, Kabardian. Stressed syllables were found to have consistently higher fundamental frequency and characteristically greater duration and intensity than unstressed syllables. No evidence was found for secondary stresses. Schwa and, to a lesser extent, /ɐ/ were shown to undergo slight raising as their duration in unstressed syllables decreased. This gradient raising is likely due to coarticulatory overlap with adjacent consonants rather than a categorical shift in vowel quality. Considerations of articulatory effort rather than perceptual dispersion predict both the categorical alternation between stressed /aː/ and unstressed /ɐ/ in Kabardian and the non-categorical raising of schwa and /ɐ/ in unstressed syllables.


Acoustic Correlates of Compensatory Adjustments to the Glottic and Supraglottic Structures in Patients with Unilateral Vocal Fold Paralysis

BioMed Research International ◽

10.1155/2015/704121 ◽

2015 ◽

Vol 2015 ◽

pp. 1-9 ◽

Cited By ~ 4

Author(s):

Luis M. T. Jesus ◽

Joana Martinez ◽

Andreia Hall ◽

Aníbal Ferreira

Keyword(s):

Standard Deviation ◽

Fundamental Frequency ◽

Vocal Fold ◽

Normal Subjects ◽

Vocal Fold Paralysis ◽

Limited Information ◽

Unilateral Vocal Fold Paralysis ◽

Acoustic Correlates ◽

Noise Ratio

The goal of this study was to analyse perceptually and acoustically the voices of patients with Unilateral Vocal Fold Paralysis (UVFP) and compare them to the voices of normal subjects. These voices were analysed perceptually with the GRBAS scale and acoustically using the following parameters: mean fundamental frequency (F0), standard deviation of F0, jitter (ppq5), shimmer (apq11), mean harmonics-to-noise ratio (HNR), mean first (F1) and second (F2) formant frequencies, and standard deviation of F1 and F2 frequencies. Statistically significant differences were found in all of the perceptual parameters. The jitter, shimmer, HNR, standard deviation of F0, and standard deviation of the frequency of F2 were also statistically different between groups, for both genders. In the male data, differences were also found in F1 and F2 frequency values and in the standard deviation of the frequency of F1. This study allowed the documentation of the alterations resulting from UVFP and addressed the exploration of parameters with limited information for this pathology.
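As an illustration of the perturbation measures listed in this abstract, the sketch below computes a k-point perturbation quotient: the mean absolute difference between each value and the average of its k-point neighbourhood, divided by the overall mean. With k = 5 on glottal periods this corresponds to the usual description of ppq5, and with k = 11 on peak amplitudes to apq11. This is an assumed textbook formulation, not the study's own software; the function name and the sample data are hypothetical.

```python
from statistics import mean


def perturbation_quotient(values, window):
    """k-point perturbation quotient of a sequence of periods or amplitudes.

    For every value that has a full centred window, take the absolute
    difference between the value and the mean of that window; average
    those differences and divide by the mean of all values.
    """
    if window % 2 == 0 or window < 3:
        raise ValueError("window must be an odd number >= 3")
    if len(values) < window:
        raise ValueError("need at least `window` values")
    half = window // 2
    deviations = [
        abs(values[i] - mean(values[i - half:i + half + 1]))
        for i in range(half, len(values) - half)
    ]
    return mean(deviations) / mean(values)


# Hypothetical glottal periods in seconds (roughly 100 Hz phonation).
periods = [0.0100, 0.0101, 0.0099, 0.0100, 0.0102, 0.0098, 0.0100]
print(f"ppq5 = {100 * perturbation_quotient(periods, 5):.3f} %")
```

The same function applied to a list of per-cycle peak amplitudes with `window=11` gives an apq11-style shimmer value.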


Timbre and Affect Dimensions: Evidence from Affect and Similarity Ratings and Acoustic Correlates of Isolated Instrument Sounds

Music Perception: An Interdisciplinary Journal ◽

10.1525/mp.2012.30.1.49 ◽

2012 ◽

Vol 30 (1) ◽

pp. 49-70 ◽

Cited By ~ 24

Author(s):

Tuomas Eerola ◽

Rafael Ferrer ◽

Vinoo Alluri

Keyword(s):

High Frequency ◽

Sound Production ◽

Three Dimensional ◽

Low Frequency ◽

Structural Features ◽

Two Dimensions ◽

Acoustic Features ◽

Behavioral Experiments ◽

Acoustic Correlates

Considerable effort has been made towards understanding how acoustic and structural features contribute to emotional expression in music, but relatively little attention has been paid to the role of timbre in this process. Our aim was to investigate the role of timbre in the perception of affect dimensions in isolated musical sounds, by way of three behavioral experiments. In Experiment 1, participants evaluated perceived affects of 110 instrument sounds that were equal in duration, pitch, and dynamics using a three-dimensional affect model (valence, energy arousal, and tension arousal) and preference and emotional intensity. In Experiment 2, an emotional dissimilarity task was applied to a subset of the instrument sounds used in Experiment 1 to better reveal the underlying affect structure. In Experiment 3, the perceived affect dimensions as well as preference and intensity of a new set of 105 instrument sounds were rated by participants. These sounds were also uniform in pitch, duration, and playback dynamics but contained systematic manipulations in the dynamics of sound production, articulation, and ratio of high-frequency to low-frequency energy. The affect dimensions for all the experiments were then explained in terms of the three kinds of acoustic features extracted: spectral (e.g., ratio of high-frequency to low-frequency energy), temporal (e.g., attack slope), and spectro-temporal (e.g., spectral flux). High agreement among the participants' ratings across the experiments suggested that even isolated instrument sounds contain cues that indicate affective expression, and these are recognized as such by the listeners. A dominant portion (50-57%) of the two dimensions of affect (valence and energy arousal) could be predicted by linear combinations of a few acoustic features such as ratio of high-frequency to low-frequency energy, attack slope, and spectral regularity. Links between these features and those observed in the vocal expression of affects and other sound phenomena are discussed.


The usefulness of the acoustic and the capacity analysis of singing voice

Otolaryngologia Polska ◽

10.5604/01.3001.0013.1534 ◽

2019 ◽

Vol 73 (3) ◽

pp. 16-25

Author(s):

Joanna Nowosielska-Grygiel ◽

Jurek Olszewski

Keyword(s):

Treatment Group ◽

Fundamental Frequency ◽

Voice Quality ◽

Control Group ◽

Capacity Analysis ◽

Efficiency Coefficient ◽

Singing Voice ◽

Average Value ◽

The Military ◽

Relative Period

Introduction: The aim of the study was to assess the acoustic and capacity analysis of singing voice using DiagnoScope Specialist software. Material and methods: The study was conducted in 120 adult subjects, including 74 women and 46 men aged 21-5, divided into 3 groups: I: 40 subjects (treatment group), professional vocalists; II: 40 subjects (treatment group), semiprofessional vocalists; III: 40 subjects (control group), nonsingers who were students of The Military Medical Faculty at Medical University of Lodz. The research methodology included: primary medical history, physical (otolaryngological) examination, videolaryngoscopic examination, the GRBAS scale for subjective voice evaluation, diagnostic voice acoustic and capacity analysis using DiagnoScope Specialist software, and a survey on lifestyle patterns which may affect voice quality. Results: The average fundamental frequency (F0) was highest in the professional vocalists' group: 316.46 Hz in women and 165.09 Hz in men; in the semiprofessional vocalists' group it was 260.50 Hz and 149.26 Hz, respectively, and in the nonsingers' group 261.23 Hz and 159.27 Hz. The average Jitter value in the professional vocalists' group was 0.30% in women and 0.54% in men; in the semiprofessional vocalists' group it was 0.31% and 0.57%, respectively, and in the nonsingers' group 0.31% and 0.56%. The average Shimmer value in the professional vocalists' group was 3.27% in women and 3.75% in men; in the semiprofessional vocalists' group it was 3.46% and 3.77%, respectively, and in the nonsingers' group 4.33% and 4.39%. The average NHR index in the professional vocalists' group was 3.28% in women and 6.00% in men; in the semiprofessional vocalists' group it was 3.23% and 6.72%, respectively, and in the nonsingers' group 3.89% and 6.13%.
Conclusions: Values of the parameters measuring the character of the voice, relative period-to-period fundamental frequency perturbation, relative period-to-period amplitude perturbation, and level of buzzing, together with other methods, have diagnostic and predictive value in the early detection of voice disorders. Capacity analysis in singing voice showed very low values of the following parameters: phonation time, true phonation time, no phonation coefficient, voice efficiency coefficient and voice capacity. Key words: The acoustic and capacity analysis, singing voice


Temporal and Acoustic Correlates in Vocal Exchange of Coo Calls in Japanese Macaques

Behaviour ◽

10.1163/156853993x00588 ◽

1993 ◽

Vol 124 (3-4) ◽

pp. 207-225 ◽

Cited By ~ 47

Author(s):

Hideki Sugiura

Keyword(s):

Acoustic Analysis ◽

Short Interval ◽

Japanese Macaques ◽

Acoustic Features ◽

Acoustic Parameters ◽

Acoustic Correlates ◽

Different Types ◽

Female Japanese Macaques ◽

Two Populations ◽

Do So

Vocal exchanges of coo calls in female Japanese macaques were observed in two populations. Temporal patterns of occurrence of these vocalizations during vocal interaction were studied by analyzing inter-call intervals between two consecutive coos. When the second call was uttered by a different caller from the first (DC sequence), most of the second calls occurred at intervals shorter than approximately 0.8 sec, and the remaining ones at intervals longer than approximately 0.8 sec. These results indicate that the second coos of DC sequence are of two different types, and that second calls separated by short intervals may occur in response to the first call, whereas second calls separated by a relatively longer interval may occur independently of the preceding call. When an animal responds to a coo given by another, she appears to do so within a certain period of time. When two consecutive calls were uttered by the same caller (SC sequence), the second coos rarely occurred within 0.8 sec, but mostly at longer intervals. These results suggest that when an animal utters a coo spontaneously, she remains silent for a short interval and, when no response occurs, she is likely to give further coos addressing group members. Acoustic analysis of two consecutive coos in DC sequence showed that the second coos given by a different caller and occurring within 0.8 sec significantly correlated with those of first coos with respect to several acoustic parameters. This correlation was not observed when coos occurred at intervals of more than 0.8 sec. This phenomenon was confirmed by a playback experiment. The results suggest the possibility that an animal is able to alter acoustic features of responding coos and matches those of preceding coos.
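The interval-based classification described in this abstract can be sketched as follows. The sketch assumes each call is reduced to a caller label and a single timestamp, and that the approximate 0.8 sec boundary is applied as a hard threshold; the abstract does not specify how intervals were measured, and the `Call` type, function name, and sample data are hypothetical.

```python
from dataclasses import dataclass

# Approximate boundary reported in the abstract, used here as a hard cutoff.
THRESHOLD_S = 0.8


@dataclass
class Call:
    caller: str    # identity of the animal that uttered the coo
    time_s: float  # timestamp of the call in seconds (assumed single point)


def classify_pairs(calls, threshold_s=THRESHOLD_S):
    """Label each pair of consecutive calls.

    Returns (sequence_type, interval_class, interval) tuples, where
    sequence_type is "DC" (different caller) or "SC" (same caller) and
    interval_class is "short" (below threshold) or "long".
    """
    labelled = []
    for first, second in zip(calls, calls[1:]):
        interval = second.time_s - first.time_s
        sequence_type = "SC" if first.caller == second.caller else "DC"
        interval_class = "short" if interval < threshold_s else "long"
        labelled.append((sequence_type, interval_class, interval))
    return labelled


# Hypothetical observation: B answers A quickly, then B calls again later.
observed = [Call("A", 0.0), Call("B", 0.5), Call("B", 2.5)]
for row in classify_pairs(observed):
    print(row)
```

Under the abstract's interpretation, "DC"/"short" pairs are candidate responses to the preceding coo, while "long" pairs are treated as independent calls.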


Acoustic Determinants Eliciting Japanese Infants' Vocal Response to Maternal Speech

Psychological Reports ◽

10.2466/pr0.2002.90.1.83 ◽

2002 ◽

Vol 90 (1) ◽

pp. 83-90 ◽

Cited By ~ 8

Author(s):

Katsuko Niwano ◽

Kuniaki Sugai

Keyword(s):

Fundamental Frequency ◽

Vocal Response ◽

Acoustic Features ◽

Significant Difference ◽

Custom Software ◽

The Mean ◽

Maternal Speech ◽

Japanese Infants

Generally, infants prefer infant-directed speech to adult-directed speech. This study investigated which acoustic features of maternal infant-directed speech effectively elicit 3-month-old infants' vocal responses. The participants were 40 Japanese mother-infant dyads. Vocal f0 from the mother's speech and the infant's vocalization was extracted using Computerized Speech Laboratory (CSL4300) and custom software. The acoustic features measured were mean fundamental frequency (f0) and f0 contour. The rate of the infant's vocal response was significantly higher when the maternal infant-directed speech ended with a falling contour rather than a rising or flat contour. There was no significant difference in mean f0 between maternal infant-directed speech that was followed by the infant's vocal response and speech that was not. This suggests that a falling terminal contour in maternal infant-directed speech serves to elicit the 3-month-old infant's vocal response.
