scholarly journals Ecological validity of masking release with speech-like stimuli

2021 ◽  
Author(s):  
Hyojin Kim ◽  
Viktorija Ratkute ◽  
Bastian Epp

Comodulated masking noise and binaural cues can facilitate detecting a target sound from noise. These cues can induce a decrease in detection thresholds, quantified as comodulation masking release (CMR) and binaural masking level difference (BMLD), respectively. However, their relevance to speech perception is unclear as most studies have used artificial stimuli different from speech. Here, we investigated their ecological validity using sounds with speech-like spectro-temporal dynamics. We evaluated the ecological validity of such grouping effect with stimuli reflecting formant changes in speech. We set three masker bands at formant frequencies F1, F2, and F3 based on CV combination: /gu/, /fu/, and /pu/. We found that the CMR was little (< 3 dB) while BMLD was comparable to previous findings (~ 9 dB). In conclusion, we suggest that other features may play a role in facilitating frequency grouping by comodulation such as the spectral proximity and the number of masker bands.

2021 ◽  
Author(s):  
Hyojin Kim ◽  
Viktorija Ratkute ◽  
Bastian Epp

When a target tone is preceded by a noise, the threshold for target detection can be increased or decreased depending on the type of a preceding masker. The effect of preceding masker to the following sound can be interpreted as either the result of adaptation at the periphery or at the system level. To disentangle these, we investigated the time constant of adaptation by varying the length of the preceding masker. For inducing various masking conditions, we designed stimuli that can induce masking release. Comodulated masking noise and binaural cues can facilitate detecting a target sound from noise. These cues induce a decrease in detection thresholds, quantified as comodulation masking release (CMR) and binaural masking level difference (BMLD), respectively. We hypothesized that if the adaptation results from the top-down processing, both CMR and BMLD will be affected with increased length of the preceding masker. We measured CMR and BMLD when the length of preceding maskers varied from 0 (no preceding masker) to 500 ms. Results showed that CMR was more affected with longer preceding masker from 100 ms to 500 ms while the preceding masker did not affect BMLD. In this study, we suggest that the adaptation to preceding masking sound may arise from low level (e.g. cochlear nucleus, CN) rather than the temporal integration by the higher-level processing.


Acta Acustica ◽  
2021 ◽  
Vol 5 ◽  
pp. 60
Author(s):  
Mathias Dietz ◽  
Jörg Encke ◽  
Kristin I Bracklo ◽  
Stephan D Ewert

Differences between the interaural phase of a noise and a target tone improve detection thresholds. The maximum masking release is obtained for detecting an antiphasic tone (Sπ) in diotic noise (N0). It has been shown in several studies that this benefit gradually declines as an interaural time delay (ITD) is applied to the noise. This decline has been attributed to the reduced interaural coherence of the noise. Here, we report detection thresholds for a 500 Hz tone in masking noise with ITDs up to 8 ms and bandwidths from 25 to 1000 Hz. Reducing the noise bandwidth from 100 to 50 and 25 Hz increased the masking release for 8-ms ITD, as expected for increasing temporal coherence with decreasing bandwidth. For bandwidths of 100–1000 Hz no significant difference in masking release was observed. Detection thresholds with these wider-band noises had an ITD dependence that is fully described by the temporal coherence imposed by the typical monaurally determined auditory-filter bandwidth. A binaural model based on interaural phase-difference fluctuations accounts for the data without using delay lines.


2013 ◽  
Vol 22 (1) ◽  
pp. 135-146 ◽  
Author(s):  
Su-Hyun Jin ◽  
Yingjiu Nie ◽  
Peggy Nelson

Purpose To examine the effects of temporal and spectral interference of masking noise on sentence recognition for listeners with cochlear implants (CI) and normal-hearing persons listening to vocoded signals that simulate signals processed through a CI (NH-Sim). Method NH-Sim and CI listeners participated in the experiments using speech and noise that were processed by bandpass filters. Depending on the experimental condition, the spectra of the maskers relative to that of speech were set to be completely embedded with, partially overlapping, or completely separate from, the speech. The maskers were either steady or amplitude modulated and were presented at +10 dB signal-to-noise ratio. Results NH-Sim listeners experienced progressively more masking as the masker became more spectrally overlapping with speech, whereas CI listeners experienced masking even when the masker was spectrally remote from the speech signal. Both the NH-Sim and CI listeners experienced significant modulation interference when noise was modulated at a syllabic rate (4 Hz), suggesting that listeners may experience both modulation interference and masking release. Thus, modulated noise has mixed and counteracting effects on speech perception. Conclusion When the NH-Sim and CI listeners with poor spectral resolution were tested using syllabic-like rates of modulated noise, they tended to integrate or confuse the noise with the speech, causing an increase in speech errors. Optional training programs might be useful for CI listeners who show more difficulty understanding speech in noise.


1990 ◽  
Vol 33 (1) ◽  
pp. 96-102 ◽  
Author(s):  
Kathleen Veloso ◽  
Joseph W. Hall ◽  
John H. Grose

Frequency selectivity and comodulation masking release (CMR) for a 1000-Hz signal frequency were examined in 6-year-old children and adults. An abbreviated measure of frequency selectivity was also conducted for a 500-Hz signal. Frequency selectivity was measured using a notched-noise masking method, and CMR was measured using narrow bands of noise whose amplitude envelopes were either uncorrelated or correlated. There were 6 listeners in each age group. No differences were observed between the adults and children for either auditory measure. Similarly, no differences were observed in the ability to detect a pure-tone signal in a relatively wideband noise masker. When the masking noise was narrowband, however, the masked thresholds of the children were higher than those of the adults. Two characteristics that distinguish narrowband noise from wideband noise are: (1) narrowband noise has a pitch quality corresponding to its center frequency, whereas wideband noise does not have a definite pitch; (2) the intensity fluctuations are relatively greater in narrowband noise than in wideband noise. This may suggest that 6-year-old children have a reduced ability to detect signals in noise backgrounds where the signal has perceptual qualities similar to the noise, or in noise backgrounds having a high degree of fluctuation.


2015 ◽  
Vol 138 (2) ◽  
pp. 1194-1205
Author(s):  
Ramona Grzeschik ◽  
Björn Lübken ◽  
Jesko L. Verhey

2011 ◽  
Vol 129 (5) ◽  
pp. 3181-3193 ◽  
Author(s):  
Simon A. Goldman ◽  
Thomas Baer ◽  
Brian C. J. Moore

Sign in / Sign up

Export Citation Format

Share Document