Evaluation of Objective Quality Measures for Speech Enhancement

This paper presents a method for speech enhancement to predict speech quality in presence of highly non-stationary scenarios using basic wiener filtering in frequency domain with an adaptive gain function under eight different noises at three different ranges of input SNR. Its performance is evaluated in terms of objective quality measures like LPC based spectral distortion measures are Cepstrum Distance, Itakura Saito and Log Likelihood Ratio. This method was tested using Noizeous database, its performance measures were compared against spectral subtractive type algorithms and it shows its improvements in terms of objective quality measures.

Download Full-text

An Overview of Subjective and Objective Quality Measures for Noisy Speech Enhancement Algorithms

IETE Technical Review ◽

10.4103/0256-4602.83550 ◽

2011 ◽

Vol 28 (4) ◽

pp. 292 ◽

Cited By ~ 7

Author(s):

P Krishnamoorthy

Keyword(s):

Speech Enhancement ◽

Quality Measures ◽

Noisy Speech ◽

Objective Quality

Download Full-text

Performance on Speech Enhancement Objective Quality Measures Using Hybrid Wavelet Thresholding

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f9343.088619 ◽

2019 ◽

Vol 8 (6) ◽

pp. 3523-3533

Keyword(s):

Speech Enhancement ◽

Speech Signal ◽

Quality Measures ◽

Wavelet Thresholding ◽

Wavelet Coefficients ◽

Hard Thresholding ◽

Improvement Set ◽

Improvement Sets ◽

Objective Quality

The quality of being easily understandable of the spe ech signals are very importantin communication and other speec h related systems. In order to improve these two in thespeech sign al, Speech improvement sets of computer instructions and devices are used so that itmay be better fully used by other speech proces sing setsof computer instructions. Most of the speech communica tion that requires atleast one microphone and the desired speech signal is usually contaminated by backgroundnoise and echo. As a result, the speech sign must be "cleaned" with advanced sign preparing devices before it is played out, transmitted, or put away. In this venture it has been investigated the required things and degree of upgrades in the field of discourse improvement utilizing discourse de-noising sets of PC directions announced in books with the fundamental intend to concentrate on the utilization of the window shape limits/rules in STSA based Speech Improvement process in which the sign destroyed by commotion is into edges and each part/segment is Windowed and the Windowed Speech pieces/parts zone connected to the Speech Improvement set of PC guidelines and the Improved Speech sign is modified in its time area. In general, the Speech Improvement methods make use theHam ming Window for this purpose. In this work an attempt has been made to study the effect of Window shape on the Speech. The Modified Improved thresholding is proposed by Asser Ghanbari and Mohammad Reza Karami and can be used like a hard thresholding limit with respect to the wavelet coefficients through and through worth progressively conspicuous than limit esteem and resembles an exponential capacity for the wavelet coefficients supreme worth not as much as edge esteem and is characterized.

Download Full-text

Multi Band Spectral Subtraction for Speech Enhancement with Different Frequency Spacing Methods and their Effect on Objective Quality Measures

International Journal of Image Graphics and Signal Processing ◽

10.5815/ijigsp.2019.05.06 ◽

2019 ◽

Vol 11 (5) ◽

pp. 54-62

Author(s):

P. Sunitha ◽

◽

K.Satya Prasad

Keyword(s):

Speech Enhancement ◽

Quality Measures ◽

Spectral Subtraction ◽

Frequency Spacing ◽

Objective Quality ◽

Multi Band

Download Full-text

Two-level optimization approach with accelerated proximal gradient for objective measures in sparse speech reconstruction

Journal of Industrial & Management Optimization ◽

10.3934/jimo.2021131 ◽

2021 ◽

Vol 0 (0) ◽

pp. 0

Author(s):

Hai Huyen Dam ◽

Siow Yong Low ◽

Sven Nordholm

Keyword(s):

Speech Enhancement ◽

Quality Measures ◽

Optimization Method ◽

Optimization Approach ◽

Reconstruction Process ◽

Time Frequency ◽

Score Improvement ◽

Frequency Representation ◽

Objective Quality ◽

Speech Reconstruction

<p style='text-indent:20px;'>Compressive speech enhancement makes use of the sparseness of speech and the non-sparseness of noise in time-frequency representation to perform speech enhancement. However, reconstructing the sparsest output may not necessarily translate to a good enhanced speech signal as speech distortion may be at risk. This paper proposes a two level optimization approach to incorporate objective quality measures in compressive speech enhancement. The proposed method combines the accelerated proximal gradient approach and a global one dimensional optimization method to solve the sparse reconstruction. By incorporating objective quality measures in the optimization process, the reconstructed output is not only sparse but also maintains the highest objective quality score possible. In other words, the sparse speech reconstruction process is now quality sparse speech reconstruction. Experimental results in a compressive speech enhancement consistently show score improvement in objectives measures in different noisy environments compared to the non-optimized method. Additionally, the proposed optimization yields a higher convergence rate with a lower computational complexity compared to the existing methods.</p>

Download Full-text

Wiener Gain and Deep Neural Networks: A Well-Balanced Pair For Speech Enhancement

10.21203/rs.3.rs-900751/v1 ◽

2021 ◽

Author(s):

Dayana Ribas ◽

Antonio Miguel ◽

Alfonso Ortega ◽

Eduardo Lleida

Keyword(s):

Neural Network ◽

Speech Enhancement ◽

Deep Neural Network ◽

Deep Neural Networks ◽

State Of The Art ◽

Quality Measures ◽

Spectral Amplitude ◽

Gain Function ◽

Estimation Algorithms ◽

Objective Quality

Abstract This paper proposes a Deep Neural Network (DNN)-based Wiener gain estimator for speech enhancement. The proposal is in the framework of the classical spectral-domain speech enhancement algorithms. In this case, we used the Optimal Modified Log-Spectral Amplitude (OMLSA), but consider that this proposal could fit many alternative speech estimation algorithms. We determined the best usage of the DNN approach at learning a robust instance of the Wiener gain estimator according to the characteristics of the SNR estimation and the gain function. To design a DNN architecture adjusted for the speech enhancement task, we study various configuration issues frequently used in DNN-based solutions, including speech representations, residual connections, and causal vs. non-causal designs. Thus, we provide conclusions for the use of DNN architectures with the enhancement purpose. Experiments show that the proposal provides results on the state-of-the-art. But beyond the objective quality measures, there are examples of noisy vs. enhanced speech available for listening to demonstrate in practice the skills of the method in real audio.

Download Full-text